MY STORY

About me

Hiya! Welcome to my website!

WORK Experience

06/2019 - 08/2019

Microsoft (China) | Intern

– Developed a news aggregator using Node.js, with a Redis cluster and MongoDB
– Used Requests and BeautifulSoup to crawl raw data from news and social media feeds, summarized the structural patterns of the web pages, and used regular expressions to extract captions, content, and pictures
– Employed CoreNLPParser, character recognition, and image recognition to improve extraction efficiency and accuracy from 40% to 83%
– Helped team members migrate the original crawler system to a distributed crawler system using Hadoop
– Helped build a Twitter user graph using depth-first search (DFS) to identify key opinion leaders (KOLs) and pull their tweets as news sources

07/2018 - 08/2018

Jisuanke E-learning | Intern

– Led a team of interns to develop an online operation and management system
– Designed and built the MySQL database, used Django for backend development, and developed and debugged modules for registration and login, basic information settings, course purchasing, and more

RESEARCH Experience

12/2020 - 05/2021

Chinese Text Summarization Based on Single Document | Graduation Project

– Focused on abstractive text summarization in Chinese
– Combined a Pointer-Generator Network (a deep learning Seq2Seq model with attention) with reinforcement learning to improve semantic quality and ROUGE scores on several Chinese datasets
– The resulting model increased ROUGE-1, ROUGE-2, and ROUGE-L scores by 6.97%, 15.66%, and 7.73%, respectively

05/2020 - 09/2020

Analysis and Optimization of Business Operation Data | Research Assistant

– Used Pandas to clean and convert raw customer transaction data
– Applied K-means to group transactions by type and Apriori to mine association rules between transaction types
– Gave suggestions, based on the analysis, for increasing customer traffic by 10%

04/2020 - 06/2020

Design and Implementation of a Practice-taking WeChat Mini Program | Group Leader

– Used Scrapy in Python to crawl data from LeetCode, including questions, answers, and categories
– Used JavaScript to implement features such as question search, group discussion, and recommendations
– Won Second Prize in the North China Division of the College Students WeChat Mini Program Development Competition

05/2020 - 09/2020

The Impact Factors and Consequences of Members’ Participation in SME-managed Online Brand Community | Research Assistant

– Constructed the theoretical model and proposed a new hypothesis about the factors influencing brand loyalty
– Designed and revised questionnaires, selected usable samples, and used SPSS to perform factor analysis
– Analyzed the role the new factors played in the online brand community

Education

2021-Present

USC-ECE

- Master of Machine Learning and Data Science

2016-2021

Nankai University

- Bachelor of Software Engineering
- Bachelor of Law (Minor)

Publication

Binhui Wang, Liuliu Chen, Huijing Niu (2018). The Impact Factors and Consequence of Members’ Participation in SME-managed Online Brand Community. 2nd International Conference on Data Science and Business Analytics (ICDSBA 2018).

Activities

09/2017 - 09/2018

Director | Student Activity Department, Student Union, College of Software, NKU

– Planned and organized computer skills training seminars and invited experts in artificial intelligence and machine learning to give lectures to students
– Collected and collated students’ opinions on and requests for on-campus resources, and coordinated with school leaders to formulate improvement strategies
– Assisted the College of Software in earning the honor of “Excellent College for Protection of Students’ Rights and Interests”

01/2018 - 01/2018

International Volunteer, Cambodia

– Taught English and fundamental computer skills at an international charity school, enabling 16 students in my class to use basic software such as Word proficiently
– Obtained the Certificate in International Volunteering

Skills

Language
Python 86%
C++ 80%
SQL 64%
JS & HTML & CSS 50%
Packages
PyTorch & TensorFlow 72%
Pandas & Matplotlib 52%
Scrapy & BeautifulSoup & Requests 90%
NLTK & CoreNLP 76%
Models
CNN, RNN, LSTM... 81%
NLP (Seq2Seq+Attention, Word2Vec...) 77%
Statistics (K-Means, PCA...) 85%

Contact