xinyue emma) li · • relevant courses: predictive analytics, data mining, data visualization,...

1
Xinyue (Emma) Li Emeryville CA 94608 · (530) 564-2418 · xinyueli2018@u.northwestern.edu EDUCATION Northwestern University Evanston, IL MS in Analytics, GPA: 3.80/4.00 Expected 12/2018 Relevant Courses: Predictive Analytics, Data Mining, Data Visualization, Analytics for Big Data (Hadoop/Spark/Hive), Database Design and Information Retrieval, A/B Testing University of California, Davis Davis, CA B.S. Applied Statistics, GPA: 3.91/4.00; B.S. Managerial Economics, GPA: 3.76/4.00 06/2017 SKILLS R, Python (Pandas, NumPy, Scikit-learn, Keras), SQL (MySQL, Hive, Netezza), Spark, Hadoop, Java, D3.js, HTML/CSS, Tableau, AWS, SAS, C, Stata, Matlab WORKING EXPERIENCE TransUnion Chicago, IL Data Science Intern 06/2018 09/2018 Constructed risk score models to review personal loan applications using various methods such as tree-based models (XGBoost, Gradient Boosting, C5.0, Random Forest), SVM and Artificial Neural Networks Researched and implemented methods of variable interpretation in Neural Networks for adverse selection Performed quantitative analysis on 1B+ trades records to identify the customerscapacity to absorb ongoing credit products as the interest rate increases, which improved its cycle readiness through early identification of a shift in consumers’ debt Graduate Analytics Consultant 09/2017 - 06/2018 Created graph components and generated features on 500k+ credit accounts with shared identity information Trained Boosting Trees with XGBoost to identify the fraudulent accounts and achieved 95.9% precision Researched and applied a Convolutional Neural Network using Graph Kernels to improve the fraud detection performance Detected undiscovered suspicious accounts and improved the previous model by 25% Agricultural Issues Center Davis, CA Undergraduate Researcher 07/2016 - 06/2017 Analyzed the effect of the legalization on Cannabis price in California through Hypothesis Testing in R Performed exploratory data analysis and analyzed and researched reasons for price change across different agricultural commodities through Functional Principal Component Analysis on the 1995-2015 California agricultural exports Standard Chartered Bank Shanghai Intern 08/2016 - 09/2016 Predicted key indicators (revenue growth, EBITDA, taxable profit, etc) of the client to ensure liquidity for debt issuance Explored potential collaboration opportunities between Standard Chartered Bank and Alibaba through analyzing Alibabas operational model and cash flow PROJECTS Predictive Modeling on Clothing Sales 10/2017 - 12/2017 Cleaned data inconsistencies and imputed missing values with MICE algorithm and KNN methods Generated features measuring the recency and frequency of consumerspurchasing behaviors Evaluated the efficacy of catalog-driven marketing through predicting customers’ future purchases with stacking Logistic Regression and Multiple Linear Regression models Estimated the expected profit to assist the company’s marketing strategy decision Gaming Analytics on Destiny II 01/2018 - 06/2018 Designed a Player versus Player recommendation system framework based on team play to improve team performance Implemented clustering analysis through K-means, GMM, Archetype Analysis on 16M+ matches from Destiny II to create player profiles Produced team profiles accordingly and provided recommendations via K-Nearest Neighbor method Submitted paper to AIIDE(Artificial Intelligence and Interactive Digital Entertainment Conference) 2018 Venmo Transaction Study 04/2018 - 06/2018 Conducted quantitative analysis with effective visualizations on 7M+ transactions via PySpark and SparkSQL to summarize Venmo's social network Analyzed different emoji use patterns in various time frames to learn users' spending habits using RDD and Spark data frame Clustered the transaction messages with PySpark using text-based attributes to improve the text classification algorithm in each segmentation ACTIVITIES AND LEADERSHIP Vice President of Career Development Department at CSSA Davis, CA 01/2016 - 06/2017 Academic Coordinator at UC Davis Statistics Club Davis, CA 06/2015 - 12/2016 Volunteer at NYBL Foundation of America Sacramento, CA 01/2015 - 03/2016

Upload: others

Post on 03-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Xinyue Emma) Li · • Relevant Courses: Predictive Analytics, Data Mining, Data Visualization, Analytics for Big Data (Hadoop/Spark/Hive), Database Design and Information Retrieval,

Xinyue (Emma) Li Emeryville CA 94608 · (530) 564-2418 · [email protected]

EDUCATION Northwestern University Evanston, IL MS in Analytics, GPA: 3.80/4.00 Expected 12/2018 • Relevant Courses: Predictive Analytics, Data Mining, Data Visualization, Analytics for Big Data (Hadoop/Spark/Hive),

Database Design and Information Retrieval, A/B Testing University of California, Davis Davis, CA B.S. Applied Statistics, GPA: 3.91/4.00; B.S. Managerial Economics, GPA: 3.76/4.00 06/2017 SKILLS R, Python (Pandas, NumPy, Scikit-learn, Keras), SQL (MySQL, Hive, Netezza), Spark, Hadoop, Java, D3.js, HTML/CSS, Tableau, AWS, SAS, C, Stata, Matlab WORKING EXPERIENCE TransUnion Chicago, IL Data Science Intern 06/2018 – 09/2018 • Constructed risk score models to review personal loan applications using various methods such as tree-based models

(XGBoost, Gradient Boosting, C5.0, Random Forest), SVM and Artificial Neural Networks • Researched and implemented methods of variable interpretation in Neural Networks for adverse selection • Performed quantitative analysis on 1B+ trades records to identify the customers’ capacity to absorb ongoing credit products

as the interest rate increases, which improved its cycle readiness through early identification of a shift in consumers’ debt Graduate Analytics Consultant 09/2017 - 06/2018 • Created graph components and generated features on 500k+ credit accounts with shared identity information • Trained Boosting Trees with XGBoost to identify the fraudulent accounts and achieved 95.9% precision • Researched and applied a Convolutional Neural Network using Graph Kernels to improve the fraud detection performance • Detected undiscovered suspicious accounts and improved the previous model by 25% Agricultural Issues Center Davis, CA Undergraduate Researcher 07/2016 - 06/2017 • Analyzed the effect of the legalization on Cannabis price in California through Hypothesis Testing in R • Performed exploratory data analysis and analyzed and researched reasons for price change across different agricultural

commodities through Functional Principal Component Analysis on the 1995-2015 California agricultural exports Standard Chartered Bank Shanghai Intern 08/2016 - 09/2016 • Predicted key indicators (revenue growth, EBITDA, taxable profit, etc) of the client to ensure liquidity for debt issuance • Explored potential collaboration opportunities between Standard Chartered Bank and Alibaba through analyzing Alibaba’s

operational model and cash flow

PROJECTS Predictive Modeling on Clothing Sales 10/2017 - 12/2017 • Cleaned data inconsistencies and imputed missing values with MICE algorithm and KNN methods • Generated features measuring the recency and frequency of consumers’ purchasing behaviors • Evaluated the efficacy of catalog-driven marketing through predicting customers’ future purchases with stacking Logistic

Regression and Multiple Linear Regression models • Estimated the expected profit to assist the company’s marketing strategy decision Gaming Analytics on Destiny II 01/2018 - 06/2018 • Designed a Player versus Player recommendation system framework based on team play to improve team performance • Implemented clustering analysis through K-means, GMM, Archetype Analysis on 16M+ matches from Destiny II to create

player profiles • Produced team profiles accordingly and provided recommendations via K-Nearest Neighbor method • Submitted paper to AIIDE(Artificial Intelligence and Interactive Digital Entertainment Conference) 2018 Venmo Transaction Study 04/2018 - 06/2018 • Conducted quantitative analysis with effective visualizations on 7M+ transactions via PySpark and SparkSQL to summarize

Venmo's social network • Analyzed different emoji use patterns in various time frames to learn users' spending habits using RDD and Spark data frame • Clustered the transaction messages with PySpark using text-based attributes to improve the text classification algorithm in

each segmentation ACTIVITIES AND LEADERSHIP • Vice President of Career Development Department at CSSA – Davis, CA 01/2016 - 06/2017 • Academic Coordinator at UC Davis Statistics Club – Davis, CA 06/2015 - 12/2016 • Volunteer at NYBL Foundation of America – Sacramento, CA 01/2015 - 03/2016