I am a recent graduate of Boston University and an aspiring data scientist. I double majored in statistics and computer science, giving me a strong theoretical and programming background for developing and evaluating machine learning and statistical models. I enjoy the journey of cleaning and exploring data to make sense of the unknown, especially when it comes to NLP and sports analysis.
Name:
Jeffrey Wu
Age:
22 years
Location:
Vienna, VA, United States, Earth
Sept 2017 - Jan 2019
Grader
– Collaborated with 2 professors and 3 other graders to grade assignments for a
class of 200+ students in a timely manner
– Formulated rubrics for exercises involving discrete mathematics and combinatoric
structures
Boston, MA
Sept 2015 - Jan 2019
Tutor
– Collaborated with 4 peers to design and execute constructive reading and math
activities and lessons in different classroom settings
– Promoted literacy and mathematics development in 3rd through 5th grade
classrooms
– Supported instructors in assessing students’ reading and mathematics proficiency
Boston, MA
|
https://www.bu.edu/seo/students/build/
September 2015 - January 2019
BA in Computer Science and Statistics
Concentration in Statistics
Cum Laude
Cum. GPA: 3.64/4.0
- Collaborated with a teammate to develop and train a decision forest and LSTM
neural network in Python using Tensorflow to recognize whether a given full name
is Brazilian or not
– Achieved an 0.842 accuracy with the decision forest and 0.905 accuracy with the
neural network on a test set of 12,000 names
Tools Used: Python, Tensorflow, Neural Networks, Decision Forest
– Extracted features from IMDb movie reviews using Python’s NLTK, Gensim, and
Scikit-learn libraries
– Implemented k-Means++ algorithm in Python to cluster IMDb movies based on
movie summaries and the content of each movie’s reviews in order to recommend
movies depending on previously liked movies
Tools Used: Python, NTLK, Gensim, Scikit-learn, k-Means++
– Collaborated with a team of 4 to develop and evaluate a linear regression model in
R for predicting daily bike rental counts using Boston bike share and weather data
pulled from 2011
– Identified outliers and significant predictors to optimize the model in R
Tools Used: R, Linear Regression
– Developed a web scraper in Python to pull NBA and college basketball data from
ESPN
– Conducted data preparation using Python and outlier detection using JMP
– Facilitated a team of 3 to develop and evaluate a linear regression model in JMP
for predicting professional basketball efficiency from college team and individual
statistics
Tools Used: Python, BeautifulSoup, JMP