
Me as a UC Davis DSAC member
Directors’ Student Advisory Council
Bridge the Gap
Drive business value through data analysis.
3+ years of experience in analytics and BI, using SQL and Tableau to deliver actionable business insights.
2+ years of experience with Python and R for ETL (extract, transform, load), statistical analysis and ML.
Design idea credited to @Adham Dannaway
#MySQL, #PyMySQL
#PySpark
#Hive
#MongoDB, #MongoClient
#ETL - Extract, Transform, Load:
(Python:numpy,pandas; R:dplyr)
#Web Scraping:
(Python:BeautifulSoup; R:rvest; Java:jsoup)
#Regular Expression (Python:re)
#Python (Matplotlib, Seaborn)
#R (ggplot2)
#Tableau Desktop Certified Associate
#AWS Certified Practitioner (EC2, S3, Lambda)
#GCP (AutoML)
#Salesforce (Einstein)
#Regression
#Classification:
(LogisticRegression, SVM, RandomForest)
#Clustering
#NLP:
(Sentiment Analysis, Topic Modeling, Word2Vec)
#A/B Testing
#DOE (Design of Experiments)
#Retention Analysis
A holistic summary of the most common syntax and functions of the "re" module in Python, with hands-on regex practice for ETL (Extract, Transform, Load).
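As a small taste of regex-driven extraction, here is a minimal sketch using only the standard "re" module. The sample lines, the pattern, and the name parse_orders are illustrative assumptions and are not taken from the summary itself.

```python
import re

# Hypothetical raw records; this format is assumed purely for illustration.
RAW_LINES = [
    "2020-03-01 | order #1042 | total: $19.99",
    "2020-03-02 | order #1043 | total: $5.00",
    "malformed line without fields",
]

# Named groups make the extracted fields self-describing.
ORDER_PATTERN = re.compile(
    r"(?P<date>\d{4}-\d{2}-\d{2})\s*\|\s*"
    r"order #(?P<order_id>\d+)\s*\|\s*"
    r"total: \$(?P<total>\d+\.\d{2})"
)


def parse_orders(lines):
    """Extract (date, order_id, total) records, skipping lines that do not match."""
    records = []
    for line in lines:
        match = ORDER_PATTERN.search(line)
        if match is None:
            continue  # Transform step: drop rows that cannot be parsed
        records.append(
            {
                "date": match.group("date"),
                "order_id": int(match.group("order_id")),
                "total": float(match.group("total")),
            }
        )
    return records


if __name__ == "__main__":
    for record in parse_orders(RAW_LINES):
        print(record)
```

Compiling the pattern once and using named groups keeps the extract step readable when the same pattern is applied to many lines.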
Curious about the differences between sponsored and non-sponsored items on eBay?
Check out this Python script, which scrapes the listings and stores the information in a database via "BeautifulSoup" and "PyMySQL".
Fetch real-time headlines and, to keep things quick and simple, the first three lines of the top stories in Python (BeautifulSoup, tokenize), R (rvest, dplyr, stringr), or Java (jsoup, BreakIterator).
Isn't it cool to have your own local movie database to track all the movies you have watched and want to watch? Check out the script that uses the OMDb API in either Python (json, PyMySQL) or Java (gson, java.sql).
This project handles unstructured big data storage and manipulation (Map-Reduce) in MongoDB using MongoClient in Python. The data can also be transformed and transferred from MongoDB to a SQL database.
Designing promotion bundles? Want to know which products to recommend next? Check out this recommender system, which generates frequent itemsets and association rules via the "Apriori" algorithm in Python.
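For readers new to the idea, the Apriori approach can be sketched in plain Python as follows. This is not the project's code: the baskets, thresholds, and function names are hypothetical, and a real project might well rely on a library instead.

```python
from itertools import combinations

# Hypothetical shopping baskets, assumed for illustration only.
TRANSACTIONS = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in the itemset."""
    return sum(itemset <= t for t in transactions) / len(transactions)


def frequent_itemsets(transactions, min_support):
    """Return {itemset: support} for all itemsets meeting min_support."""
    items = {item for t in transactions for item in t}
    current = {
        frozenset([i])
        for i in items
        if support(frozenset([i]), transactions) >= min_support
    }
    frequent = {}
    k = 1
    while current:
        for itemset in current:
            frequent[itemset] = support(itemset, transactions)
        k += 1
        # Join step: merge frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c
            for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
        current = {c for c in candidates if support(c, transactions) >= min_support}
    return frequent


def association_rules(frequent, min_confidence):
    """Return (antecedent, consequent, confidence) tuples meeting min_confidence."""
    rules = []
    for itemset, itemset_support in frequent.items():
        if len(itemset) < 2:
            continue
        for size in range(1, len(itemset)):
            for antecedent in map(frozenset, combinations(itemset, size)):
                confidence = itemset_support / frequent[antecedent]
                if confidence >= min_confidence:
                    rules.append((antecedent, itemset - antecedent, confidence))
    return rules


if __name__ == "__main__":
    freq = frequent_itemsets(TRANSACTIONS, min_support=0.6)
    for antecedent, consequent, conf in association_rules(freq, min_confidence=0.8):
        print(set(antecedent), "->", set(consequent), f"confidence={conf:.2f}")
```

With these sample baskets and thresholds, the only rule that survives is beer -> diapers: every basket containing beer also contains diapers, which is the kind of signal a "recommend next" feature would use.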
Are rating stars everything? People inevitably apply different standards when giving stars. This project helps you build a more pragmatic and useful rating system based on sentiment analysis of reviews, scored against a uniform standard.
By separately applying LogisticRegression, SVM, and RandomForest classification algorithms with GridSearchCV and recursive feature elimination (RFE), this project reached an accuracy of 90.3% in diabetes prediction.
Using topic modeling, sentiment analysis, and word clouds on New York Times articles, this project shows what is mainly talked about and the media sentiment regarding Joe Biden and Bernie Sanders.
Based on my practicum experience, I summarize three ways to improve team productivity as a team member rather than the leader.
“This is the dawn of the era of data capitalism.” In the age of big data, capitalism has been reinvented. See how my practicum makes good use of data and drives business value for our client.
How do you deliver outcomes that will "WOW" your client even when data is lacking? Check out my four practical suggestions and tips.
As an analyst, I think the art of data science is just like digging for gold. How do you devise a precise and easy-to-understand roadmap, which is the key to the treasure and also the key to data analysis?
A dashboard for risk investors to get a grasp of the most investment-worthy
#startups #quadrant #investments
An #animated dashboard showing how important indicators of countries have been developing over decades. Drill into individual countries via #trailRun and #highlight
An #extension plan dashboard for a laundry startup via #ClusterAnalytics.
#RegionalProfitability #TableauMaps #analytics #Groups