Accomplished Data Scientist with a proven track record at Fidelity Investments, leveraging Python and advanced machine learning techniques to enhance NLP capabilities and develop predictive models. Achieved a 3.2x improvement in savings behavior prediction, driving significant engagement through data-driven insights and effective stakeholder collaboration.
Programming: Python (Pandas, NumPy, Scikit-learn, PyTorch), SQL
Databases: Snowflake, MySQL
Machine Learning: Regression, Decision Trees, Random Forest, XGBoost, LightGBM, PCA
NLP: Text preprocessing, sentiment analysis, sentence embedding models, Cosine Similarity
Model Development: Feature engineering, hyperparameter tuning, ensemble methods
Cloud: AWS (SageMaker, S3, EC2, Lambda)
Visualization & Statistical Analysis: Matplotlib, Seaborn, Tableau, Hypothesis testing (A/B), Causal Inference
Collaboration: Git, GitHub/GitLab, Jira, Confluence