Project #1
Developed an end-to-end data pipeline for large public health datasets, covering ETL, data wrangling, feature engineering, and normalization, then applied EDA and regression modeling to explore links between substance use, personality traits, and demographics. Stack: Python (pandas, numpy, seaborn, matplotlib, statsmodels, patsy), Jupyter Notebook, Excel

