Engineering and Analytics Intern
i2k Connect
Spring & Summer 2024
Engineered data analysis and visualizations to detect trends in topics extracted from news articles. Constructed interactive visualizations in Python using Pandas, NumPy, and Plotly's Dash. Leveraged unsupervised machine learning with Python and Scikit-Learn including clustering and trend detection, large language models for text understanding (GPT-4 and Llama-2 locally), and prompt engineering. Assisted in developing an automated pipeline for constructing a domain-specific training dataset and large language model (LLM). Generated data quality analytics and performance metrics for model evaluation.