
Sydney Anuyah
Data Scientist, Ph.D. Student
Indianapolis, Indiana USA
Available for work
About me
I am Sydney Anuyah, a lover of Jesus, a Data Scientist, and a Ph.D. student studying Data Science in the Human-Centered Computing Department at the Luddy School of Informatics, Computing, and Engineering, Indianapolis, Indiana. I am currently advised by Dr. Sunandan Chakraborty and co-advised by Dr. Arjan Durresi.
Before starting my Ph.D., I earned a master's degree at the same institution, working under the tutelage of Dr. Sunandan Chakraborty and Dr. Davide Bolchini.
My research covers human-AI collaboration, causal mechanisms, causal structures, and large language models. I focus on solving real-world challenges through technology, such as building customized LLM agents that assist scientists with deep research.
I created a Ph.D. reading club that encourages Ph.D. students and candidates to share their research with the public and build confidence in presenting it. I also host LeetCode sessions for Luddy students to teach them practical approaches to coding interview questions.
My hobbies include web design, model training, and heated debates on causality. I also love singing, playing the piano, and spending time with family and friends.
Experience
Fishers, IN
Indiana IOT Laboratory
Jan 2024 - May 2024
Data Science Intern
- Responsible for the installation, operation, and data analytics of a Vision AI platform within an Indianapolis-area manufacturer’s existing operations.
Bellevue, WA
Amazon
May 2023 - Sept 2023
Data Science Intern
- Developed Project Toucan, a computer vision model for Amazon warehouses that saved the company $1.2 million annually per site. Performed extensive ground training and model fine-tuning.
- Created machine learning models that optimized the entire workflow of Amazon fulfillment centers.
- Had my internship extended in recognition of the progress I contributed to the team.
Lagos, Nigeria
Chaka Technologies
July 2021 - June 2022
Data Engineer
- Developed, constructed, tested, and maintained data architectures, aligning these architectures with business requirements.
- Used Python, MSSQL, Mixpanel, Azure, and other tools for data reporting, identifying ways to improve data reliability, efficiency, and quality.
- Created automated pipelines with Apache Airflow, hosted in Docker containers, to run SQL stored procedures automatically.
Education
Indianapolis, IN
2024 - 2028
Ph.D. in Data Science
Indiana University
Deep Learning, Natural Language Processing, Large Language Models, Causality and Correlation, Computer Vision.
Indianapolis, IN
2022 - 2024
M.Sc. in Applied Data Science
Indiana University
Calculus, Distributed Systems (PySpark), Machine Learning, Deep Learning, Reinforcement Learning, Computer Vision, Natural Language Processing, Data Mining, Database Systems.
New Orleans, USA
2021 - 2023
M.Sc. in Financial Engineering
WorldQuant University
Machine Learning in Finance, Portfolio Optimization with Python, Financial Data and Markets, Discrete and Continuous-time Stochastic Processes.
Lagos, Nigeria
2014 - 2019
Bachelor of Science in Electrical and Electronics Engineering
University of Lagos
Engineering Mathematics, Switching and Logic Systems, Computer Programming, Engineering Statistics, Computer Systems.
Tech Stack
Python
Programming Language

R
Programming Language
SQL
Query Language
PyTorch
Deep Learning
PySpark
Machine Learning
Git
Version Control
Docker
Containerization
AWS
Cloud Computing
Tableau
Data Visualization
Excel
Spreadsheet
Scikit-learn
Machine Learning
Hugging Face
Transformers
spaCy
NLP

NLTK
NLP

LangChain
LLM Framework
Airflow
Workflow Orchestration
Flask
Web Framework
LoRA
Fine-tuning
QLoRA
Fine-tuning
PEFT
Fine-tuning