Profile Blog About

Yaroslav Kopotilov

Senior Lead Data Scientist

Remote positions only

Toptal LinkedIn GitHub

datascienceREMOVE@tuta.io

Skills

Stack: Python | C++ | SQL | NoSQL | Linux | MacOS

Machine Learning

sklearn statsmodels lightgbm skopt tpot numpyro pytorch tensorflow langchain

Data Engineering

pandas dask SQL PostgreSQL SQLAlchemy MongoDB AWS-S3 AWS-Athena Spark requests imaplib STOMP

Reproducibility, Deployment & MLOps

docker Git poetry mlflow Ansible pytest pre-commit prefect GitHub-Actions Grafana ELK cloud-VMs

Numerical Computation

numpy scipy numba jax

Data Visualization

jupyter matplotlib plotly dash streamlit flask fastapi HTML asyncio

Industry Knowledge

Team & Management

Language: English | French | Russian

Hobbies: tabletop & live-action roleplaying games, organizing game festivals like this one, rock singing & guitar, theater, travelling

Work Experience

07/2020 - present: Senior Data Scientist, Toptal, Remote
Building AI systems for clients as Toptal's consultant
  • Consulted as a Data Scientist in both startups and larger companies
  • Developed ETL and ML systems for the Commodities, Equity, and GIS industries
  • Tweaked LLM pipelines to improve response quality
  • Optimized numerical algorithms for large datasets
  • Delivered data insights in a business-friendly way
  • Organized onsite meetups for remote companies
01/2022 - 06/2024: Founder and Lead Developer, YAFinData, Remote
Building the financial data analytics platform www.yafin.app
  • Designed and implemented a data platform for the UK electricity markets
  • Extracted insights from time series data using Machine Learning
  • Built monitoring systems to ensure high performance and data accuracy
  • Strategized and defined the overarching product development roadmap
  • Led and managed a team of data scientists and software developers
01/2019 - 07/2020: Data Scientist, Vitol, London
Conquering energy markets with the power of Machine Learning
  • Created market analysis tools and systematic trading strategies
  • Worked with coal, power and crude desks
  • Led data science projects from idea brainstorming to deployment
  • Assisted in the internal python and machine learning training
01/2019 - 06/2020: Model Governance Associate, JPMorgan, London
Challenging pricing models for commodity and FX options
  • Created a custom Extended Kalman Filter to calibrate pricing models
  • Reviewed 10 option pricing models in commodities and credit
  • Mentored junior employees
05/2016 - 10/2016: Algorithmic Trading Intern, Credit Suisse, London
Seeking alpha in commodity and equity markets
02/2015 - 07/2015: Research Intern, Novosibirsk State University
Writing a paper on image classification for a specific type of shapes

Academic Degrees

MSc in Financial Mathematics, Pierre and Marie Curie University
DEA El Karoui: top Financial Mathematics program in France
MSc in Applied Mathematics, Ecole Polytechnique
Cycle d'Ingénieur Polytechnicien: top Engineering program in France
MSc in Mathematics and Computer Science, Novosibirsk State University
BSc in Probability and Statistics, Novosibirsk State University
One of the best universities in Russia for STEM fields

Personal Projects

Cancer Treatment Research
Helped to identify promising drug candidates using embeddings and LLMs
Stranger News
Real-world news with a supernatural twist powered by LLMs (link)
GPT Telegram bot (team effort)
A bot powered by GPT with cool additional features (link)
Forecast Sales Kaggle competition
A surprisingly simple model that won the competition (link)
Interactive website
This very website powered by Flask + Dash (link)
Data Science in examples
Notebooks to tackle various ML and numerical problems (link)
Python data wrangling tools
Data pipelines with caching (link) & an efficient XML parser (link)
Popular classification algorithms
A comparison study using data from Titanic Kaggle competition (link)
Data Science meetups and workshops
Organized several non-profit Data Science meetups and workshops (example)