Hi, I'm
Dustin
Data Scientist
MSc Computer Science @ ETH Zürich

About

Welcome to my website!

I hold a Master's degree in Computer Science from ETH Zürich, where I specialized in Machine Intelligence and Data Management. My interests lie at the intersection of software engineering, statistics, and machine learning, with a strong focus on building reliable, data-driven systems.

I have contributed to projects in forecasting, data engineering, and machine learning across various industries, including logistics and energy. My work also includes experience with LLM-based agents and open-source contributions to the Darts time series forecasting library, where I help improve and extend tools for modern time series modeling.

Driven by curiosity and a passion for turning complex data into practical insights, I enjoy solving challenging problems and continuously learning about new developments in AI. Feel free to connect with me.

Skills

Programming Languages

  • Python
  • R
  • C++
  • Java
  • TypeScript
  • Haskell

Frameworks & Libraries

  • Pandas
  • NumPy
  • scikit-learn
  • PyTorch
  • XGBoost
  • LightGBM
  • CatBoost
  • Matplotlib
  • Plotly
  • SciPy
  • Flask
  • Optuna
  • PySpark
  • Hugging Face
  • SHAP
  • statsmodels
  • wandb

Data Management

  • SQL
  • Spark
  • Hadoop
  • HBase
  • MongoDB
  • Neo4j

DevOps & Tooling

  • Docker
  • Git
  • CI/CD
  • GitHub Actions
  • Jenkins
  • Bash
  • pre-commit
  • mypy
  • Confluence
  • Notion
  • Jira
  • Slurm

Focus Areas

  • Forecasting
  • Visualization
  • Explainability
  • Causal Inference
  • Natural Language Processing (NLP)
  • Generative AI (GenAI)

Languages

  • German (Native)
  • English (Proficient)
  • French (Elementary)

Experience

Data & AI Engineer | Unit8

Sep 2025 - Present | Zürich, Switzerland
  • Helping organizations turn data into value through advanced analytics, AI, and data engineering by building end-to-end solutions that drive smarter decisions.

Research Assistant | Oracle Labs

Feb 2025 - Sep 2025 | Zürich, Switzerland
  • Built an LLM agent for financial institutions, automating data retrieval, analysis, and report generation, reducing decision-making time by 80%.
  • Built a Python script and Gradio app that analyze runbooks with an LLM agent and automatically extract configurations, cutting processing time by 50%.

Data Scientist | Unit8

Sep 2021 - Aug 2022 | Zürich, Switzerland
  • Developed and implemented a route planning tool using PySpark and Palantir Foundry, optimizing logistics for a newspaper company.
  • Core contributor to the Darts library, optimizing model efficiency while reviewing pull requests, suggesting enhancements, and improving documentation.
  • Collaborated with clients to deliver high-impact data solutions in energy and manufacturing sectors.

Education

ETH Zürich

Master of Science in Computer Science

2022 - 2024

Major: Machine Intelligence | Minor: Data Management

GPA: 5.46 / 6.0

ETH Zürich

Bachelor of Science in Computer Science

2018 - 2021

Projects

Recommender Systems for Swiss Politics

Master's Thesis (Feb 2024 - Aug 2024)

Identified 11 vulnerabilities in the Swiss voting advice application Smartvote, some of which allow visibility gains of more than 3.5x for individual parties. Proposed 10 mitigations that significantly reduce or eliminate these vulnerabilities. The findings are being adopted in Smartvote's redesign for the next elections.

Python, Pandas, D-Tale, SciPy, Optuna, Plotly, LaTeX, Notion

DataComp Challenge

Semester Project (Sep 2023 - Dec 2023)

Ranked 4th out of 12 teams in the small track of the DataComp Challenge, an ML benchmark in which participants filter a CommonCrawl image-text dataset to train a CLIP model that is then evaluated on 38 zero-shot downstream tasks. The approach combined cross-modality filtering with content alignment.

Python, PyTorch Lightning, SLURM (Cluster), CLIP (Contrastive Language-Image Pretraining)

BasketXplainer

Interactive ML Project (Feb 2023 - Jul 2023)

Developed an interactive dashboard to predict basketball game outcomes based on in-game stats and explain predictions using SHAP values. Users could modify team statistics to explore what-if scenarios.

Python, scikit-learn, SHAP, Flask, JavaScript, React, GitLab Pipelines

Darts Forecasting Library

Open-Source Contributor (Sep 2021 - Aug 2022)

Core contributor to Darts, the open-source time series forecasting library by Unit8. Optimized the most popular regression forecasting models by vectorizing computations, achieving a speedup of up to 400x.

Python, PyTorch, scikit-learn, Matplotlib, Git, GitHub Actions
pip install darts

Distance Preserving Graph Embedding

Bachelor's Thesis (Feb 2021 - Aug 2021)

Developed a model that enables constant-time approximate shortest path distance queries on road networks, achieving an average mean relative error of less than 10%.

Python, NetworkX, PyTorch Geometric, (Hyperbolic) Graph Convolutional Networks ((H)GCNs)
Winterthur, Switzerland
Surat, India
Dongguan, China

Hobbies

Sports

  • Squash
  • Badminton
  • Gym
  • Padel Tennis
  • Table Tennis
  • Volleyball
  • Tennis

Interests

  • Photography
  • Music
  • Chess
  • GeoGuessr
  • Geography
  • Coding
  • History
  • Philosophy

Social Activities

  • Board Game Night with Friends
  • Good Conversations over Dinner
  • Exploring Nature and Hiking