Senior Data Scientist and Deep Learning Engineer

As a Data Scientist with a strong background in Software Engineering, I have experience with Data Science and Machine Learning in the automotive industry, Deep Learning in both Computer Vision (specifically in the dental field) and Natural Language Processing in the entertainment industry, and Classical Computer Vision methods in Robotics. Additionally, I have experience in Data Engineering in the Cloud.

Industry Experience

  • medical
  • robotics
  • automotive
  • music
  • insurance
  • sports
  • consumer electronics
  • e-commerce

Formal Education

My educational background with an M.Sc. in Data Science, concluded with distinction gives me a strong mathematical and theoretical foundation towards Data Science and AI methods. In addition I bring an interesting perspective to AI with a B.Sc. in Psychology.

Skill Summary

  • Machine Learning
    • Computer Vision w/ SqueezeNet, GANs, ResNet, AutoEncoders
    • NLP w/ BERT models
    • random forests
    • logistic/linear regression
    • clustering algorithms
  • Data Visualization
    • Google Looker Studio
    • R Shiny
    • matplotlib, seaborn, ggplot
    • D3.js
  • Computer Vision
    • Image Classification
    • Object Pose Estimation
    • Image Processing
    • Image Segmentation
    • Contour Extraction and Alignment
    • Camera Calibration
  • Feature Engineering
    • Dimensionality Reduction
    • Feature Generation (visual descriptors, tf-idf)
    • Feature Embeddings
    • Feature Selection methods (Entropy based, model based, correlation based)
  • Data Engineering
    • ETL Pipelines in BigQuery
    • Airflow
    • Snowflake
    • AWS RDS
    • PySpark
    • PostgreSQL
    • MSSQL
  • Cloud Platforms
    • Amazon Web Services (RDS, S3, EC2, Lambda, VPC, SageMaker)
    • Google Cloud Platform (Compute Engine, Cloud Scheduler, Google API, Cloud Function Triggers, Google Looker Studio)
  • Coding Tools and CI/CD
    • Docker
    • git, bitbucket, github, gitlab
    • Confluence, Jira
    • Jenkins
  • OS and Programming Languages
    • Linux
    • macOS
    • Python
    • R
    • C#
    • JavaScript
  • Favorite Python Packages
    • pandas, numpy, pyspark, pyodbc
    • tensorflow, keras, pytorch, sklearn, scipy
    • opencv, pillow, trimesh
    • huggingface, nltk
    • seaborn, matplotlib
    • multiprocessing, dask
    • pytest, attrs, flask, pyzbar

Project Experience

Cloud Data Engineering

Mindfuel GmbH
Main Technologies of the project

GCP Compute Engine, Docker, BigQuery, Google Looker Studio, AWS VPC, AWS RDS, OpenVPN, Postgres, JavaScript

Accomplishments
  • created and maintained a cross-cloud data pipeline
  • developed, delivered, and deployed solution in a docker image
  • created a new Google Looker Studio community visualization tool in js: Sankey Diagram
Project Description

NLP Data Scientist

ITSP GmbH
Main Technologies of the project

BERT, Python, Pandas, GitHub, Huggingface, random forest

Accomplishments
  • text preprocessing
  • Feature Engineering from the processed text (lemmatization, stemming, counts, template matches, creation of meta-features)
  • Trained Random Forest for classification of simple vs complex chats
  • Used BERT model for sentiment classification of customers/agents, aggregation over time and visualization of sentiment
Project Description

Computer Vision Engineer

Dental Manufacturing Unit GmbH
Main Technologies of the project

OpenCV, Python, Flask, Linux, RaspberryPi, CI/CD, Docker, PyTorch, scipy

Accomplishments
  • brought stable computer vision solution from r&d to market
  • implemented robust, efficient, and reliable algorithm for finding the best affine transformation matrix
  • implemented a highly reliable and fast image segmentation algorithm
  • determined optimal lighting conditions for image capture
  • developed extensive test suite for Quality Assurance
  • managed CI/CD pipeline
  • object-oriented programming throughout the project
  • Post-Launch On-site early customer support w.r.t. vision system as well as network communication
Project Description

Deep Learning Computer Vision Engineer

Dental Manufacturing Unit GmbH
Main Technologies of the project

Python, TensorFlow, OpenCV, ONNX, CUDA, .NET, C#

Accomplishments
  • successful implementation of CNN architecture with low latency and high accuracy
  • deployment of TensorFlow model through ONNX in C# codebase
  • wrote unit, regression, and e2e tests
Project Description

Data Scientist

Porsche Informatik GmbH
Main Technologies of the project

R, Python, PySpark, Bash

Accomplishments
  • successful implementation of python module for big data feature selection in pyspark
  • documentation of implemented feature as well as user guide
  • thorough research and understanding of feature selection methods
  • data restructuring, processing and preparation, exploratory data analysis
  • Code Parallelization in R
Project Description

Data Scientist

Kyocera AVX Group
Main Technologies of the project

R, R Shiny, SQL

Accomplishments
  • Visualization of KPI’s
  • Performance improvements to existing Shiny app
  • Creation of automated reports of production data