About me
I'm an autonomous, curious and self-driven Data Scientist with a solid theoretical background (MSc in Computer Science at UBA, GPA 9.4) and extensive professional experience as a Software Engineer (more than 9 years in the industry across different roles, projects and types of companies). I'm a Kaggle kernel expert and reached 52nd place among more than 100K users (I left a link to my public kernels as my portfolio).
Experience
February 2019 - Present

Data Science Professor. I teach a semester-long data science course that covers the following topics: Python fundamentals for data science (NumPy, Pandas and visualization libraries), SQL, descriptive and inferential statistics, data cleaning and preprocessing, APIs and web scraping, machine learning fundamentals, supervised and unsupervised machine learning algorithms, recommender systems and text mining.

Technologies

Python, NumPy, Matplotlib, Pandas, Sklearn
February 2019 - Present

Etermax is a mobile game development company, best known for the game Trivia Crack. My main projects in this position are the following: Implemented an image moderation system with CNNs using fast.ai. Explored various improvements to the question-serving process using state-of-the-art and old-fashioned NLP, clustering techniques, PySpark and various ML models (question recommendation strategies based on collaborative filtering, trivia embeddings for detecting duplicates, similarity models for increasing content diversity). Implemented a simple rule-based categorization system for questions, with high coverage across more than 100 classes. Currently, we are working on a win-rate prediction project for content personalization, trying factorization machines (FFM), collaborative filtering (PySpark ALS, LightFM) and classification approaches. Technology stack: PyTorch, fast.ai, transformers, Spark, Databricks, AWS (EC2, S3, SageMaker, Lambda), Python data stack, Redshift, MySQL, MLflow, Airflow.

Technologies

Python, SQL, PyTorch, PySpark, Recommendation Systems, AWS EC2
August 2017 - February 2019

Navent is the leading online classified ads company for jobs and real estate in Latin America, with a presence in 8 countries. Some of my main achievements at Navent are the following: Implemented a prod2vec item-item recommendation model from scratch and scaled it up with k-means, Annoy and multiprocessing, obtaining a 17.5% increase in CTR over the previous model. Scaled up the main recommendation system to handle 10x its original data using Spark, Annoy, Cassandra and GCP, obtaining a 30% increase in CTR over the previous model. Built reports with Jupyter notebooks and Google Data Studio, and segmented users using clustering techniques. Participated in a duplicate detection system project using pHashes, MongoDB, Cassandra and GCP. Improved and scaled the recommender event tracking system using PubSub, Dataflow and BigQuery. Presented our recommendations ecosystem at Universidad Austral and at UADE (austral.edu.ar/ingenieria-posgrados/jornadadm/Navent-Sistemasderecomendacion.pdf).

Technologies

Python, SQL, Google Cloud, Spark, Recommendation Systems, BigQuery
October 2014 - August 2017

Worked abroad (in Israel) for almost 2 years, then remotely from Buenos Aires. I was part of various teams with members from different countries (Israel, US, India, Ukraine and China), working across different time zones and levels of expertise. Worked mostly on building a CI system from scratch with my main team, but we also coded a few utilities for the main C developers.

Technologies

Python, Requirements Analysis, Jenkins, Bash
May 2012 - August 2014

Back-end developer. Worked mostly on building, deploying and maintaining an e-mail delivery system from scratch with PHP. I learned a lot about development, databases, Linux and requirements analysis in this position.

Technologies

SQL, Linux, Requirements Analysis, Apache, Bash
March 2008 - February 2010

Built simple web-based CRUD apps with PHP.

Technologies

SQL, Linux, Apache
Education
January 2006 - January 2016
MS in Computer Science
University of Buenos Aires