Data Engineer/Analysis with 5+ years of experience in building efficient, scalable, and resilient distributed data pipelines for collecting, cleaning, and aggregating large volumes of data.
University teacher, technical writer, and hackathon judge, mentor.
My main technical skills
Languages : Python, SQL, PL/SQL, VBA
Technologies : pySpark, Hadoop, Text Mining, Oracle apex, Airflow, Cron, Linux/Unix/Windows cli, Fast load, Fast export, xml, VS Business intelligence
Databases : Oracle, Greenplum, Hive, Teradata, MS sql, MySql, Impala