We are looking for a Data Engineer to be part of the team of an American Company, where the big data space is growing rapidly, beyond the data pipeline, you will also be instrumental in bringing the data-driven products and features to life and helping build the future of the company.
You will also have the opportunity to work with the product teams, building models and APIs to drive new features delivers analysis to further improve engagement in existing features, and empowers our business with real-time insights to drive growth in market share, engagement, and revenue.
We see 1 trillion events per year and process 10TB of data daily. The challenges of this position are : Manage & improve a Hadoop-based data pipeline (EMR / Cloudera, Impala, Airflow, among others) Requirements At least 3 years of experience as a data engineer.
Experience working with Hive and Apache AirflowExperience working with HadoopExperience in Python, primary with scripts Optional Skills Knowledge in Redshift / EMRExperience with ElasticsearchExperience working with a massive volume of Data.
You'll have the opportunity to contribute with your ideas!