Location: Philadelphia, PA
Duration: Permanent and Full time
- Building data pipelines, writing ETL logic in PySpark, or Scala.
- Ingestion of data into AWS from trafficking systems, on premise databases (Teradata, MS SQL Server, Client Vertica etc.), on premise Hadoop cluster etc.
- Spinning up appropriate EMR, EC2 instances for the job
- Scheduling the jobs and automating the jobs
- Creating the data sets in the appropriate format, i.e Parquet
- Ingesting the data into S3/Redshift or any data ware house for external team to consume into their application
Company Description:Looking for a great career? Global Geek Force Recruiting can improve candidate sourcing, interviewing and applicant tracking for a streamlined hiring process. Candidates: please email your resume to careers@globalgeelforce.com