Position Overview
We are looking for a skilled Data Engineer to architect and maintain high-capacity data solutions. This role is focused on the end-to-end data lifecycle, from ingestion and curation to quality assurance and distribution, ensuring a robust foundation for organizational analytics.
Key Responsibilities
Data Architecture & ETL: Design and deploy large-scale data systems using Informatica and Snaplogic. You will automate complex data loading and extraction processes by developing customized UNIX shell scripts as part of the ETL workflow.
Cloud Data Management: Manage and optimize queries within modern cloud repositories and data lakes, specifically Google BigQuery and AWS Redshift.
Advanced Engineering: Write structured, production-grade PySpark code to implement machine learning use cases and data processing logic.
Visualization & Insights: Support data-driven decision-making by facilitating data visualization through Tableau and MicroStrategy.
Quality Assurance: Lead the data validation process by creating comprehensive test plans, scripts, and cases to ensure the highest standards of data integrity and quality.
Work Environment & Requirements
Location: Primary workstation in Weddington, NC, with the requirement to work at various unanticipated client sites across the U.S.
Travel: Candidates must be prepared to travel and/or relocate to meet the demands of diverse client projects.
Job Type: Full Time