Data QA Engineer

Our TechOps team is looking to bring on an experienced Data QA Engineer. If you are interested in being a guiding voice on a critical feature delivery team, this position is for you.  The team is responsible for implementing and configuring the data pipelines that ingest and do the ETL and process diverse sources of data. This includes building large batch processing and streaming systems. You will be exposed to every area of our platform and AWS services (Serverless – Lambda, EMR – Hadoop/Spark, EKS – Kubernetes, etc.) but even more so, help drive the evolution of the product and team as we continue to grow rapidly. We believe heavily in architecture evolution, and you will bring your experience to best practices as we build out new components of the platform. 

You Will: 

  • Review and Identify business requirement gaps from mapping specification document 
  • Effort estimation of QA activities/tasks 
  • Identify mapping test scenarios and authoring ETL test cases 
  • Understand the file layout of various input file formats and perform parser validation 
  • Write Source and Target SQLs for data validation 
  • Automation of Source to target validation in Pyspark, Databricks 
  • Identify data needs and perform test execution with Synthetic data that resembles real time data   
  • Writing SQLs for User Acceptance Criteria 
  • User Acceptance testing (UAT) support 
  • Perform day-to-day activities using Agile methodologies 
  • Capture QA activities using test management tools and Jira 
  • Onsite – offshore coordination on day-to-day activities 

 
What We're Looking For: 

  •  In-depth Healthcare data knowledge with ETL background 
  • Bachelor's degree or higher in a quantitative/technical field (e.g. Computer Science, Statistics, Engineering) 
  • Knowledge of data management fundamentals and data storage principles  
  • Strong experience with hands-on data analysis of large data sets. 
  • Advanced SQL skills  
  • Big Data Testing, ETL Testing experience 
  • Python scripting experience 
  • Strong experience of Healthcare Payer Data testing 
  • Experience with Pyspark and Databricks 
  • Experience with AWS services like Redshift, S3, Athena 

Nice to Have: 

  • Experience with other Cloud Data Warehouses (Big Query, Snowflake) 
  • Hands on experience of API testing  
  • ETL testing automation experience 




Subscribe now!

Want to receive frequent updates by email? Subscribe to our automatic job service!

Related vacancies