Data Engineer (Python/Pandas, Ukraine) #14731

Remote: Full Remote
Experience: Mid-level (2-5 years)

Offer summary

Qualifications:

University degree in a computer-related field or equivalent experience; 3+ years with Big Data technologies (PySpark preferred); 3+ years of Python development experience (Python, Pandas); strong experience with Spark and Databricks; experience with cloud services (Azure preferred).

Key responsibilities:

  • Build and modify ETL solutions in the marketing/sales domain
  • Develop scalable data processing software
  • Ensure high-quality development standards
  • Collaborate with product management to address client needs
  • Report on task status and technical risks
Capgemini Engineering
Information Technology & Services, XLarge (10001 Employees)
https://www.capgemini.com/

Job description

Purpose Of The Job

As a Data Engineer, you will build and modify an ETL solution in the marketing and sales domain, which involves data transformation, imputation, and loading for further use in the analytics solution. Another important aspect is refactoring existing Data Science code and embedding it in a PySpark pipeline.

Main Tasks And Responsibilities

  • Design, develop, deliver and operate scalable, high-performance data processing software.
  • Ensure high-quality development standards (unit/integration tests, etc.).
  • Collaborate with the product management team to incorporate the needs of the client.
  • Proactively raise technical risks and issues.
  • Report the status of current tasks to your supervisor.
  • Work in close contact with team members and other relevant stakeholders.
  • Take responsibility for personal professional development.
  • Follow established company policies and processes, and complete mandatory training.

Education, Skills And Experience

MUST HAVE:

  • University degree in Computer Related Sciences or equivalent working experience.
  • 3+ years with Big Data technologies and tools (PySpark preferred).
  • 3+ years of Python development experience (Python, Pandas).
  • Strong hands-on experience with Spark and Databricks.
  • Experience with cloud services (Azure preferred).
  • Good English (verbal and written) and good general communication skills.

Would be a plus:

  • MLOps experience.
  • Experience handling scalability issues and optimizing pipelines/clusters.
  • Experience with Data Science algorithms (e.g. Random Forest, Linear Regression).

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry: Information Technology & Services
Spoken language(s): English

Other Skills

  • Verbal Communication Skills
  • Collaboration
  • Leadership Development
