Offer summary
Qualifications:
- 5+ years of experience as a data engineer
- Strong proficiency in Python (5+ years)
- Experience with Spark for large data pipelines
- Strong SQL skills with data operations
- Ability to balance technical expertise and creativity
Key responsibilities:
- Build and optimize the academic research paper pipeline
- Architect solutions for scalable data needs
- Process, deduplicate, and index research papers
- Enhance data infrastructure efficiency and quality
- Implement robust data quality management processes