Agentic Data Engineer

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Bachelor's or master's degree in computer science, AI, Data Science, or a related field., Strong programming skills in Python and experience with AI/ML frameworks., Experience with big data frameworks like Spark/Databricks and cloud computing skills., Proficiency in vector databases and embedding models for retrieval tasks..

Key responsibilities:

  • Design and develop data pipelines for agentic systems and manage ELT processes.
  • Collaborate with data scientists and engineers to preprocess data and integrate AI into applications.
  • Implement data pipelines that facilitate feedback loops for system performance improvement.
  • Optimize data storage and retrieval for high performance.

TalentBurst, an Inc 5000 company logo
TalentBurst, an Inc 5000 company Human Resources, Staffing & Recruiting Large https://www.talentburst.com/
1001 - 5000 Employees
See all jobs

Job description

 Role: Agentic Data Engineer
Duration: 7+ months
Location: Richmond, VA
Details: Remote
 
Seeking a highly skilled Agentic Data Engineer to design, develop, and deploy data pipelines that leverage agentic AI that solve real-world problems. The ideal candidate will have experience in designing data processes to support agentic systems, ensure data quality and facilitate interaction between agents and data.
 
Responsibilities:

  • Designing and developing data pipelines for agentic systems, develop robust data flows to handle complex interactions between AI agents and data sources.
  • Ability to train and fine-tune large language models.
  • Design and build the data architecture, including databases, data lakes to support various data engineering tasks.
  • Develop and manage Extract, Load, Transform (ELT) processes to ensure data is accurately and efficiently moved from source systems to analytical platforms used in data science.
  • Implement data pipelines that facilitate feedback loops, allowing human input to improve system performance in human-in-the-loop systems.
  • Work with vector databases to store and retrieve embeddings efficiently.
  • Collaborate with data scientists and engineers to preprocess data, train models, and integrate AI into applications.
  • Optimize data storage and retrieval with high performance.
  • Statistical analysis, trends, patterns to create data formats from multiple sources.
 
Qualifications:
  • Strong data engineering fundamentals.
  • Utilize big data frameworks like Spark/Databricks.
  • Training LLMs with structured and unstructured data sets.
  • Understanding of Graph DB.
  • Experience with Azure Blob Storage, Azure Data Lakes, Azure Databricks.
  • Experience implementing Azure Machine Learning, Azure Computer Vision, Azure Video Indexer, Azure OpenAI models, Azure Media Services, Azure AI Search.
  • Determine effective data partitioning criteria.
  • Utilize data storage system spark to implement partition schemes.
  • Understanding core machine learning concepts and algorithms.
  • Familiarity with cloud computing skills.
  • Strong programming skills in Python and experience with AI/ML frameworks.
  • Proficiency in vector databases and embedding models for retrieval tasks.
  • Expertise in integrating with AI agent frameworks.
  • Experience with cloud AI services (Azure AI).
  • Experience with GIS spatial data to create markers on maps (lat long nearest topology of road, geo-locate between datasets, correlation etc.).
  • Experience with Department of Transportation Data Domains developing an AI Composite Agentic Solution designed to identify and analyze data models, connect & correlate information to validate hypotheses, forecast, predict and recommend potential strategies and conduct What-if analysis.
  • Bachelor's or master's degree in computer science, AI, Data Science, or a related field.

Required profile

Experience

Industry :
Human Resources, Staffing & Recruiting
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration

Data Engineer Related jobs