AI Data Engineer
Position Summary
The Data Engineer will play a crucial role in developing and refining the data used to train and fine-tune our LLMs and machine learning models. This individual will be responsible for the entire data lifecycle, including gathering, cleaning, structuring, and optimizing large, diverse healthcare datasets. The ideal candidate will have a strong background in data engineering principles, experience with big data technologies, and a keen understanding of the unique challenges and requirements of healthcare data.
You will design, build, and maintain scalable data pipelines that source, preprocess, and deliver high-quality, high-volume datasets to our machine learning engineers. This role requires a deep understanding of data engineering best practices coupled with specific knowledge of the data requirements for LLM training and refinement.
Key Responsibilities
- Collaborate with data scientists and machine learning engineers to understand data requirements for LLM and machine learning model fine-tuning.
- Design, build, and maintain scalable data pipelines to ingest, process, and store massive and diverse healthcare datasets.
- Implement robust data cleaning, validation, transformation, and monitoring processes to ensure the quality, integrity, accuracy, and consistency of all training datasets.
- Develop and optimize data structures and schemas for efficient access and utilization by LLMs and machine learning models.
- Work with the team to identify and acquire new data sources, ensuring compliance with relevant healthcare regulations (e.g., HIPAA).
- Monitor data pipeline performance, troubleshoot issues, and implement optimizations to improve efficiency and reliability.
- Document data engineering processes, data models, and data dictionaries.
- Stay up-to-date with the latest advancements in data engineering, big data technologies, and machine learning.
Required
- Bachelor's degree in Computer Science, Engineering, or a related field.
- Proven experience as a Data Engineer, with a focus on big data technologies.
- Strong proficiency in programming languages such as Python, Scala, or Java.
- Extensive experience with data warehousing, ETL processes, and data modeling.
- Experience with major cloud providers (e.g., AWS, GCP, Azure) and their data storage and processing services.
- Hands-on experience with big data frameworks like Apache Spark for distributed processing.
- Excellent problem-solving skills and the ability to work independently and as part of a team.
- Strong communication and interpersonal skills.
Preferred
- Master's degree in a related field.
- Experience with healthcare data and a good understanding of healthcare data standards (e.g., FHIR, HL7).
- Familiarity with machine learning concepts and LLM fine-tuning processes.
- Experience with data orchestration tools (e.g., Apache Airflow).
Why Join Us?
Joining C the Signs is not just about building AI; it’s about shaping the future of healthcare. If you are a technical leader with an unshakable belief in the power of AI to save lives and the ability to make it happen at scale, this is your opportunity to create a tangible, global impact.
Benefits:
- Competitive salary and benefits package.
- Flexible working arrangements (remote or hybrid options available).
- The opportunity to work on life-changing AI technology that directly impacts patient outcomes.
- A team that combines cutting-edge innovation with a mission to save lives and improve health equity.
- Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.