Back to Search
Overview
New Grad

Data Engineer I

Confirmed live in the last 24 hours

Bristol-Myers Squibb

Bristol-Myers Squibb

Hyderabad - TS - IN
On-site
Posted April 17, 2026

Job Description

Working with Us
Challenging. Meaningful. Life-changing. Those aren’t words that are usually associated with a job. But working at Bristol Myers Squibb is anything but usual. Here, uniquely interesting work happens every day, in every department. From optimizing a production line to the latest breakthroughs in cell therapy, this is work that transforms the lives of patients, and the careers of those who do it. You’ll get the chance to grow and thrive through opportunities uncommon in scale and scope, alongside high-achieving teams. Take your career farther than you thought possible.

Bristol Myers Squibb recognizes the importance of balance and flexibility in our work environment. We offer a wide variety of competitive benefits, services and programs that provide our employees with the resources to pursue their goals, both at work and in their personal lives. Read more: careers.bms.com/working-with-us.

Position Summary

We are looking for an earlycareer Data Engineer to help build, operate, and continuously improve reliable data pipelines and curated datasets that support commercial analytics and reporting. In this role, you will transform raw data into highquality, analyticsready tables in the refined layer, applying data modeling best practices, validation checks, and governance standards. You will work closely with engineers, analysts, and business stakeholders to understand data requirements, implement scalable transformations, and ensure data products are trusted, secure, and welldocumented.

Key Responsibilities

·       Design, build, and support reliable batch and incremental data pipelines for commercial data products, ensuring scalability, maintainability, and operational readiness.

·       Ingest, transform, and integrate large-scale structured and semi-structured pharma datasets (e.g., claims, patient, HUB, and specialty pharmacy data) into curated refined-layer tables.

·       Implement transformation logic including cross-domain joins and slowly changing dimensions (SCD) and maintain clear technical documentation for datasets and pipelines.

·       Ensure data quality through cleansing, standardization, deduplication, reconciliation, automated validation checks, anomaly detection, and monitoring/alerting; triage and resolve data issues.

·       Apply data governance practices including documentation, lineage, access controls, and compliant handling of sensitive/regulated data.

·       Partner with analysts and business stakeholders to translate requirements into data specifications, define dataset readiness criteria, and support adoption of published data products.

 

 Skills & Competancies

  • Proficiency in SQL and Python for data transformation, validation, and pipeline development.

  • Experience building ETL/ELT pipelines and working with lakehouse concepts (e.g., medallion architecture; bronze/silver/gold layers).

  • Hands-on experience with Databricks (notebooks/jobs/workflows) and Delta Lake concepts (ACID tables, incremental processing, upserts/merge).

  • Familiarity with data modeling for analytics-ready datasets.

  • Experience with cloud data engineering fundamentals (e.g., AWS storage/compute, IAM concepts)

  • Understanding of data quality practices and operational support.

  • Working knowledge of engineering best practices (Git/version control, code reviews, basic CI/CD concepts).

  • Understanding of data governance and secure data handling (documentation, lineage, access controls, PII/PHI awareness).

  • Familiarity with BI/visualization tools (Tableau/Power BI) is a plus for downstream consumption.

  • Strong problem-solving, communication, and time-management skills

  • Ability to work with both technical and business partners.

 

 

Qualifications & Experience

  • Bachelor’s or Master’s degree in Computer Science, Engineering, Information Systems, Statistics/Mathematics, or a related field (or equivalent practical experience).

  • 1–3 years of hands-on experience in data engineering or related roles, building and supporting ETL/ELT pipelines and curated datasets; biopharma/pharmaceutical experience is a plus.

  • Experience processing large-scale structured and semi-structured datasets using SQL and Python, with attention to data quality, performance, and maintainability.

  • Preferred: experience with Databricks and Delta Lake concepts.

  • Preferred: experience working with commercial pharma datasets (e.g., claims, sales, payer, patient, HUB/specialty pharmacy) and understanding common identifiers and integration challenges.

If you come across a role that intrigues you but doesn’t perfectly line up with your resume, we encourage you to apply anyway. You could be one step away from work that will transform your life and career.

Uniquely Interesting Work, Life-changing Careers
With a single vision as inspiring as “Transforming patients’ lives through science™ ”, every BMS employee plays an integral role in work that goes far beyond ordinary. Each of us is empowered to apply our individual talents and unique perspectives in a supportive culture, promoting global participation in clinical trials, while our shared values of passion, innovation, urgency, accountability, inclusion and integrity bring out the highest potential of each of our colleagues.

On-site Protocol

BMS has an occupancy structure that determines where an employee is required to conduct their work. This structure includes site-essential, site-by-design, field-based and remote-by-design jobs. The occupancy type that you are assigned is determined by the nature and responsibilities of your role:

Site-essential roles require 100% of shifts onsite at your assigned facility. Site-by-design roles may be eligible for a hybrid work model with at least 50% onsite at your assigned facility. For these roles, onsite presence is considered an essential job function and is critical to collaboration, innovation, productivity, and a positive Company culture. For field-based and remote-by-design roles the ability to physically travel to visit customers, patients or business partners and to attend meetings on behalf of BMS as directed is an essential job function.

Supporting People with Disabilities

BMS is dedicated to ensuring that people with disabilities can excel through a transparent recruitment process, reasonable workplace accommodations/adjustments and ongoing support in their roles. Applicants can request a reasonable workplace accommodation/adjustment prior to accepting a job offer. If you require reasonable accommodations/adjustments in completing this application, or in any part of the recruitment process, direct your inquiries to adastaffingsupport@bms.com. Visit careers.bms.com/eeo-accessibility to access our complete Equal Employment Opportunity statement.

Candidate Rights

BMS will consider for employment qualified applicants with arrest and conviction records, pursuant to applicable laws in your area.

If you live in or expect to work from Los Angeles County if hired for this position, please visit this page for important additional information: https://careers.bms.com/california-residents/

Data Protection

We will never request payments, financial information, or social security numbers during our application or recruitment process. Learn more about protecting yourself at https://careers.bms.com/fraud-protection.

Any data processed in connection with role applications will be treated in accordance with applicable data privacy policies and regulations.

If you believe that the job posting is missing information required by local law or incorrect in any way, please contact BMS at TAEnablement@bms.com. Please provide the Job Title and Requisition number so we can review. Communications related to your application should not be sent to this email and you will not receive a response. Inquiries related to the status of your application should be directed to Chat with Ripley.

R1601481 : Data Engineer I
data