ABOUT ASTRAZENECA
AstraZeneca is a global, science-led, patient-focused biopharmaceutical company that focuses on the discovery, development and commercialization of prescription medicines for some of the world’s most serious diseases. But we are more than one of the world’s leading pharmaceutical companies. At AstraZeneca we’re dedicated to being a Great Place to Work.
About the Role
We are seeking a visionary and experienced Director of Data Engineering to lead the development and management of our data products. The ideal candidate will possess deep expertise in AWS technologies, data warehouse, data lake and machine learning to drive innovation and improve data accessibility and utilization across the organization.
Key Responsibilities
Identify and prioritize business needs in collaboration with business product owner.
Delivery of data & analytics solutions of the data platform meet the strategic objective of R&D business and facilitating groundbreaking research in bioinformatics and data science.
Design and implement scalable and efficient data architectures, leveraging AWS technologies to integrate multi-modal data.
Drive the operationalizing and automating of all capabilities to ensure secure, supported and scalable solutions.
Lead initiatives to incorporate machine learning models, helping to unlock new insights and predictive capabilities.
Work closely with R&D bioinformatics, data scientists, and IT teams to understand data needs and ensure seamless data flow and accessibility.
Ensure data integrity, security, and compliance with industry standards and regulations.
Understanding of ethical considerations in data science, including data privacy, security, and responsible AI usage.
Stay abreast of emerging trends in bioinformatics, data science, AI and cloud technologies, continuously improving processes and capabilities.
Qualifications:
Education
Master's degree in Computer Science, Bioinformatics, Data Science or a related field.
Experience
Minimum 5 years of experience in data engineering
Experience in bioinformatics, NGS project, or multi-omics data processing
Proven experience in the pharmaceutical R&D industry
Experience in machine learning and its application in data engineering.
Technical Skills
Experienced in Business Intelligence, Enterprise Data Warehouse / Data Lake solutions specifically schema design and dimensional data modeling for business analytics
Knowledge and skills with data analytics and visualization technologies including but not limited to AWS Cloud Platform, ETL, SQL, Python and R.
Good understanding of multi-omics data, clinical data, real world data and their applications in drug development.
Broad cloud technology background (SaaS, IaaS, PaaS) and DevOps solutions.
Professional Skills
Strong organization skills with ability to handle complex situations and act transversally.
Teamwork, excellent social skills, intellectual flexibility.
Agile product/project management.
Demonstrated ability to communicate complex technical information in a condensed manner to various stakeholders verbally and in writing.
Understand and develop continuous improvement initiatives.
Demonstrated ability to be a team player with a high level of initiative.
People management and development.
Ability to work independently under little supervision within a fast paced client centric environment, manage priorities, and meet commitments
Self-motivated and results oriented
Well-developed communication and interpersonal skills with the ability to influence and lead functional groups towards alignment
Ability to engage in healthy debates, effectively problem solve within a team environment, and demonstrate resiliency
Experience working in an environment with Agile values and principles.
Fluent both in Chinese and English