Alexis Roldan

20 Years Building Production AI/ML for Biopharma

Sr. Data Engineer & Data Scientist

Sr. Data Engineer & Data Scientist at Takeda Pharmaceutical. 30+ deployed solutions spanning data pipelines, ML models, and GenAI systems.

Data science illustration with charts and analytics

About Me

About Me

Alexis Roldan profile photo

I'm Alexis Roldan

A seasoned Full-Stack Data Scientist & Software Developer with a passion for leveraging Advanced Analytics and Statistical Modeling to provide insightful data-driven solutions.

  • End-to-End Data Science & ML
  • Data Engineering & Pipelines
  • Business Intelligence & Visualization
  • Process Optimization & Digital Twins
  • Generative AI & LLM Applications

Certified Lean Six Sigma Green Belt, AGILE Champion, Databricks Champion, and Mulesoft Champion; experienced in Project Management, Agile, SCRUM and DMAIC methodologies with strong leadership experience in biopharma.

English & Spanish (Native) · Japanese (Conversational)
  • 20

    Years in
    Biopharma
  • 30+

    Solutions
    in Production
  • 9+

    Successful
    Projects Led
Link to Resume

Recognition

Race4Value - FTE Optimization Winner

2023

Star of the Quarter Q3 FY22

2022

Employee of the Quarter

2017

Skills

Skills

Education & Skills

Here's my journey and technologies that I currently use.

R Python SQL .NET HTML / CSS Java

Shiny Dash Minitab JMP SIMCA / OPLS Apache Spark

SQL Server PostgreSQL Oracle

AWS Databricks Docker Git

Qlik Sense Power BI Tableau

LLM Integration RAG Architecture Prompt Engineering Text-to-SQL Agentic AI Claude / GPT APIs Voice AI (TTS/STT)

Six Sigma SCRUM / Agile DMAIC MLOps Project Management

  • 2023

    Oxford Machine Learning Summer School - AI for Global Goals

  • 2020

    Data Science Professional - HarvardX

  • 2020

    Statistical Learning, Inference & Modeling - HarvardX / Stanford Online

  • 2023

    ODSC AI Bootcamp - Open Data Science Conference

  • 2012

    Computer Science - Cal State Northridge

  • 2025

    Optimizing Performance in Shiny - ShinyConf 2025

  • 2024

    Rhino Masterclass - ShinyConf 2024

  • 2023

    Developing & Testing Your Shiny Application - Open Source in Pharma

  • 2022

    Building Production-Quality Shiny Applications - Open Source in Pharma

  • 2022

    API Development - Mulesoft

  • 2021

    PI System Asset & Visualization - OSIsoft

Work & Experience

Baxter (2005-2015) Baxalta (2015-2016) Shire (2016-2019) Takeda (2019-Present)

Sr. Data Engineer & Data Scientist (Sr. Manager)

Full Time
Apr 2025 - Present

Support Los Angeles Digital Roadmap, AI/ML Development and Advanced Analytics to support data-driven decisions.

  • Leading LA Digital Roadmap & AI/ML development
  • 30+ solutions deployed to production
  • Designing, building, and maintaining local data pipelines for critical business data
  • Developing innovative ML models to support production workflows and site objectives
  • Architected hybrid RAG systems achieving 70% token reduction and 59% deviation reduction
  • Built agentic BI dashboards and AI-powered text-to-SQL applications for quality management

Data Scientist III (Sr. Manager)

Full Time
Nov 2022 - Mar 2024

Focus on AGILE 4.0 Digital Workstream, AI/ML and cutting edge data analytics programs to accelerate Takeda's digital capabilities across all GMS sites.

  • Drove AGILE 4.0 Digital Workstream across all GMS sites
  • Executed local and global GMS/GQ big data and analytics strategy
  • Built and maintained data pipelines and BI applications across all GMS plasma sites

Data Scientist II (Manager)

Full Time
Apr 2021 - Nov 2022

Build AI/ML and cutting edge data analytics programs to gain insight on different processes and accelerate Takeda's digital capabilities.

  • Achieved ~50% reduction in process variation via digital twin models (PIMS project)
  • Designed A/B tests and developed End-to-End ML Applications
  • Los Angeles lead data scientist for global analytics initiatives

Data Scientist, Analytics

Full Time
Dec 2019 - Apr 2021

  • Developed & deployed PIMS (Process Improvement Monitoring System) — real-time analytics across LA site
  • BI stream owner for TechOps LA: Qlik Sense, Power BI dashboards
  • Statistical advisor; deployed ML models for predictive analytics (R/Shiny)

Operations Leader

Full Time
Apr 2018 - Nov 2019

  • Led $1.2M VIP project reducing equipment waste across LA site
  • Built BI apps and PIMS real-time analytics platform
  • Led DMAIC problem-solving initiatives and process simulations (ProModel)

Senior Operations Analyst

Full Time
Jun 2016 - Mar 2018

  • Data analysis, process improvement, and BI development
  • Continued building PIMS and analytics capabilities
  • Six Sigma & advanced analytics support

Senior Operations Analyst

Full Time
Mar 2015 - May 2016

  • Transitioned with Baxalta spin-off from Baxter
  • System development, data analysis, and project management

Operations Analyst

Full Time
Nov 2013 - Feb 2015

  • Data analysis and process improvement
  • Early analytics and reporting automation

Manufacturing Technician 3

Full Time
Jul 2005 - Oct 2013

  • 8+ years of hands-on manufacturing floor experience
  • Foundation in biopharma production processes

My Latest Projects

My Latest Projects

Take a look at my recent work.

{AMIRA} - AI Manufacturing Analytics

R/Shiny/LLM
AMIRA AI Manufacturing Analytics platform
R Shiny Databricks LLM RAG AWS

Enterprise AI platform serving biopharma manufacturing — 59% deviation reduction, 70% token optimization via hybrid RAG.

Horizon CAPA Dashboard

R/Shiny/Text-to-SQL
Horizon CAPA Dashboard interface
R Shiny Text-to-SQL GPT-5 AWS

AI-powered text-to-SQL enabling quality teams to query CAPA data in plain English — replacing manual report generation.

MIA - Multimodal Intelligent Assistant

R/Shiny
MIA Multimodal Intelligent Assistant application interface
R Shiny OpenAI AWS GenAI

AI learning assistant processing 4 input modalities — text, image, voice, and documents — with persistent memory.

PennyTrail

Next.js / TypeScript
PennyTrail expense tracker application
Next.js 16 TypeScript Tailwind v4 AWS S3 OpenAI

Mobile-first PWA with AI-powered spending insights, 3D-styled UI, and receipt capture.

Shiny App Valuation Toolkit

R/Shiny
Shiny App Valuation Toolkit dashboard
R Shiny COCOMO II Plotly

Parametric cost estimation with 3 analysis modes, scenario comparison, and AI-assisted project planning.

{RIOT} R/Shiny Image OCR App

R/Shiny/AWS
RIOT OCR application landing page
R Shiny AWS OCR

Extracts structured text from images via AWS Textract OCR for downstream analytics pipelines.

TubeScout

Python
TubeScout application landing page
Python Streamlit YouTube API

Discovers high-signal YouTube videos using view-to-subscriber ratio analysis — algorithm-independent discovery.

Voice Assistant AI

Python
Voice Assistant AI application interface
Python Streamlit Speech Recognition

Voice-controlled assistant with speech recognition, real-time financial data, and weather integration.

{roldanpack}

R
roldanpack R package logo and documentation
R Package

Personal R package encapsulating reusable analytics functions and custom visualization themes.

Youtube Video Feed Dashboard

R/Shiny/AWS
Youtube Video Feed Dashboard mobile view
R Shiny AWS YouTube API

Automated video feed dashboard with AWS-hosted data pipeline and YouTube API integration.

Contact Me

Contact Me

Let's get in touch!

Please use this form to
contact me.