Clinical Development: Phases I to III

This is Part 5 in The Drug Development Playbook that maps the path from molecule to medicine. In Part 1, we set the roadmap. In Parts 2 through 4, we covered target selection, hit discovery and lead optimization, and preclinical IND readiness. In this article, we move into clinical development. I show what each phase must deliver, which design choices matter most, and how to convert early human data into rigorous decisions. Read this if you want to design trials that protect patient safety, generate credible evidence, and keep value in the program.

Key Summary

  • Clinical development tests hypotheses in people. Phase I proves safety and PK, Phase II finds signal and dose, and Phase III proves benefit to regulators and payers.
  • Design choices determine whether a program survives or fails. Quality by design and clear critical to quality factors cut wasted spend and time.
  • Adaptive and model based trials can speed programs but they require simulation, pre specification, and tight governance.

Acronyms used in this article

IND: Investigational New Drug application
FIH: First-in-Human
SAD: Single-Ascending Dose
MAD: Multiple-Ascending Dose
MRSD: Maximum Recommended Starting Dose
MABEL: Minimum Anticipated Biological Effect Level
PK/PD: Pharmacokinetics / Pharmacodynamics
ADME: Absorption, Distribution, Metabolism, Excretion
DDI: Drug-Drug Interaction
GLP: Good Laboratory Practice
GMP: Good Manufacturing Practice
CMC: Chemistry, Manufacturing, and Controls
DMPK: Drug Metabolism and Pharmacokinetics
NOAEL: No Observed Adverse Effect Level
DSMB: Data and Safety Monitoring Board
DMC: Data Monitoring Committee
GCP: Good Clinical Practice
CtQ: Critical to Quality
TPP: Target Product Profile
rNPV: Risk-Adjusted Net Present Value

A compact case study

A biotech ran a tidy Phase I and saw clean PK and a modest PD signal. They rushed into a Phase II with an endpoint that did not match the biology. Recruitment lagged, the signal was diluted in a heterogeneous population, and the trial missed its primary endpoint. Investors lost patience. That company lost more value than the one that paused to refine endpoints and enrich the right patients. Clinical design can destroy or preserve value. Treat it as strategic work, not checklist work.

Framing: what clinical development does for you

Regulators define clinical research as testing an investigational product in humans to evaluate safety and efficacy for a specific indication. Phase I focuses on safety and dose. Phase II focuses on preliminary efficacy and dose selection. Phase III provides definitive evidence for labeling and payer decisions. Strategically, ask three questions:

  • Phase I: Is it safe and how does it behave in humans?
  • Phase II: Does it work in the right patients at realistic doses?
  • Phase III: Will regulators and payers change practice because of this evidence?

Trial design acts as the primary tool to reduce scientific uncertainty, manage safety risk, and protect commercial optionality.

High-level map: goals, sizes, and durations

Use these anchors when you plan or present.

  • Phase I: 20 to 100 participants, months, single or few sites. Goals: safety, tolerability, PK, early PD.
  • Phase II: 100 to 300 patients, 1 to 3 years. Goals: proof of concept and dose finding.
  • Phase III: 300 to 3,000 plus patients, 2 to 5 years or more. Goals: confirmatory evidence for registration and payer adoption.

Phase II remains the main attrition point. Industry analyses put Phase II transition success near 29 percent and overall Phase I to approval probability below 10 percent. That concentration of risk means Phase II design must earn its keep.

Core design principles: quality by design and GCP

ICH E8(R1) promotes quality by design. Identify Critical to Quality factors early. Protect endpoints, inclusion criteria, randomization, and blinding. Keep protocol complexity proportionate to the study’s purpose. ICH E6 spells out Good Clinical Practice obligations around ethics, monitoring, and data integrity. Use both guidelines to build trials that answer decision-critical questions and protect participants.

Phase I: first-in-human and early human studies

Core objective

Prove human safety and describe how the drug behaves in the human body. Get clean PK and early PD data you can use to pick doses for Phase II.

Typical study types and when to use them

  • SAD studies: dose one cohort, observe predefined window, then escalate. Use when you need a rapid safety scan across dose ranges.
  • MAD studies: dose repeated cohorts to characterize accumulation and steady state. Use these when chronic dosing will matter in clinic.
  • Food effect: crossover or parallel arms to test fed versus fasted PK for oral drugs. Do this early if formulation or solubility looks marginal.
  • DDI substudies: include probe substrates when in vitro CYP or transporter work suggests interaction risk.
  • Special designs: microdosing or microtracer studies when you want human PK with minimal exposure, or adaptive PK cohorts when early PK surprises require fast course correction.

Population choice and recruitment

  • Healthy volunteers speed recruitment and reduce variability for basic PK. Use them when risk is low.
  • Patients make sense in oncology, severe immune diseases, and when the mechanism risks irreversible effects.
  • Screen participants carefully for comorbidities, concomitant meds, and prior exposure to similar modalities. That reduces noise and SAE risk.

Safety monitoring and governance

  • Use sentinel dosing at each new level and require pre specified observation windows before dosing remaining cohort members.
  • Instrument continuous ECG monitoring when any cardiac liability exists.
  • Define clinically actionable lab thresholds and SAE reporting timelines up front.
  • Set up 24/7 medical backup during escalation and a small safety review team for daily triage.

PK, PD and translational modeling

  • Collect intensive PK in early cohorts and sparse PK in later cohorts.
  • Run PK/PD analyses in near real time to update exposure targets and escalation rules.
  • Use model based approaches such as Bayesian CRM if you need efficient, statistically principled escalation. These methods require simulations and a statistician comfortable with operating characteristics.

Statistical and operational considerations

  • Define the primary safety stopping rules and all interim checks in the protocol and SAP.
  • Pre plan the sample size by cohort, not by a single overall N. Powering usually does not apply to Phase I safety objectives, but plan for enough subjects to estimate variability for PK/PD modeling.
  • Keep data cleaning tight. Late queries in Phase I can delay dose decisions and inflate risk.

Decision gates and deliverables

  • Gate to Phase II when safety profile matches nonclinical predictions, PK achieves exposures consistent with efficacy targets or allows dose exploration, and any PD biomarker shows expected direction.
  • Deliverables: FIH protocol, SAP, safety monitoring plan, PK/PD analysis plan, integrated safety report, and a Phase I clinical study report with model outputs.

Common pitfalls and how to avoid them

Pitfall: unclear stopping rules create ad hoc decisions. Fix: predefine rules and authority lines in the protocol.

Pitfall: starting dose chosen without coherent PK/PD grounding. Fix: calculate MRSD and MABEL, then reconcile both with PK/PD modeling.

Pitfall: slow lab turnarounds delay escalation. Fix: confirm lab logistics and on site assays before first dose.

Phase II: proof of concept and dose selection

Core objective

Demonstrate a credible efficacy signal while defining dose, regimen, and target population. Phase II should reduce uncertainty enough to decide whether to fund Phase III.

Phase IIa and Phase IIb distinctions

  • Phase IIa: exploratory proof of concept. Use flexible designs, small samples, and biomarker endpoints.
  • Phase IIb: confirm dose response and pick the dose or doses for Phase III. Use formal statistical models and randomized controls.

Design options and when to pick them

  • Parallel randomized controlled trial: best when you have a clear endpoint and recruitment capacity.
  • Dose-response designs (Emax, logistic): use when you need an explicit estimate of the dose effect.
  • MCP-Mod: validated by regulators for dose finding when you expect non linear responses.
  • Adaptive seamless Phase II/III: choose if you want efficiency and can pre specify adaptations, control Type I error, and run robust simulations.

Endpoints and enrichment strategy

  • Choose endpoints tied to mechanism and to meaningful clinical benefit. If no clear clinical endpoint exists, use validated surrogates and explain the validation.
  • Use prognostic or predictive biomarkers to enrich the population if biology suggests heterogeneity. Biomarker selection increases power and improves Phase II to Phase III translation.

Statistical planning and simulations

  • Ask statisticians to simulate a range of true effect sizes, variability, and recruitment scenarios. Use those sims to set sample sizes and interim rules.
  • Pre define futility and efficacy thresholds for any interim looks. If you plan sample size re-estimation, define triggers and impact on Type I error.

Trial operations and logistics

  • Lock down central labs and assay validation early for any PD endpoints. Assay drift kills interpretability.
  • Pre qualify sites for biomarker logistics and special procedures. Train sites on sample handling to avoid pre analytic variability.
  • Build recruitment funnels and pre screen lists. Track screen failure causes to iterate inclusion criteria.

Decision gates and deliverables

  • Gates to Phase III should include pre specified efficacy thresholds or Bayesian predictive probabilities, clear safety tolerability at chosen doses, and operational feasibility metrics.
  • Deliverables: Phase II protocol and SAP, biomarker validation report, exposure response models, interim analysis report, and an updated rNPV model showing Phase III value under plausible outcomes.

Common pitfalls and mitigation

  • Pitfall: endpoint does not reflect biological mechanism. Fix: pilot endpoint validation and consult regulators early.
  • Pitfall: underpowered trial due to optimistic assumptions. Fix: run sensitivity analyses and inflate sample size for plausible variability.
  • Pitfall: biomarker assay not reliable. Fix: lock down analytic validation and run blinded quality checks.

Phase III: pivotal trials and registrational strategy

Phase III must convince regulators and payers.

Core objective

Provide definitive evidence that the intervention improves clinically meaningful outcomes in the intended patient population under conditions close to clinical practice. Secure a label that meets regulatory and payer needs.

Trial scope and endpoint selection

  • Choose primary endpoints that regulators and HTA bodies accept as clinically meaningful. For many chronic diseases these will be morbidity or mortality outcomes, not small surrogate changes.
  • If you use a surrogate, document strong evidence linking the surrogate to patient benefit and discuss this in regulatory meetings.

Trial structure and multiplicity control

  • If you test multiple endpoints, multiple doses, or subgroups, build a hierarchical testing plan or alpha splitting. Pre specification prevents Type I error inflation.
  • Define primary and secondary estimands and align data capture and statistical methods to those estimands.

Global strategy and regional considerations

  • Decide whether to use a single global pivotal trial or separate regional trials that account for local standard of care differences.
  • Plan for bridging studies if regulators require region specific evidence. Anticipate labeling differences and plan dossier language accordingly.

Operational scale and vendor management

  • Scale operations for global logistics. Manage supply chain for investigational product, central labs, imaging cores, and data transfer across regions.
  • Negotiate CRO and CDMO contracts with milestone payments tied to enrollment, data quality, and submission readiness.

Interim analyses and governance

  • If you include interim efficacy analyses, pre specify criteria and use independent DMC oversight. Guard against operational bias by limiting who sees unblinded interim results.
  • Plan statistical methods to adjust for interim looks so that Type I error stays controlled.

Payer and HTA alignment

  • Integrate payer evidence needs into trial design early. Plan head to head comparators when payers require them and include patient reported outcomes where payers consider QoL.
  • Consider co primary or hierarchical endpoints that address both regulatory and reimbursement thresholds.

Decision gates and deliverables

  • Gate for submission when pivotal trial(s) meet primary endpoints per protocol, safety remains acceptable, CMC supports commercial product, and global regulatory strategy aligns.
  • Deliverables: full pivotal protocol and SAP, integrated clinical study reports, pooled safety database, submission-ready modules for clinical and statistical sections, and a payer evidence dossier.

Common pitfalls and mitigation

  • Pitfall: comparator becomes outdated by the time results read out. Fix: horizon scan the field and plan adaptive elements or additional comparator arms if feasible.
  • Pitfall: underpowered subgroup analyses. Fix: power for primary endpoint first and prespecify any subgroup analyses as exploratory unless you can power them.
  • Pitfall: operational drift across regions in endpoint assessment. Fix: centralized training, blinded adjudication committees, and site level monitoring focused on CtQ items.

Adaptive and innovative designs: use them with rigor

Adaptive designs let you update elements based on interim data. They can speed trials and reduce patient exposure to inferior arms. They require pre-specification, simulations to demonstrate Type I error control, strict governance, and independent DMC oversight. Treat adaptive designs as tools that demand planning and discipline.

Safety monitoring, reporting, and signal management

Set up a clear safety governance model early. Define DSMB charters, stopping rules, and expedited reporting logic. Use aggregate approaches to detect real signals and avoid swamping regulators with noise. A clear safety plan reduces regulatory friction and protects participants.

Operational execution: site, CRO, data systems, and recruitment

Operational failures derail good science. Focus on site selection based on past performance and patient access. Use milestone based CRO contracts and KPI dashboards that track enrollment, data quality, and deviations. Invest in validated EDC, ePRO, and centralized labs. Plan recruitment with realistic screen failure assumptions and pre qualified sites.

Major cost drivers include site fees, patient recruitment, data management, central imaging and labs, and product logistics. Budget realistically and update estimates as vendor quotes arrive.

Statistics and multiplicity

Adopt the estimand framework to align trial objectives with analysis. Specify how you will handle treatment discontinuation, rescue medication, and missing data. Pre register primary and sensitivity analyses to increase credibility and avoid post hoc disputes.

Biomarker strategy and companion diagnostics

Treat biomarker and CDx planning as strategic. Decide early if a CDx will limit the label and how that CDx will validate analytically and clinically. Co develop the assay and drug if the test will guide safe and effective use.

Global regulatory strategy

Plan regulatory interactions. Use pre Phase II and end of Phase II advice meetings to align on endpoints and pivotal requirements. Prepare for IND amendments and regional CTA submissions. Anticipate additional data requirements for specific markets when those markets matter for commercial uptake.

Cost and timeline ballparks

Use conservative ranges for planning. Expect Phase I to cost roughly 1 to 5 million USD and take 1 to 2 years including setup. Expect Phase II to cost roughly 7 to 20 million USD. Expect Phase III to cost tens to hundreds of millions depending on size and complexity. Get vendor quotes early and update models frequently.

Decision gates and go no go criteria

Define measurable go no go criteria in advance. Example gate from Phase I to Phase II:

  • Safety: no unexpected serious toxicities inconsistent with preclinical data.
  • PK: exposure matches predictions or allows exploration of efficacious ranges.
  • PD: evidence of target engagement at tolerable doses.
  • Feasibility: site and recruitment metrics justify scaling.

Score these criteria on a heatmap and link outcomes to rNPV scenarios to make decisions transparent.

Common failure modes and mitigation

Programs fail because teams pick weak endpoints, miss enrichment opportunities, underpower studies, or underestimate operational risk. Prevent failures by stress testing protocols, running feasibility pilots, building contingency budgets, and keeping critical decisions in house.

Consulting playbook: what a consultant does differently

A consultant adds value by turning trials into portfolio decisions. You should:

  • Model rNPV under alternative designs and enrollment scenarios.
  • Pressure test endpoints, estimands, and power assumptions.
  • Craft milestone based CRO contracts and KPI dashboards.
  • Align clinical design with regulator and payer needs.
  • Coordinate biomarker and CDx development to avoid downstream bottlenecks.

Good consulting makes uncertainty visible and actionable.

Next article in the series

Part 6 is where the playbook shifts from “generating evidence” to “getting a decision.” Regulators are no longer asking “Is this molecule interesting?” but “Is there enough, of the right kind, to justify this label, this risk profile, and this manufacturing process for real‑world use?”

Parts of this series

References

  • US Food and Drug Administration. The Drug Development Process: Step 3 – Clinical Research (overview).
  • FDA. Guidance for Industry: Estimating the Maximum Safe Starting Dose in Initial Clinical Trials for Therapeutics in Adult Healthy Volunteers (MRSD guidance). July 2005.
  • EMA. Strategies to Identify and Mitigate Risks for First-in-Human and Early Clinical Trials. (Revised FIH guidance, 2017).
  • FDA. Adaptive Designs for Clinical Trials of Drugs and Biologics. Guidance for Industry (finalized 2019).
  • EMA. From laboratory to patient – the journey of a centrally authorised medicine (lifecycle overview).
  • ICH E8(R1). General considerations for clinical studies, quality by design and critical-to-quality factors.
  • ICH E6(R2/R3). Good Clinical Practice.
  • ICH E9(R1). Statistical Principles and the Estimand Framework.
  • ICH M3(R2). Nonclinical Safety Studies for the Conduct of Human Clinical Trials and Marketing Authorization.
  • ICH S7A / S7B. Safety pharmacology and nonclinical evaluation of QT/QTc and proarrhythmic risk.
  • ICH S6(R1). Preclinical safety evaluation of biotechnology-derived pharmaceuticals.
  • ICH Q1A(R2). Stability testing of new drug substances and products.
  • ICH Q2(R1). Validation of analytical procedures.
  • FDA. Clinical Trials Guidance Documents index and related resources.
  • EMA and FDA documents on adaptive designs, seamless trials and adaptive features.
  • EMA / European discussion on MCP-Mod and model-based dose finding (see EMA scientific advice pages and MCP-Mod literature cited in guidance).
  • Refs and guidelines on nonclinical safety expectations (GLP pivotal tox, safety pharmacology, genotoxicity, DART), summarized in ICH M3(R2) and S6(R1).
  • ICH E14/S7B Questions and Answers and integrated cardiac safety material.
  • Guidance and review on PK/PD modelling, translational biomarkers and MABEL approaches (regulatory and review articles cited under EMA and FDA FIH guidance).
  • ICH Q1A, Q2 and related CMC guidance pages for analytical methods, stability, and method validation.
  • EMA and FDA pages on GMP expectations for clinical material and early CMC requirements (ICH Q8-Q10 family for quality systems).
  • BIO / Informa Pharma Intelligence. Clinical Development Success Rates and Contributing Factors 2011–2020 (phase transition probabilities, attrition funnel).
  • Tufts Center for the Study of Drug Development. Cost estimates for developing a new prescription drug (Tufts CSDD reports estimating approx 2.6 billion USD including cost of failures).
  • Wong CH, Siah KW, Lo AW. Estimation and analysis of clinical success rates for investigational drugs. Nature Reviews Drug Discovery (review of success rates and methodology).
  • Recent market analyses reporting slight changes in LOA and updated pipelines (Norstella / Citeline industry analyses).
  • Nelson MR, et al. Studies showing increased likelihood of success with genetically supported drug targets (examples and analyses).
  • King EA, et al. / related analyses on genetic support and drug discovery outcomes.
  • 2024 Nature analysis showing higher probability of success when mechanisms have genetic support (summary analyses and numbers around 2.6x uplift).
  • Rees DC, Congreve M, Murray CW, Carr R. Fragment-based lead discovery. Nature Reviews Drug Discovery. 2004.
  • Carr RAE et al. Fragment-based lead discovery: leads by design. Reviews on FBLD.
  • Szymański P et al. Review on high-throughput screening and assay technologies.
  • Reviews on structure-based drug design, virtual screening and integration of biophysics with HTS.
  • Reviews and methodological sources on in vitro ADME, microsomal stability, Caco-2/PAMPA, CYP phenotyping and DDI testing. (common academic reviews and professional guidance from regulatory pages).
  • ICH S7B and related E14 Q&A on cardiac safety and hERG testing.
  • FDA guidance on In Vitro Companion Diagnostic Devices; CDx co-development advice for sponsors.
  • EMA materials and scientific advice pages on biomarker and companion diagnostic development.
  • Industry white papers and reviews on CRO selection, milestone-based contracting, KPI use, and risk-based monitoring (FDA clinical trials guidance index, CRO benchmarking reports).
  • Reviews of adaptive designs, modern dose finding, and model-based approaches.
  • Case study composites and regulatory hold examples discussed in EMA/FDA public pages and regulatory news summaries.

Comments

6 responses to “Clinical Development: Phases I to III”

  1. From Idea to Patient: How Drug Development Actually Works – Aryan Kenia Avatar

    […] Part 5 – Clinical Development: Phases I to III […]

    Like

  2. Disease Understanding and Target Identification: Picking the Biology Worth Backing – Aryan Kenia Avatar

    […] Preclinical Development and IND/CTA Readiness: making the candidate safe enough to test in peoplePart 5 – Clinical Development: Phases I to IIIPart 6 – Regulatory Submission and ReviewPart 7 – Manufacturing Scale-up and Quality […]

    Like

  3. Preclinical Development and IND/CTA Readiness: making the candidate safe enough to test in people – Aryan Kenia Avatar

    […] Part 5 talks about the transition to in-human clinical trials and what happens from phase 1 to 3. […]

    Like

  4. From Pivotal Data to Approval: The Regulatory Review Strategy – Aryan Kenia Avatar

    […] Part 5 – Clinical Development: Phases I to III […]

    Like

Leave a reply to From Idea to Patient: How Drug Development Actually Works – Aryan Kenia Cancel reply