Astraea Technical & Regulatory FAQ | Part 11, CDISC & Validation

Question 1

How does Astraea satisfy 21 CFR Part 11 for the records it generates?

Accepted Answer

Astraea treats Part 11 as a design constraint on every record and signature FDA relies on: computer-generated, time-stamped audit trails; role-based access and authority checks; risk-based system validation; secure record retention and retrieval; and electronic signatures linked to their records. Because Astraea runs inside your environment, the predicate-rule records it produces inherit those controls rather than requiring a separate compliance layer bolted on afterward.

Question 2

Do Astraea's outputs meet ALCOA+ data integrity expectations?

Accepted Answer

Yes. Every derived value is Attributable, Legible, Contemporaneous, Original, and Accurate, and the 'plus' attributes — Complete, Consistent, Enduring, and Available — follow from a versioned, retained audit trail that can be extracted on demand during monitoring or inspection.

Question 3

What exactly does the audit trail capture, and can it be altered?

Accepted Answer

It captures the who, what, when, and prior value for creation, modification, and deletion of critical data, including AI-proposed actions and the human decision that accepted, corrected, or rejected them. Entries are time-stamped against a controlled clock, attributable to an identified individual or agent, and write-protected so they cannot be edited or overwritten after the fact.

Question 4

How are electronic signatures implemented?

Accepted Answer

Signed records carry the signer's printed name, the date and time of signing, and the meaning of the signature per 21 CFR 11.50, are cryptographically bound to their records under 11.70 so they cannot be excised or transferred, and non-biometric signatures use at least two distinct identification components consistent with 11.100/11.200.

Question 5

How does Astraea generate SDTM-conformant domains?

Accepted Answer

Astraea's standards-mapping agents map collected study data to SDTM domains using the appropriate SDTMIG version and CDISC Controlled Terminology, respecting domain structure and variable roles. Mappings are proposed with rationale and confirmed by your programmers, and datasets are designed to pass conformance checks before moving downstream.

Question 6

How are ADaM datasets derived, and is traceability preserved?

Accepted Answer

ADaM datasets are derived from SDTM inputs and the statistical analysis plan following ADaM principles — analysis-ready structure, documented derivations, and metadata-driven traceability back to SDTM. Astraea preserves the SDTM-to-ADaM lineage variable by variable so any analysis value can be traced to its source.

Question 7

Does Astraea produce Define-XML and reviewer's guide content?

Accepted Answer

Yes. Astraea generates Define-XML v2.x describing datasets, variables, controlled terms, value-level metadata, and derivations, plus dataset-level documentation and data reviewer's guide content. Define-XML is FDA- and PMDA-required for every study in a submission, so it is produced as a first-class output.

Question 8

Can Astraea reconcile legacy or non-standard source data?

Accepted Answer

Yes. Astraea's annotation and standards agents reconcile heterogeneous, legacy, and non-standard formats into CDISC-conformant structures, surfacing ambiguous mappings for human adjudication and capturing every reconciliation in the audit trail.

Question 9

How does Astraea maintain end-to-end data provenance?

Accepted Answer

Every transformation from raw source through SDTM, ADaM, and into TFLs is recorded as a linked, versioned lineage. Any figure or analysis value can be traced backward to the SDTM record and original source, and forward to every artifact that consumed it. Provenance is a structural property of the pipeline, not documentation assembled at the end.

Question 10

Is the pipeline reproducible for an inspector or independent reviewer?

Accepted Answer

Yes. Derivations are deterministic and versioned against specific inputs, SAP logic, and standards versions, so a given dataset or output can be regenerated and independently reconciled. Reproducibility plus preserved lineage lets a reviewer confirm that what was submitted is exactly what the source data and analysis plan support.

Question 11

Who is accountable for AI-generated tables, figures, and listings?

Accepted Answer

Your qualified team is. Astraea proposes shells, programs, and outputs and makes them fully auditable, but statistical sign-off and regulatory accountability remain with your biostatisticians and programmers, exactly where your SOPs and regulatory obligations require them.

Question 12

How do you prevent unreviewed AI output from reaching a submission?

Accepted Answer

Critical outputs cannot advance on machine confidence alone. Automated edit checks and conformance rules run first, then a required human validation gate must be cleared before an output is accepted. Uncertain cases are escalated for review rather than pushed through silently, and the review state is enforced by the workflow.

Question 13

What is behind the 99%+ precision figure, and how should we read it?

Accepted Answer

It refers to validated outputs — results that have passed both automated checks and human quality control — not raw, unreviewed model output. The figure reflects a system designed with biostatisticians and clinical programmers where AI accelerates execution and experts remain the final quality gate.

Question 14

How is Astraea validated as a computerized system?

Accepted Answer

Astraea follows a risk-based approach to computerized system validation aligned with GAMP 5, FDA's Part 11 Scope and Application guidance, and Computer Software Assurance (CSA) thinking. Validation concentrates on functions with the greatest impact on data integrity and patient safety, with intended use, controls, and testing evidence documented for inspection.

Question 15

Does Astraea ever see or hold our patient data?

Accepted Answer

No. Astraea is software your team runs inside your own environment — not a web-hosted service that ingests your data, and not a CRO that runs the work for you. Your proprietary study data and PHI stay within your security boundary, under your access controls and data-residency requirements.

Built for the people accountable for the data.

Compliance, standards, and oversight — in depth.

21 CFR Part 11 & Data Integrity

CDISC Standards — SDTM, ADaM & Define-XML

Data Provenance & Traceability

Human Oversight of AI-Generated TFLs

Validation, Deployment & Security

Want to go deeper with our team?