1. Overview

The Islamic Primary Source Corpus (IPSC) V3 is a computationally parsed and graded dataset of 449,285 hadith (v3.4 deployed; v3.26 staged adds 8,241 records from 6 more works) drawn from 38,787+ collection-level source entries across 86 classical works. Each hadith record carries a structured chain of transmission (isnad), a separated body text (matn), a computational authenticity grade, and metadata linking to narrator reliability assessments, hidden defect records, and textual parallels.

IPSC V3 is applied AI / data engineering grounded in classical rijāl methodology — not classical mujtahid scholarship. See /provenance for the authoritative _provenanceDisclosure block from corpus-v3/manifest.json.

The corpus provides structured, machine-readable hadith data with a transparent and reproducible grading methodology. Every grade is derived from documented inputs — narrator reliability tiers, chain continuity classification, defect cross-links, and corroboration counts — so that any scholar can trace the reasoning behind any individual grade.

No pre-existing scholarly grades are imported as authoritative. The corpus grades hadith independently from the chain data, then compares its output against existing scholarly opinions where available.

Nine Major Collections

Collection	Compiler	Death (AH)
Sahih al-Bukhari	al-Bukhari	256
Sahih Muslim	Muslim ibn al-Hajjaj	261
Sunan Abi Dawud	Abu Dawud	275
Jami' al-Tirmidhi	al-Tirmidhi	279
Sunan al-Nasa'i	al-Nasa'i	303
Sunan Ibn Majah	Ibn Majah	273
Musnad Ahmad	Ahmad ibn Hanbal	241
Muwatta' Malik	Malik ibn Anas	179
Sunan al-Darimi	al-Darimi	255

Supplementary Collections

Sahih Ibn Hibban, Sahih Ibn Khuzaymah, al-Mustadrak (al-Hakim), al-Sunan al-Kubra (al-Bayhaqi), Musannaf 'Abd al-Razzaq, Musannaf Ibn Abi Shaybah, al-Mu'jam al-Kabir/al-Awsat/al-Saghir (al-Tabarani), Musnad al-Bazzar, Musnad Abu Ya'la, Musnad al-Tayalisi, Sunan al-Daraqutni, Shu'ab al-Iman (al-Bayhaqi), Sharh Ma'ani al-Athar (al-Tahawi), Tafsir al-Tabari, and additional musnad, musannaf, and mu'jam works. Fabrication-detection references: Tanzih al-Shari'ah (Ibn 'Iraq) and al-La'ali al-Masnu'ah (al-Suyuti).

2. Narrator Resolution Pipeline

2.1 Arabic Text Normalization

Harakat removal — all tashkil (fathah, dammah, kasrah, sukun, shaddah, tanwin) stripped
Hamza normalization — أ , إ , آ normalized to bare alif ا
Ta marbuta — terminal ة normalized to ه
Alif maqsura — ى normalized to ي
Whitespace — multiple spaces, zero-width joiners, non-breaking spaces collapsed
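
The five steps above can be sketched in Python roughly as follows. This is a minimal illustration, not the corpus's actual normalizer: the function name `normalize_arabic`, the order of the steps, the choice to delete zero-width characters outright, and the final trim are all assumptions.

```python
import re

# U+064B..U+0652 covers tanwin, fathah, dammah, kasrah, shaddah and sukun.
_HARAKAT = re.compile("[\u064B-\u0652]")
# Alif with hamza above, hamza below, or maddah.
_HAMZA_ALIF = re.compile("[\u0623\u0625\u0622]")
# Ta marbuta not followed by another word character, i.e. word-final.
_FINAL_TA_MARBUTA = re.compile("\u0629(?!\\w)")
# Zero-width non-joiner and joiner (deleted outright: an assumption).
_ZERO_WIDTH = re.compile("[\u200C\u200D]")
# For str patterns, \s also matches the non-breaking space U+00A0.
_WHITESPACE = re.compile(r"\s+")


def normalize_arabic(text: str) -> str:
    """Apply the five normalization steps in a fixed (assumed) order."""
    text = _HARAKAT.sub("", text)                  # strip tashkil first
    text = _HAMZA_ALIF.sub("\u0627", text)         # hamza forms -> bare alif
    text = _FINAL_TA_MARBUTA.sub("\u0647", text)   # terminal ta marbuta -> ha
    text = text.replace("\u0649", "\u064A")        # alif maqsura -> ya
    text = _ZERO_WIDTH.sub("", text)
    return _WHITESPACE.sub(" ", text).strip()
```

Stripping harakat first matters: a ta marbuta followed by tanwin only becomes word-final once the diacritic is gone.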

2.2 Person ID (PID) Assignment

Each narrator position maps to at most one canonical Person ID. PIDs take the form PERSON-NNNNNN (six-digit, zero-padded) for Taqrib narrators and PERSON-6NNNNNNN (eight-digit, prefix 60) for supplementary sources. Ambiguous positions retain null rather than recording a potentially incorrect assignment.
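
A consumer of the data might validate PID strings along these lines. This is only a sketch: `classify_pid` and its return labels are hypothetical, and the eight-digit pattern assumes the supplementary form always begins with the digit 6, as the pattern above is written.

```python
import re
from typing import Optional

_TAQRIB_PID = re.compile(r"PERSON-\d{6}")          # six digits, zero-padded
_SUPPLEMENTARY_PID = re.compile(r"PERSON-6\d{7}")  # eight digits, leading 6 (assumed)


def classify_pid(pid: Optional[str]) -> str:
    """Return a coarse label for a canonicalPersonId value."""
    if pid is None:
        return "unresolved"        # ambiguous positions keep null
    if _TAQRIB_PID.fullmatch(pid):
        return "taqrib"
    if _SUPPLEMENTARY_PID.fullmatch(pid):
        return "supplementary"
    return "invalid"
```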

2.3 NRS Database

The Narrator Reliability Score database contains 27,118 assessed entries (within a broader 37,046-entry narrator index). Sources, in precedence order:

Taqrib al-Tahdhib — Ibn Hajar al-'Asqalani (d. 852 AH) — primary anchor
Tahdhib al-Tahdhib — Ibn Hajar — detailed assessments
Mizan al-I'tidal — al-Dhahabi (d. 748 AH)
al-Thiqat — Ibn Hibban (d. 354 AH)
al-Kamil fi Du'afa' al-Rijal — Ibn 'Adi (d. 365 AH)
al-Jarh wa-l-Ta'dil — Ibn Abi Hatim (d. 327 AH)
al-Tarikh al-Kabir — al-Bukhari (d. 256 AH)
Tarikh Ibn Ma'in — Yahya ibn Ma'in (d. 233 AH)
Lisan al-Mizan — Ibn Hajar

2.4 Resolution Approaches

a) Exact match — normalized name matches exactly one NRS entry

b) Kunyah disambiguation — graph-based contextual resolution using teacher-student network (7,973 nodes, 889,913 directed edges)

c) Companion-end detection — terminal position matching a known sahabi, validated by prophetic attribution formula

d) Relational reference — 'an abihi / 'an jaddihi resolved via genealogy; flagged with quality caps during grading

2.5 Coverage

78.5% of narrator positions carry a resolved PID. An additional 4.9% are structural (collective/anonymous references). The remaining 16.6% are genuine nulls: ambiguous kunyahs, unresolvable relational references, single-name narrators with multiple candidates, and collective references. These are genuine disambiguation gaps; the system does not guess.

Coverage figures are for IPSC v3.4 (currently deployed on Azure). The staged v3.26 release adds 8,241 new records and re-runs the PID tiebreaker pass under the v3.9 LLM-tiebreaker policy, so coverage may shift slightly when v3.26 deploys. See the changelog and provenance page.

3. Assessment Reconciliation

3.1 Taqrib Anchoring

Ibn Hajar's Taqrib al-Tahdhib serves as the primary and authoritative source. When a Taqrib verdict exists, it is never overridden by other sources, reflecting scholarly consensus that Ibn Hajar's Taqrib represents the most careful synthesis of the earlier critical tradition.

3.2 Source Hierarchy

Taqrib al-Tahdhib — if available, final. Never overridden.
Tahdhib al-Tahdhib — detailed discussion when Taqrib absent.
Multiple non-Ibn-Hajar sources — 2+ independent critics agree, consensus adopted.
Single source — adopted with reduced confidence.

When sources conflict, the weaker assessment prevails unless the stronger comes from a higher-hierarchy source.
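
One possible reading of this hierarchy, written as Python, is shown below. It is an interpretation rather than the deployed logic: the source keys (`"taqrib"`, `"tahdhib"`, and so on), the basis labels, and the decision to apply "weaker prevails" only among non-Ibn-Hajar critics who fail to agree are all assumptions. Tiers are integers 1 to 12, where a higher number means a weaker narrator.

```python
from collections import Counter
from typing import Dict, Optional, Tuple


def reconcile(assessments: Dict[str, int]) -> Optional[Tuple[int, str]]:
    """Return (tier, basis) for one narrator, or None if nobody assessed them."""
    if not assessments:
        return None
    if "taqrib" in assessments:                 # final, never overridden
        return assessments["taqrib"], "taqrib-final"
    if "tahdhib" in assessments:                # used when Taqrib is absent
        return assessments["tahdhib"], "tahdhib"
    if len(assessments) == 1:
        (tier,) = assessments.values()
        return tier, "single-source-reduced-confidence"
    votes = Counter(assessments.values())
    agreed = [tier for tier, n in votes.items() if n >= 2]
    if agreed:
        # Most votes wins; on a tie the weaker tier is taken (assumption).
        return max(agreed, key=lambda tier: (votes[tier], tier)), "consensus"
    # No two critics agree: the weaker assessment prevails.
    return max(assessments.values()), "conflict-weaker-prevails"
```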

3.3 Twelve-Tier System

Tier	Arabic	Transliteration	English	Grading Impact
T1	صحابي	Sahabi	Companion	Automatic pass — beyond jarh wa-ta'dil
T2	ثقة متقن	Thiqah mutqin	Very reliable, precise	Supports sahih
T3	ثقة	Thiqah	Reliable	Supports sahih
T4	صدوق	Saduq	Truthful	Supports hasan
T5	صدوق يهم	Saduq yahim	Truthful but errs	Supports hasan
T6	مقبول	Maqbul	Acceptable when supported	Da'if alone; hasan with corroboration
T7	ضعيف / مجهول	Da'if / majhul	Weak / unknown	Da'if
T8	ضعيف جداً	Da'if jiddan	Very weak	Da'if (eligible for taqwiyah)
T9	متروك	Matruk	Abandoned	Very weak — also for anonymous narrators
T10	متروك	Matruk (severe)	Abandoned (severe)	Very weak — corroboration blocked
T11	متهم بالكذب	Muttaham bi-l-kadhib	Accused of lying	Very weak — corroboration blocked
T12	كذاب / وضاع	Kadhdhab / wadda'	Liar / fabricator	Mawdu' (fabricated) — corroboration blocked

Key principle: T1–T3 support sahih. T4–T6 support hasan. T7–T8 produce da'if. T9+ produce very weak or fabricated, and from T10 upward the grade cannot be strengthened by corroboration, because the deficiency lies in 'adalah (moral integrity), not merely dabt (precision).

4. Grading Methodology

The grading engine implements the classical five-condition framework of Ibn al-Salah (Muqaddimah):

Ittisal al-sanad — Chain continuity from chainContinuity field
'Adalat al-ruwat — Narrator uprightness from NRS tier
Dabt al-ruwat — Narrator precision from NRS tier
'Adam al-shudhudh — Absence of anomaly from shudhudh flag
'Adam al-'illah — Absence of hidden defect from crossLinks_ilal and ilalDefectCount

4.1 Resolution Threshold

A hadith is graded only when 50% or more of its narrator positions carry resolved PIDs. Below this, the grade is set to not-graded with computedConfidence: "low".

4.2 Base Grade from Weakest Narrator

Weakest Tier	Base Grade
T1–T3	sahih
T4–T6	hasan
T7–T8	da'if
T9–T11	very-weak
T12	mawdu' (fabricated)
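
Sections 4.1 and 4.2 together amount to a small lookup, sketched here under a few assumptions: tiers are passed as integers 1 to 12 with `None` for an unresolved position, the function name `base_grade` is hypothetical, and grade strings follow the `computedGrade` values listed in the Data Format section.

```python
from typing import List, Optional


def base_grade(tiers: List[Optional[int]]) -> str:
    """Base grade from the weakest resolved narrator in one chain."""
    if not tiers:
        return "not-graded"
    resolved = [t for t in tiers if t is not None]
    # 4.1: grade only when at least 50% of positions carry a resolved PID.
    if len(resolved) / len(tiers) < 0.5:
        return "not-graded"
    weakest = max(resolved)          # higher tier number = weaker narrator
    if weakest <= 3:
        return "sahih"
    if weakest <= 6:
        return "hasan"
    if weakest <= 8:
        return "daif"
    if weakest <= 11:
        return "very-weak"
    return "mawdu"                   # T12
```

For example, `base_grade([1, 3, 5, None])` gives `"hasan"`: three of four positions are resolved and the weakest is T5.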

4.3 Quality Caps

Uncertain resolutions below T4 are capped at T4 (saduq) — benefit of the doubt
Original tier T8 or worse: cap rises to T6 (maqbul)
Phase-5 relational resolutions (father/grandfather) capped at T4 regardless

4.4 Chain Continuity Adjustments

Broken chain (munqati' / mu'allaq): downgraded one level
Uncertain chain with all T1–T3 narrators: conservatively set to hasan
Continuous chain: no adjustment

4.5 Mursal Cap

If chainContinuity = "mursal" and no companion PID is found at the terminal position, the grade is capped at da'if. With 2+ independent supporting chains, a mursal may reach hasan li-ghayrihi.
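
The continuity adjustments of 4.4 and the mursal cap of 4.5 could be combined as in the sketch below. The four-step ladder used for "one level", the label `"munqati"` (which is not among the documented chainContinuity values), and the function names are assumptions. The mursal upgrade through supporting chains belongs to taqwiyah and is not handled here.

```python
LADDER = ["sahih", "hasan", "daif", "very-weak"]  # best to worst (assumed ladder)
BROKEN = {"munqati", "muallaq"}                   # "munqati" is a hypothetical label


def downgrade(grade: str, levels: int = 1) -> str:
    """Move a grade down the ladder, stopping at the bottom rung."""
    if grade not in LADDER:
        return grade
    return LADDER[min(LADDER.index(grade) + levels, len(LADDER) - 1)]


def adjust_for_continuity(grade: str, continuity: str, companion_at_end: bool) -> str:
    if grade not in LADDER:            # mawdu / not-graded pass through untouched
        return grade
    if continuity in BROKEN:
        return downgrade(grade)
    if continuity == "uncertain" and grade == "sahih":
        return "hasan"                 # all T1-T3 narrators, but chain unverified
    if continuity == "mursal" and not companion_at_end:
        cap = LADDER.index("daif")     # never better than da'if
        return LADDER[max(LADDER.index(grade), cap)]
    return grade
```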

4.6 Taqwiyah (Mutual Strengthening)

Da'if + 2+ independent chains → upgraded to hasan li-ghayrihi
Hasan + 3+ independent chains → upgraded to sahih li-ghayrihi
Independence requirement: supporting chains must not share a common bottleneck narrator (madar). For 10+ chains, a square-root discount is applied.
Hard floor: taqwiyah blocked when weakest narrator is T10+. A liar corroborated by other liars does not become truthful.
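
A minimal sketch of these rules follows. It assumes each supporting chain is represented by the PID of its bottleneck narrator, so that distinct PIDs stand in for independent chains; the function names are hypothetical. The square-root discount for 10+ chains is left out because its exact formula is not given here.

```python
from typing import Iterable


def count_independent(madar_pids: Iterable[str]) -> int:
    """Chains that share a madar count once (simplifying assumption)."""
    return len(set(madar_pids))


def apply_taqwiyah(grade: str, weakest_tier: int, independent_chains: int) -> str:
    if weakest_tier >= 10:
        return grade                      # hard floor: corroboration blocked
    if grade == "daif" and independent_chains >= 2:
        return "hasan-li-ghayrihi"
    if grade == "hasan" and independent_chains >= 3:
        return "sahih-li-ghayrihi"
    return grade
```

For instance, three supporting chains that pass through only two distinct bottleneck narrators count as two, which is enough to lift a da'if chain but not a hasan one.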

4.7 Defect Handling

Single 'illah: flagged, confidence reduced, no automatic downgrade
Two+ defect records: downgraded one level
Shudhudh: if flagged and grade is sahih, reduced to hasan
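
The three defect rules can be expressed as below. The order of application (defect count first, then shudhudh), the ladder used for "one level", and the note strings are assumptions made for the sake of the example.

```python
from typing import List, Tuple

LADDER = ["sahih", "hasan", "daif", "very-weak"]  # best to worst (assumed ladder)


def apply_defects(grade: str, ilal_defect_count: int, shudhudh: bool) -> Tuple[str, List[str]]:
    """Return the adjusted grade plus notes explaining what was applied."""
    notes: List[str] = []
    if ilal_defect_count == 1:
        notes.append("single-illah-flagged")      # flag only, no downgrade
    elif ilal_defect_count >= 2 and grade in LADDER:
        grade = LADDER[min(LADDER.index(grade) + 1, len(LADDER) - 1)]
        notes.append("multi-illah-downgrade")
    if shudhudh and grade == "sahih":
        grade = "hasan"
        notes.append("shudhudh-downgrade")
    return grade, notes
```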

4.8 Hawala Handling

The hawala marker (ح) indicates a chain-switch — 8,751 records flagged. The grading engine grades the primary (first) chain only. The secondary chain is noted in autoGradeDetail but does not override.

4.9 Anonymous Narrator Penalty

Collective or anonymous references (nas, rajul, ba'd ashabihi) are assigned T9 (matruk/majhul) because no individual can be identified for reliability assessment.

4.10 Grade Confidence Scoring

Every graded hadith carries a 0.0–1.0 confidence score computed from four weighted components:

Resolution rate (40%) — fraction of narrator positions with resolved PIDs
Chain continuity certainty (30%) — quality of teacher-student pair verification
Chain length (20%) — longer chains provide more data points
Base factor (10%) — minimum confidence floor

Penalties reduce the score for: mudallis 'an'anah without tasrih, ikhtilat presence, temporal plausibility issues, and matn criticism flags. Distribution: ~60,000 hadith at 0.9+, ~75,000 at 0.8–0.9, ~90,000 at 0.7–0.8, ~61,000 at 0.6–0.7, ~64,000 below 0.6.
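
As a rough illustration of the weighting, the score could be assembled as follows. Only the four weights come from the description above. How chain length is scaled (here, saturating at a hypothetical `length_cap` of 6), the size of each penalty, and the clamping to the 0.0–1.0 range are assumptions.

```python
from typing import Iterable


def confidence_score(
    resolution_rate: float,
    continuity_certainty: float,
    chain_length: int,
    penalties: Iterable[float] = (),
    length_cap: int = 6,              # hypothetical saturation point
) -> float:
    """Weighted sum of the four components, minus penalties, clamped to [0, 1]."""
    length_score = min(chain_length / length_cap, 1.0)
    score = (
        0.40 * resolution_rate
        + 0.30 * continuity_certainty
        + 0.20 * length_score
        + 0.10                        # base factor: constant floor
    )
    score -= sum(penalties)           # e.g. mudallis 'an'anah, ikhtilat
    return max(0.0, min(1.0, score))
```

With these assumptions, a three-narrator chain with half its positions resolved and no continuity evidence scores 0.2 + 0.0 + 0.1 + 0.1 = 0.4 before penalties.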

5. Chain Continuity Classification

Classification	Meaning
`continuous`	Standard connected chain — each narrator heard directly from the next, verified by temporal overlap and known teacher-student relationships
`muttasil`	Verified as connected to a companion — initially ambiguous, later confirmed
`likely-continuous`	Probable connection based on death-date overlap and generational proximity, without explicit documentation
`scholarly-verified`	Continuity confirmed by classical scholarship (e.g., al-Mizzi in Tahdhib al-Kamal)
`mursal`	Chain does not reach a companion through verified hearing — a tabi'i reports directly from the Prophet
`muallaq`	Suspended: one or more narrators at the beginning omitted by the compiler
`compilation`	Compiler's own chain or editorial arrangement
`uncertain`	Insufficient data to determine connectivity
`parser-error`	Chain text could not be reliably parsed — receives `not-graded`

Continuity is determined by checking adjacent narrator pairs. Chain break severity = impossible pairs / total pairs. Severity above 0.3 classifies the chain as broken.
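
The severity ratio and the 0.3 threshold translate directly into code. In this sketch each adjacent narrator pair is reduced to a boolean (`True` when the pairing is possible); how that boolean is derived from death dates and the teacher-student graph is outside the example, and the function names are hypothetical.

```python
from typing import Sequence


def chain_break_severity(pair_is_possible: Sequence[bool]) -> float:
    """Impossible pairs divided by total pairs (0.0 for an empty chain)."""
    if not pair_is_possible:
        return 0.0
    impossible = sum(1 for ok in pair_is_possible if not ok)
    return impossible / len(pair_is_possible)


def is_broken(pair_is_possible: Sequence[bool], threshold: float = 0.3) -> bool:
    """A chain is classified as broken when severity exceeds the threshold."""
    return chain_break_severity(pair_is_possible) > threshold
```

One impossible pair out of four gives a severity of 0.25, which stays under the threshold; two out of three gives about 0.67 and marks the chain as broken.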

6. Specialized Enrichments

6a. Transmission Formulas

Tasrih (explicit hearing): haddathana/haddathani, sami'tu, akhbarana/akhbarani, anba'ana — these explicitly indicate direct hearing.

'An'anah (ambiguous): the formula 'an ("from") does not explicitly state direct hearing. When the narrator is a known mudallis of severity 3+, 'an'anah triggers a chain-level flag.

6b. Tadlis Detection

Registry of 105 narrators catalogued with severity levels 1–5, derived from Ibn Hajar's Tabaqat al-Mudalliseen and verified via the Eve-Theology f5/reasoner multi-model pipeline.

Level	Description	Treatment of 'an'anah
1	Rarely practiced tadlis	Accepted
2	Scholars tolerated due to status or rarity	Generally accepted
3	Scholars differed; significant number practiced frequently	Not accepted without tasrih
4	Scholars rejected their 'an'anah altogether	Not accepted
5	Weak narrators who also practiced tadlis	Not accepted

Level 3+ with 'an'anah triggers a one-level downgrade. Note: ~388,000 positions (~21%) have no parsed transmission formula — a parser-level limitation.
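
A check for the downgrade trigger might look like the sketch below. It reads the per-position `formulaType` and `_mudallis` fields described in the Data Format section; the key name `severity` inside `_mudallis` and the function name are assumptions.

```python
from typing import Dict, List


def tadlis_downgrade_needed(chain: List[Dict]) -> bool:
    """True if any severity 3+ mudallis transmits with 'an'anah."""
    for position in chain:
        mudallis = position.get("_mudallis") or {}   # absent for most narrators
        if position.get("formulaType") == "ananah" and mudallis.get("severity", 0) >= 3:
            return True
    return False
```

Positions with no parsed formula never trigger the flag in this sketch, which mirrors the parser-level limitation noted above.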

6c. Ikhtilat (Mental Deterioration)

Structured data on 67 mukhtalit narrators. Each entry records the onset year and lists of students who transmitted before and after the deterioration, sourced from classical biographical literature. When such a narrator is detected in a chain, the record is flagged with _ikhtilat: true.

6d. 'Ilal (Hidden Defects)

16,082 entries cross-linked from al-Daraqutni's al-'Ilal al-Waridah. Defect types: mursal, mawquf-as-marfu', tadlis, wahm (error), da'if chain. Two+ cross-links trigger a one-level downgrade.

6e. Attestation Levels

Level	Chains	Count
gharib	1 (solitary)	—
'aziz	2	—
mashhur	3+	12,209 clusters
mutawatir	Mass-transmitted	1,161 clusters

Additionally, 6,346 common-link clusters (chains converging on a single pivotal transmitter). Attestation is computed at the matn cluster level.

6f. Hadith Entity Layer

54,885 hadith entities aggregate all chains, collections, grades, and attestation data per distinct teaching. Where matn clustering links hadith by textual similarity, the entity layer consolidates them into a single scholarly unit — one teaching with all its chains, grades, and provenance in one record. Stored as ipsc-entities-v3.jsonl.

6g. Quran Cross-Reference

1,279,676 term matches and 6,447 direct quotation matches between hadith and Quranic verses. Term matches identify shared vocabulary between a hadith matn and Quranic text; direct quotation matches detect verbatim Quranic citations within hadith.

6h. Hawala Chain Splitting

Of the 8,751 records flagged with chain-switch markers (ح), 8,258 were split into independent branches. The primary chain is graded independently; the secondary chain is recorded in autoGradeDetail for reference.

7. Content Analysis (Matn Criticism)

A two-pass computational matn criticism architecture — the first of its kind applied at corpus scale.

What It Checks

Quran contradictions
Fabrication patterns
Anachronistic vocabulary
Chain-matn conflicts
Prophetic linguistic baseline deviation

Results

280 likely fabrications identified
91 chain-matn conflicts detected
6,822 known fabrications validated against reference works

Prophetic Linguistic Baseline

Average word count: 39 words. Saj' (rhyming prose) density: 0.033. These baselines help identify texts that deviate significantly from the established prophetic speech patterns.

8. Quality Metrics

8.1 Convergence with Scholarly Consensus

Collection	Convergence
Bukhari (sahih + hasan)	95.9%
Muslim (sahih + hasan)	97.7%
Muwatta' Malik	97.1%

These rates were independently computed; no scholarly grades were imported as authoritative. The Bukhari figure reflects the v3.8 de-inflation: prior releases reported 97.3%, which was inflated by stale supportingChain counts, and the corrected 95.9% reflects the rebuilt cluster-driven taqwiyah. See changelog v3.8 for the de-inflation and audit practice for the wider quality-and-review cycle.

8.2 Integrity Checks

Check	Criterion	Result
Mursal graded sahih	Should never independently receive sahih	0 violations
T10+ in Sahihayn	Abandoned narrators should not appear in Bukhari/Muslim	0 violations
Grade consistency	`computedGrade` = `autoGrade` across a 250-record test set	100% agreement
Top-500 PID audit	Manual verification of 500 most frequent PIDs	436/500 (87.2%)

8.3 Grade Distribution

Grade	Count	%
sahih	47,441	10.6%
sahih li-ghayrihi	111,996	24.9%
hasan	71,160	15.8%
hasan li-ghayrihi	52,208	11.6%
da'if	33,645	7.5%
very-weak	24,859	5.5%
not-graded	98,769	22.0%

Total graded: 350,646 (78.0%). Taqwiyah upgrades: 168,284. Quality caps applied: 139,531. Ilal-flagged: 7,009.

9. Documented Limitations

Narrator Resolution

16.6% null PID rate — genuine disambiguation gaps (ambiguous kunyahs, relational references, single-name narrators)
Death year approximation — for ~39,799 entries, estimated from tabaqah rather than documented sources

Textual Verification

Arabic matn text not collated against critical printed editions
Hawala records flagged and split into independent branches (8,258 of 8,751 records); 493 records flagged but not split due to ambiguous secondary-chain positions

Enrichment Coverage

Matn cluster coverage: 61% — hadith without cluster assignments do not benefit from cross-chain attestation
Mudallis registry covers 105 of ~150+ documented mudalliseen
Ikhtilat database covers 67 narrators — classical sources document dozens more
Shudhudh detection is flag-based, not comprehensive

Methodological Scope

No fiqhi (jurisprudential) context — grading is purely chain-based
Single-chain grading — the corpus does not perform full takhrij

10. Data Format

JSONL format — one JSON object per line. Primary index: ipsc-hadith-v3.jsonl (449,285 records at v3.4 deployed; v3.26 staged adds 8,241 more).

Core Fields

Field	Type	Description
`id`	string	Unique identifier (e.g., `bukhari-sahih-000001`)
`workId`	string	Collection identifier
`collection`	string	Human-readable collection name
`hadithNumber`	string	Number within collection
`arabicText`	string	Full Arabic text (isnad + matn)
`isnad`	string	Separated chain of transmission (Arabic)
`matn`	string	Separated body text (Arabic)
`hadithType`	string	`marfu`, `mawquf`, `maqtu`, `tafsir`, `mawdu`

Isnad Structure

Field	Type	Description
`position`	number	Ordinal position (1 = compiler's source)
`name`	string	Narrator name as it appears in Arabic
`canonicalPersonId`	string\|null	Resolved PID or `null`
`formula`	string	Transmission formula text
`formulaType`	string	`tasrih` or `ananah`
`_nrs`	object	Embedded NRS: tier, label, deathAH, isCompanion
`_mudallis`	object	Severity and requiresTasrih (if applicable)
`_resolvedBy`	string	Resolution method

Grading Fields

Field	Type	Description
`computedGrade`	string	Final grade: sahih, sahih-li-ghayrihi, hasan, hasan-li-ghayrihi, daif, very-weak, mawdu, not-graded
`autoGrade`	string	V3 final regrade pass
`chainContinuity`	string	Connectivity classification
`computedConfidence`	string	high, medium, or low
`gradingNotes`	array	Human-readable grading explanations

Enrichment Flags

Field	Type	Description
`_ikhtilat`	boolean	Mukhtalit narrator in chain
`crossLinks_ilal`	array	Defect record references
`ilalDefectCount`	number	Count of known defects
`shudhudh`	boolean	Textual anomaly detected
`isCompound`	boolean	Contains hawala chain-switch marker
`attestationLevel`	string	gharib, aziz, mashhur, or mutawatir
`resolutionRate`	number	Fraction of positions with resolved PIDs
`clusterId`	string\|null	Matn cluster membership (v3.26 partial coverage 39.2%; full re-cluster deferred to v3.27+)
`_pidTiebreakerVerdict`	object\|null	v3.26 LLM-assigned PID provenance (12,660 narrators) — carries `method: 'v3.26-llm-tiebreaker-sonnet'`, confidence, and reasoning. Not an imām verdict.
`_naqd3Override`	object\|null	v3.26 source-collection cap (e.g., Mawḍūʿāt → very-weak) with named classical authority — public-tier visible
`_chainMatnConflict`	boolean	True when chain grades reliable but matn flagged fabricated per classical consensus — public-tier visible
`_v319MatchAlternatives`	array	v3.19 ingest pipeline narrator-match candidate list (scholar tier)
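
Because the index is plain JSONL, it can be streamed with the standard library alone. The sketch below tallies `computedGrade` values for records above a chosen `resolutionRate`; the helper names are hypothetical, and the two sample rows are made up for illustration rather than taken from the corpus.

```python
import io
import json
from collections import Counter
from typing import IO, Iterator


def iter_records(stream: IO[str]) -> Iterator[dict]:
    """Yield one dict per non-empty JSONL line."""
    for line in stream:
        line = line.strip()
        if line:
            yield json.loads(line)


def grade_counts(stream: IO[str], min_resolution: float = 0.0) -> Counter:
    """Count computedGrade values among sufficiently resolved records."""
    counts: Counter = Counter()
    for record in iter_records(stream):
        if record.get("resolutionRate", 0.0) >= min_resolution:
            counts[record.get("computedGrade", "not-graded")] += 1
    return counts


# Two made-up rows standing in for ipsc-hadith-v3.jsonl.
SAMPLE = io.StringIO(
    '{"id": "example-000001", "computedGrade": "sahih", "resolutionRate": 1.0}\n'
    '\n'
    '{"id": "example-000002", "computedGrade": "daif", "resolutionRate": 0.6}\n'
)
print(grade_counts(SAMPLE, min_resolution=0.8))
```

For the real file, the same functions accept an open file handle in place of the `io.StringIO` sample, so the 449,285 records never need to be held in memory at once.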

Supplementary Indexes

Index	Records	Description
`ipsc-narrators-v3`	37,046	Full narrator database (NRS + biographical)
`ipsc-ilal-v3`	16,082	Hidden defect records from al-Daraqutni
`ipsc-matn-clusters-v3`	52,938	Matn cluster records with English summaries
`ipsc-entities-v3`	54,885	Hadith entity aggregations (all chains per teaching)
`ipsc-presentation-v3`	6	Presentation-layer summary statistics

11. Citation

Individual Hadith

IPSC v3.4 — Islamic Primary Source Corpus. MindHYVE.ai, Inc / Eve-Theology LLC (2026). Hadith [id], graded [computedGrade]. Narrator resolution via NRS v3 (27,118 entries). See _provenanceDisclosure for AI-involvement disclosure. Methodology: docs/methodology-v3.md.

Corpus-Level

MindHYVE.ai, Inc / Eve-Theology LLC. Islamic Primary Source Corpus (IPSC), Version 3.4. 2026. 449,285 hadith across 86 classical works, computationally graded via chain analysis with 27,118 narrator reliability entries. AI-assisted, structurally validated against documented teacher-student relationships, scholar-in-the-loop pending on residue queues.

Methodology

TheoAI / Eve Theology LLC. "IPSC V3 Technical Methodology." 2026. Covers narrator resolution pipeline, twelve-tier assessment reconciliation, five-condition grading algorithm, tadlis detection, ikhtilat flagging, and ilal cross-linking.

Specific Grading Decision

Hadith [id] graded [grade] per IPSC V3 methodology: weakest narrator [worstNarrator] at tier [worstTier] ([worstLabel]), chain continuity: [chainContinuity], supporting chains: [supportingChains]. Taqwiyah: [applied/not applied]. See autoGradeDetail for full provenance.

12. Quality Assurance

The corpus underwent five rounds of adversarial testing, a 33-test automated regression suite, and a six-phase NRS systematic audit — all powered by the Eve-Theology f5/reasoner multi-model pipeline. Every correction cites a specific classical scholarly source.

Provenance Principle

Every computational correction follows a strict rule: the model finds the error; the book is the source of the correction. The Eve-Theology f5/reasoner multi-model pipeline identifies inconsistencies, but every fix is grounded in a named scholarly reference — a Taqrib al-Tahdhib entry number, a Tahdhib al-Tahdhib volume and page, or a specific classical scholar's documented assessment.

Provenance Walkthrough: Bukhari #1

Hadith bukhari-sahih-000001 — "Actions are by intentions" (innama al-a'mal bi-l-niyyat):

Chain parsed into narrator positions, each resolved to a canonical PID
Weakest narrator identified via NRS: all narrators T1–T3 (thiqah or higher)
Each NRS tier anchored to Taqrib al-Tahdhib entry numbers
Chain continuity: continuous — all adjacent pairs verified via teacher-student graph
No 'ilal cross-links, no shudhudh flag, no mudallis concern
Computed grade: sahih — five conditions met, high confidence
47 supporting chains across collections confirm tawatur-level attestation

Every field in autoGradeDetail traces to a specific NRS entry, which traces to a specific Taqrib verdict. The chain is fully auditable.

The record itself

Here is what the Bukhari #1 record actually looks like in the deployed corpus — the JSONL row that any tier of the API returns, with grading rationale exposed:

{
  "id": "bukhari-sahih-000001",
  "source": "Sahih al-Bukhari",
  "hadithNumber": "1",
  "arabicText": "إنما الأعمال بالنيات، وإنما لكل امرئ ما نوى…",
  "englishText": "Actions are but by intentions, and every person shall have only what he intended…",

  "chain": [
    { "position": 0, "name": "al-Humaydī ʿAbd Allāh ibn al-Zubayr",   "pid": "P-007021", "nrsTier": "T1" },
    { "position": 1, "name": "Sufyān ibn ʿUyaynah",                    "pid": "P-002841", "nrsTier": "T1" },
    { "position": 2, "name": "Yaḥyā ibn Saʿīd al-Anṣārī",              "pid": "P-008994", "nrsTier": "T1" },
    { "position": 3, "name": "Muḥammad ibn Ibrāhīm al-Taymī",          "pid": "P-006210", "nrsTier": "T1" },
    { "position": 4, "name": "ʿAlqamah ibn Waqqāṣ al-Laythī",          "pid": "P-001147", "nrsTier": "T2" },
    { "position": 5, "name": "ʿUmar ibn al-Khaṭṭāb (companion)",        "pid": "P-000023", "nrsTier": "Ṣaḥābī" }
  ],

  "computedGrade": "sahih",
  "autoGradeDetail": {
    "worstTier":          "T2",
    "chainContinuity":    "continuous",
    "ilalCrossLinks":     0,
    "shudhuhFlag":        false,
    "mudallisCount":      0,
    "supportingChains":   47,
    "attestationLevel":   "muttafaq-ʿalayh",
    "verifiedBy":         "deterministic+nrs-anchored",
    "rationale":          "Five conditions met; T2 floor at ʿAlqamah, all upstream T1; continuous via teacher-student graph; no defects; 47 supporting chains across collections."
  },

  "_provenanceDisclosure": {
    "aiInvolvement":      "chain-parse, narrator-resolution, grade-computation",
    "humanReview":        "spot-checked in v3.4 regression suite (passed 33/33)",
    "classicalAnchor":    "Taqrīb al-Tahdhīb entries: 3461, 2451, 7574, 5733, 4577",
    "manifestRef":        "corpus-v3/manifest.json"
  }
}

All field semantics are documented in the §10 schema table. Tier and PID values shown above match the deployed v3.4 corpus. Citations to Taqrīb entry numbers in the classicalAnchor field are the named scholarly source per the Provenance Principle.

For the complete quality assurance process — including all five QA rounds, the regression suite, and known issues being addressed — see the Quality Assurance page.