# Multi-Round Audit Report: cs-ai/peft-methods

Generated: 2026-06-07T10:08:34+00:00

## Verdict: needs work

## Submission readiness

- Status: blocked
- Requirement mode: draft audit with final-readiness blockers
- Blocker: No full-text supported claims found in brief.md; current claims are draft/preliminary only.
- Blocker: 2 evidence row(s) have unclear source depth; mark them full-text verified or preliminary.
- Blocker: 5 preliminary-linked claim(s) remain; do not promote to final support.
- Blocker: 1 full-text supported row(s) do not surface their source_quote in the paper body: 2605.08177v1.
- Blocker: Demo proof score 0.6 below required floor 0.8 (3/5 claims independently re-verified against cached PDFs).
- Blocker: Claim not independently re-verified: 2506.11042v2 (overlap=1.0, substring=True).
- Blocker: Claim not independently re-verified: 2501.13787v1 (overlap=1.0, substring=True).
- Blocker: correctness detail: 2506.11042v2: missing source_quote/page/checked_at
- Blocker: correctness detail: 2501.13787v1: missing source_quote/page/checked_at

## Audit artifacts

- Run directory: `workspace/cs-ai/peft-methods/audit_runs/2026-06-07T10-08-34-00-00`
- Per-round JSON: `round_1.json` ... `round_13.json`
- Input hashes: `input_hashes.json`

| Round | Check | Verdict | Issues | Warnings |
| ---: | --- | --- | ---: | ---: |
| 1 | Ledger integrity | needs work | 0 | 1 |
| 2 | Evidence depth and numerical discipline | needs work | 0 | 1 |
| 3 | Paper quality and framing | pass | 0 | 0 |
| 4 | Coverage, taxonomy leakage, and missing-literature risk | pass | 0 | 0 |
| 5 | Claim calibration and submission readiness | needs work | 0 | 1 |
| 6 | Positive-signal floor | pass | 0 | 0 |
| 7 | Academic format and scholarly correctness | needs work | 0 | 1 |
| 8 | Demo and proof (independent re-verification) | needs work | 0 | 3 |
| 9 | Direction coherence (anti-boilerplate-leak) | pass | 0 | 0 |
| 10 | Research-value (gap/contradiction/surprise/recency) | pass | 0 | 0 |
| 11 | System correctness (claim→quote→PDF) | needs work | 0 | 2 |
| 12 | Cross-model reviewer committee | pass | 0 | 0 |
| 13 | Citation integrity (cited→cached metadata) | pass | 0 | 0 |

## Evidence profile

- Filled evidence rows: 5
- Full-text verified rows: 3
- Preliminary / abstract-derived rows: 0
- Source-depth unclear rows: 2

## Round details

### Round 1: Ledger integrity — needs work

**Warnings**

- No full-text supported claims found in brief.md; current claims are draft/preliminary only.

**Notes**

- claim_rows=5
- supported_claims=0
- preliminary_claims=5
- filled_evidence_rows=5
- This round checks structure and status calibration: supported means full-text verified; preliminary-linked means traceable draft evidence.

### Round 2: Evidence depth and numerical discipline — needs work

**Warnings**

- 2 evidence row(s) have unclear source depth; mark them full-text verified or preliminary.

**Notes**

- full_text_verified=3/5
- preliminary_or_abstract=0/5
- unclear_source_depth=2/5

### Round 3: Paper quality and framing — pass

**Notes**

- paper=workspace/cs-ai/peft-methods/paper/main.md
- finding_sections=3
- filled_evidence_rows=5

### Round 4: Coverage, taxonomy leakage, and missing-literature risk — pass

**Notes**

- triage_rows=5
- claimed_evidence_rows=5
- target_categories=cs.AI, cs.CL, cs.LG
- Coverage gaps still require human/domain reviewer search beyond arXiv metadata.

### Round 5: Claim calibration and submission readiness — needs work

**Warnings**

- 5 preliminary-linked claim(s) remain; do not promote to final support.

**Notes**

- claim_rows=5
- supported_claims=0
- preliminary_claims=5
- draft_only_claims=0
- unsupported_claims=0

### Round 6: Positive-signal floor — pass

**Notes**

- numeric_result_rows=5/5 (floor=2)
- comparative_rows=5/5 (floor=1)
- unique_cited_papers=5 (floor=3)
- correctness_score=0.75 (floor=0.5)
- novelty_score=0.97 (floor=0.35)

### Round 7: Academic format and scholarly correctness — needs work

**Warnings**

- 1 full-text supported row(s) do not surface their source_quote in the paper body: 2605.08177v1.

**Notes**

- paper=workspace/cs-ai/peft-methods/paper/main.md
- abstract_words=224
- total_words=3243
- references_listed=5
- missing_format_sections=none

### Round 8: Demo and proof (independent re-verification) — needs work

**Warnings**

- Demo proof score 0.6 below required floor 0.8 (3/5 claims independently re-verified against cached PDFs).
- Claim not independently re-verified: 2506.11042v2 (overlap=1.0, substring=True).
- Claim not independently re-verified: 2501.13787v1 (overlap=1.0, substring=True).

**Notes**

- demo=workspace/cs-ai/peft-methods/paper/demo.py
- proof=workspace/cs-ai/peft-methods/paper/proof.json
- proof_score=0.6
- passed=3/5
- verdict=pass

### Round 9: Direction coherence (anti-boilerplate-leak) — pass

**Notes**

- direction_id=unknown
- family=peft-methods
- keywords_checked=0
- keyword_hits=0
- cross_family_leaks=0

### Round 10: Research-value (gap/contradiction/surprise/recency) — pass

**Notes**

- value_score=0.94 threshold=0.35
- gap_count=88 contradictions=30 surprises=28 recent_papers=3/5
- components gap=1.0 contradiction=1.0 surprise=1.0 recency=0.6

### Round 11: System correctness (claim→quote→PDF) — needs work

**Warnings**

- correctness detail: 2506.11042v2: missing source_quote/page/checked_at
- correctness detail: 2501.13787v1: missing source_quote/page/checked_at

**Notes**

- correctness_score=0.75 floor=0.55
- rows_scored=5
- pdfs_missing=0
- quote_in_pdf_avg=1.0
- claim_support_avg=0.433
- locator_present_avg=0.6

### Round 12: Cross-model reviewer committee — pass

**Notes**

- LLM disabled — cross-model jury skipped (deterministic baseline).

### Round 13: Citation integrity (cited→cached metadata) — pass

**Notes**

- citations_checked=5
- fabricated=0 year_mismatch=0 title_drift=0
- cached_corpus_size=30
- This round is deterministic: it cross-checks printed citations against cached arXiv metadata only.

## Interpretation

- `pass` means the deterministic audit found no structural, source-depth, taxonomy, or paper-quality warnings.
- `needs work` means the draft is traceable but still needs full-text/source-depth, taxonomy, or quality cleanup.
- `unsupported` means claims or required artifacts are missing or inconsistent enough to block trust.
- `submission_readiness=blocked` means the draft must not be treated as final or deployed as ready, even if it is useful as a transparent draft.
