Hugging Face Daily Papers · · 5 min read

AUDITFLOW: Executable Symbolic Environments for Structured Financial Reporting Verification

Mirrored from Hugging Face Daily Papers for archival readability. Support the source by reading on the original site.

Structured financial audit verification is difficult for language-model agents because correctness depends on structured evidence rather than text alone. A model must link reported facts to taxonomy concepts, traverse calculation or dimensional relations, and recompute expected values before applying an audit rule. We propose AuditFlow, a graph-grounded multi-agent framework that separates adaptive search from deterministic verification. AuditFlow builds a symbolic environment from a static US-GAAP taxonomy graph and a dynamic XBRL filing graph, and exposes it through typed tools for fact retrieval, taxonomy traversal, numerical checking, and rule evaluation. Two junior auditors inspect each case from regulatory and evidentiary views, while a senior auditor resolves disagreements and can request further investigation. The final reports are fused through evidential aggregation to produce an audit verdict, expected value, evidence trail, and trustworthiness score. On a FinAuditing-derived FinMR sample, AuditFlow reaches 82.09% joint audit accuracy under GPT-5.5, outperforming the strongest baseline by 14.93 points. Removing deterministic checks drops accuracy to 17.91%, showing that the symbolic environment performs the verification step that the model cannot reliably replace.</p>\n","updatedAt":"2026-06-04T02:28:17.551Z","author":{"_id":"65d76cc5b9b7b8bf88faa916","avatarUrl":"/avatars/d95232cd0c307efab6197ade1a66190b.svg","fullname":"Yan Wang","name":"YanAdjeNole","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":6,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8978923559188843},"editors":["YanAdjeNole"],"editorAvatarUrls":["/avatars/d95232cd0c307efab6197ade1a66190b.svg"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2606.03031","authors":[{"_id":"6a20e27f15100c5272a846b1","name":"Yan Wang","hidden":false},{"_id":"6a20e27f15100c5272a846b2","name":"Xuguang Ai","hidden":false},{"_id":"6a20e27f15100c5272a846b3","name":"Jaisal Patel","hidden":false},{"_id":"6a20e27f15100c5272a846b4","name":"Xueqing Peng","hidden":false},{"_id":"6a20e27f15100c5272a846b5","name":"Fengran Mo","hidden":false},{"_id":"6a20e27f15100c5272a846b6","name":"Yupeng Cao","hidden":false},{"_id":"6a20e27f15100c5272a846b7","name":"Haohang Li","hidden":false},{"_id":"6a20e27f15100c5272a846b8","name":"Mingyu Cao","hidden":false},{"_id":"6a20e27f15100c5272a846b9","name":"Lingfei Qian","hidden":false},{"_id":"6a20e27f15100c5272a846ba","name":"Víctor Gutiérrez-Basulto","hidden":false}],"publishedAt":"2026-06-02T00:00:00.000Z","submittedOnDailyAt":"2026-06-04T00:00:00.000Z","title":"AUDITFLOW: Executable Symbolic Environments for Structured Financial Reporting Verification","submittedOnDailyBy":{"_id":"65d76cc5b9b7b8bf88faa916","avatarUrl":"/avatars/d95232cd0c307efab6197ade1a66190b.svg","isPro":true,"fullname":"Yan Wang","user":"YanAdjeNole","type":"user","name":"YanAdjeNole"},"summary":"Structured financial audit verification is difficult for language-model agents because correctness depends on structured evidence rather than text alone. A model must link reported facts to taxonomy concepts, traverse calculation or dimensional relations, and recompute expected values before applying an audit rule. We propose AuditFlow, a graph-grounded multi-agent framework that separates adaptive search from deterministic verification. AuditFlow builds a symbolic environment from a static US-GAAP taxonomy graph and a dynamic XBRL filing graph, and exposes it through typed tools for fact retrieval, taxonomy traversal, numerical checking, and rule evaluation. Two junior auditors inspect each case from regulatory and evidentiary views, while a senior auditor resolves disagreements and can request further investigation. The final reports are fused through evidential aggregation to produce an audit verdict, expected value, evidence trail, and trustworthiness score. On a FinAuditing-derived FinMR sample, AuditFlow reaches 82.09% joint audit accuracy under GPT-5.5, outperforming the strongest baseline by 14.93 points. Removing deterministic checks drops accuracy to 17.91%, showing that the symbolic environment performs the verification step that the model cannot reliably replace.","upvotes":6,"discussionId":"6a20e27f15100c5272a846bb","organization":{"_id":"658f4413674349122c0708e9","name":"TheFinAI","fullname":"The Fin AI","avatar":"https://cdn-avatars.huggingface.co/v1/production/uploads/63b58ed5889aa6707f0bb0f4/ZK5nQKw34W3-eH3p4NAYc.jpeg"}},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[{"_id":"65d76cc5b9b7b8bf88faa916","avatarUrl":"/avatars/d95232cd0c307efab6197ade1a66190b.svg","isPro":true,"fullname":"Yan Wang","user":"YanAdjeNole","type":"user"},{"_id":"65d50fd888f4a90589017398","avatarUrl":"/avatars/5e58259399ffe0d4e2e5a8c81fe65b30.svg","isPro":false,"fullname":"Lingfei Qian","user":"lfqian","type":"user"},{"_id":"6335150931a2be3938c99db6","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6335150931a2be3938c99db6/g8pUPvi9ZI2ztW9fwee5_.png","isPro":false,"fullname":"Dokyoon","user":"leeloolee","type":"user"},{"_id":"63b58ed5889aa6707f0bb0f4","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/63b58ed5889aa6707f0bb0f4/znl74_aMswlV8VtHrfj3G.jpeg","isPro":true,"fullname":"Jimin Huang","user":"jiminHuang","type":"user"},{"_id":"698306bb31ae762ba6215b22","avatarUrl":"/avatars/c0b4aae0db80a5e54f08211816c9abd9.svg","isPro":false,"fullname":"Fatima Al-Mansouri","user":"post-and-run","type":"user"},{"_id":"620783f24e28382272337ba4","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/620783f24e28382272337ba4/zkUveQPNiDfYjgGhuFErj.jpeg","isPro":false,"fullname":"GuoLiangTang","user":"Tommy930","type":"user"}],"acceptLanguages":["en"],"dailyPaperRank":0,"organization":{"_id":"658f4413674349122c0708e9","name":"TheFinAI","fullname":"The Fin AI","avatar":"https://cdn-avatars.huggingface.co/v1/production/uploads/63b58ed5889aa6707f0bb0f4/ZK5nQKw34W3-eH3p4NAYc.jpeg"},"markdownContentUrl":"https://huggingface.co/buckets/huggingchat/papers-content/resolve/2606/2606.03031.md"}">
Papers
arxiv:2606.03031

AUDITFLOW: Executable Symbolic Environments for Structured Financial Reporting Verification

Published on Jun 2
· Submitted by
Yan Wang
on Jun 4
Authors:
,
,
,
,
,
,
,
,
,

Abstract

Structured financial audit verification is difficult for language-model agents because correctness depends on structured evidence rather than text alone. A model must link reported facts to taxonomy concepts, traverse calculation or dimensional relations, and recompute expected values before applying an audit rule. We propose AuditFlow, a graph-grounded multi-agent framework that separates adaptive search from deterministic verification. AuditFlow builds a symbolic environment from a static US-GAAP taxonomy graph and a dynamic XBRL filing graph, and exposes it through typed tools for fact retrieval, taxonomy traversal, numerical checking, and rule evaluation. Two junior auditors inspect each case from regulatory and evidentiary views, while a senior auditor resolves disagreements and can request further investigation. The final reports are fused through evidential aggregation to produce an audit verdict, expected value, evidence trail, and trustworthiness score. On a FinAuditing-derived FinMR sample, AuditFlow reaches 82.09% joint audit accuracy under GPT-5.5, outperforming the strongest baseline by 14.93 points. Removing deterministic checks drops accuracy to 17.91%, showing that the symbolic environment performs the verification step that the model cannot reliably replace.

Community

Paper submitter about 7 hours ago

Structured financial audit verification is difficult for language-model agents because correctness depends on structured evidence rather than text alone. A model must link reported facts to taxonomy concepts, traverse calculation or dimensional relations, and recompute expected values before applying an audit rule. We propose AuditFlow, a graph-grounded multi-agent framework that separates adaptive search from deterministic verification. AuditFlow builds a symbolic environment from a static US-GAAP taxonomy graph and a dynamic XBRL filing graph, and exposes it through typed tools for fact retrieval, taxonomy traversal, numerical checking, and rule evaluation. Two junior auditors inspect each case from regulatory and evidentiary views, while a senior auditor resolves disagreements and can request further investigation. The final reports are fused through evidential aggregation to produce an audit verdict, expected value, evidence trail, and trustworthiness score. On a FinAuditing-derived FinMR sample, AuditFlow reaches 82.09% joint audit accuracy under GPT-5.5, outperforming the strongest baseline by 14.93 points. Removing deterministic checks drops accuracy to 17.91%, showing that the symbolic environment performs the verification step that the model cannot reliably replace.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.
Tap or paste here to upload images

· Sign up or log in to comment

Get this paper in your agent:

hf papers read 2606.03031
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2606.03031 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2606.03031 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2606.03031 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from Hugging Face Daily Papers