Can LLMs Identify Tax Abuse?

Blair-Stanek, Andrew; Holzenberger, Nils; Van Durme, Benjamin

Can LLMs Identify Tax Abuse?

Andrew Blair-Stanek, Nils Holzenberger and Benjamin Van Durme

Abstract: We investigate whether large language models can discover and analyze U.S. tax-minimization strategies. This real-world domain challenges even seasoned human experts, and progress can reduce tax revenue lost from well-advised, wealthy taxpayers. We evaluate the most advanced LLMs on their ability to (1) interpret and verify tax strategies, (2) fill in gaps in partially specified strategies, and (3) generate complete, end-to-end strategies from scratch. This domain should be of particular interest to the LLM reasoning community: unlike synthetic challenge problems or scientific reasoning tasks, U.S. tax law involves navigating hundreds of thousands of pages of statutes, case law, and administrative guidance, all updated regularly. Notably, LLM-based reasoning identified an entirely novel tax strategy, highlighting these models' potential to revolutionize tax agencies' fight against tax abuse.

Date: 2025-08
New Economics Papers: this item is included in nep-ain, nep-big, nep-pbe and nep-pub
References: Add references at CitEc
Citations:

Downloads: (external link)
http://arxiv.org/pdf/2508.20097 Latest version (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:arx:papers:2508.20097

Access Statistics for this paper

More papers in Papers from arXiv.org
Bibliographic data for series maintained by arXiv administrators ().