Anthropic ran a secret project in early 2024 to scan millions of books for training its Claude AI system. Internal documents unsealed in legal filings last week revealed the effort, which the company wanted to keep quiet. According to The Washington Post, planning documents described the initiative as “Project Panama” and stated the company aimed to “destructively scan all the books in the world.”
Company Sought to Hide Scanning Operation
The internal planning document explicitly said Anthropic did not want the project to become public knowledge. The company bought physical copies of books and scanned them to extract text data. After scanning, the company discarded the physical books.
The destructive scanning method allowed Anthropic to gather training data from a vast library of written works. The project operated while AI companies faced growing scrutiny over how they acquire training data. Many authors and publishers have sued AI firms for using copyrighted materials without permission.
Legal Filings Expose Internal Plans
Court documents filed last week brought Project Panama to light. The unsealed materials showed how the company planned and executed the book-scanning effort. The filings did not specify the exact number of books scanned or the total cost of the operation.
Silicon Valley’s Data Collection Methods
Anthropic’s approach reflects broader patterns in how AI companies build their systems. Firms often buy or collect massive amounts of data to train language models. The practice has sparked debates about copyright, fair use, and intellectual property rights.
Anthropic develops Claude, a chatbot that competes with OpenAI’s ChatGPT and Google’s Gemini. Training such systems requires enormous amounts of text data. Books provide high-quality writing samples that help AI models learn language patterns and factual knowledge.
AI companies argue that training on published works falls under fair use. Authors and publishers counter that such use violates copyright law and deprives creators of compensation. Multiple lawsuits against major AI firms remain pending in U.S. courts.