CBC News analysis finds thousands of Canadian authors, books in controversial dataset used to train AI
A CBC News investigation has found at least 2,500 copyrighted books written by more than 1,200 Canadian and Québécois authors were shared online as part of a massive — and now defunct — dataset used to train artificial intelligence.
![CBC News analysis finds thousands of Canadian authors, books in controversial dataset used to train AI](https://i.cbc.ca/1.7050338.1701840694!/fileImage/httpImage/image.jpg_gen/derivatives/16x9_620/canadianauthors.jpg)
![Margaret Atwood, Gordon Korman and Alice Munro were among the Canadian authors whose books were found in a massive — and now defunct — collection of data shared online and used to train artificial intelligence. A composite image of Canadian authors, from left, Margaret Atwood, Gordon Korman and Alice Munro.](https://i.cbc.ca/1.7050338.1701840694!/fileImage/httpImage/image.jpg_gen/derivatives/16x9_620/canadianauthors.jpg)
A CBC News investigation has found at least 2,500 copyrighted books written by more than 1,200 Canadian and Québécois authors were shared online as part of a massive — and now defunct — dataset used to train artificial intelligence.