Recent technological advances in artificial intelligence (AI), especially the rise of generative AI, have raised questions regarding the intellectual property (IP) landscape. As the demand for AI training data surges, certain data collection methods give rise to concerns about the protection of IP and other rights. This report provides an overview of key issues at the intersection of AI and some IP rights. It aims to facilitate a greater understanding of data scraping — a primary method for obtaining AI training data needed to develop many large language models. It analyses data scraping techniques, identifies key stakeholders, and worldwide legal and regulatory responses. Finally, it offers preliminary considerations and potential policy approaches to help guide policymakers in navigating these issues, ensuring that AI’s innovative potential is unleashed while protecting IP and other rights.
Intellectual property issues in artificial intelligence trained on scraped data
Policy paper
Share
Facebook
Twitter
LinkedIn
Abstract
In the same series
-
Working paper
Evidence from selected countries and the European Union
7 May 202658 Pages -
Working paper
Global linkages and the cross‑country distribution of the gains from AI
18 March 202679 Pages -
Working paper
International insights and policy considerations for Italy
11 December 2025100 Pages -
8 December 202543 Pages
Related publications
-
Working paper
Insights from responses to the reporting framework of the Hiroshima AI Process Code of Conduct
25 September 202537 Pages -
17 June 202531 Pages
-
Policy paper3 June 202546 Pages
-
28 February 202526 Pages
-
16 December 202419 Pages