Harvard Law School Library Launches Institutional Data Initiative
Today, the Harvard Law School Library unveiled the Institutional Data Initiative (IDI). This pioneering research project aims to collaborate with libraries, museums, government entities, and various organizations to digitize their collections and publish them as accessible data. Such data is poised to support numerous applications, particularly in training advanced AI models.
Focus Areas of the Initiative
The IDI’s initial efforts will concentrate on enhancing a collection of one million public domain books that have been digitized at the Harvard Library. Additionally, it will partner with the Boston Public Library to transform millions of pages from historical newspapers into usable data formats. While the primary focus is on long-form texts, IDI intends to explore partnerships that encompass a diverse range of data types, including scientific and biomedical datasets.
Support and Funding
This initiative is backed by industry leaders Microsoft and OpenAI, signifying strong institutional support for its goals. Looking ahead, IDI aims to secure sustained funding through collaborations with philanthropic organizations and industry stakeholders.
Statements from Key Stakeholders
Burton Davis, Vice President and Deputy General Counsel at Microsoft, commented on the significance of the initiative:
“Microsoft is proud to support the establishment of the Institutional Data Initiative, which will work to increase access to knowledge and high-quality data for all builders of AI. We are committed to enabling broad access to data and empowering a more inclusive AI ecosystem. Since 2020, we have worked to close the data divide, ensuring that every organization has access to the data it needs to innovate and achieve more, which is essential to growing a vibrant, competitive AI economy.”
Microsoft’s perspective emphasizes the importance of collaboration around open data. In 2020, the company launched the Open Data Campaign, facilitating access to vital data resources for organizations striving to develop AI technologies.
Tom Rubin, Chief of Intellectual Property and Content at OpenAI, also shared his thoughts on IDI:
“Academic institutions have long been key partners in artificial intelligence research and progress, and Harvard’s Institutional Data Initiative is a powerful example of this. The public domain plays a vital role in the spread of knowledge and creativity, and OpenAI is delighted to support this effort. We are inspired by Prof. Zittrain’s leadership throughout this important project and are eager to see its impact.”
Conclusion
By providing open access to extensive datasets, the Institutional Data Initiative is poised to play a significant role in advancing AI technology and ensuring its benefits are accessible to a wider audience.
For more information, visit the source.
Leave a Reply