Google launched a new search engine for the scientific community that will help them make sense of millions of datasets present online.
The service, called Dataset Search, will help scientists, data journalists and geeks find the data required for their work and their stories.
The new service will work like Google Scholar, the company’s popular search engine for reports and academic studies.
“Dataset Search lets you find datasets wherever they’re hosted, whether it’s a publisher’s site, a digital library, or an author’s personal web page,” Natasha Noy, Research Scientist, Google AI, said in a blog post.
To create Dataset search, Google developed guidelines for dataset providers to describe their data in a way that the company (and other search engines) can better understand the content of their pages.
Natasha said”These guidelines include salient information about datasets: who created the dataset, when it was published, how the data was collected, what the terms are for using the data, etc,” .
Google then collects and links this information, analyses where different versions of the same dataset might be, and finds publications that may be describing or discussing the dataset.
Google told “We encourage dataset providers, large and small, to adopt this common standard so that all datasets are part of this robust ecosystem,”.
People can find references to most datasets in environmental and social sciences, as well as data from other disciplines including government data and data provided by news organisations, such as ProPublica.
Dataset Search works in multiple languages with support for additional languages coming soon, said Google.