Google this time created something very innovative to aid the scientific community and this would help them make sense of the millions of datasets present online.
This particular service, termed as Dataset Search, in the long run, would help scientists, data journalists, and science worshipers, find the data required for their work and their stories, or simply to satisfy their intellectual curiosity.
Also, this latest search engine would work like Google Scholar, i.e.the company’s popular search engine for academic studies and reports.
To this, Natasha Noy, Research Scientist, Google AI, added in a blog post that, “Dataset Search lets you find datasets wherever they’re hosted, whether it’s a publisher’s site, a digital library, or an author’s personal web page”.
Now, to create this Dataset search, Google developed guidelines for dataset providers to describe their data in a way that the company (and other search engines) can better understand the content of their pages.
To this, Noy added that “These guidelines include salient information about datasets: who created the dataset, when it was published, how the data was collected, what the terms are for using the data, etc”.
The next step includes, Google collecting and linking this information, analyses where different versions of the same dataset might be, and finds publications that may be describing or discussing the dataset.
Google also added here that, “We encourage dataset providers, large and small, to adopt this common standard so that all datasets are part of this robust ecosystem”.
People can actually find references to most datasets in environmental and social sciences, as well as data from other disciplines including government data and data provided by news organizations, such as ProPublica.
Lastly, Google confirmed that this Dataset Search works in multiple languages with support for additional languages coming soon.