Google Dataset Search Engine

From energypedia


Gathering and organizing as much information as possible, and easing the access for everybody around the world, has always been Google’s goal[1].

This goal’s first target was the commercial web, yet, the company has recently changed its usual target, and has shifted a bit towards the scientific community[1].

To better serve scientists and researchers, on September 5th, 2018, Google has launched a service called Dataset Search, which is a search engine just for finding datasets on the internet[2][1]. This search engine will act as an additional search tool for scientific communities, along with the company’s Google Scholar, which is the company’s most popular search engine for academic studies and reports[2].


Google Dataset Search Engine

Researchers have usually relied on sources like: the World Bank, NASA, ProPublica, IEA, IRENA or other search engines, but this new tool should make their work much easier, as it is dedicated to help journalists and researchers to easily reach the available public data[3].

Google Dataset Search will allow its users to search through enormous amount of datasets across tens of thousands of repositories on the web, whether they exist on a governmental database, public sources, publisher’s site, a digital library or an author’s personal web page[2][3]. It applies the usual Google’s approach and design for search, but to datasets published across the web[3]. For an example: the searcher can use a typical research words “Solar photovoltaics deployment statistics” to look for statistics and reports on the deployment situation[3].



The initial Dataset Search covers: environmental & social sciences, governmental data and datasets from news organizations, and is expected to expand to include more sources shortly[2][1].

The tool already supports multiple languages, and will be able to support more languages soon, and it is also expected to be used by scientists, journalists, data geeks, and basically anyone who is using data behind a specific topic[3][4].

Relevance for Renewable Energy Community

The importance of this tool for the energy community is clear:

  • you can easily allocate all available public datasets
  • you have easy access to as many possible datasets as possible
  • you can see which format the respecitve dataset it (e.g. XLS)
  • you can detect reliable information by identifying the publisher (e.g. the respective department of a Government)
  • you can access the most recent dataset, as it always displays the last update (e.g. Sept, 27 2018).

It is expected that more datasets will be provided in the future.

Tip: you can also search only for data from one publisher. e.g. the World Bank by searching on their site only. energy


Here are couple of comments from people either working on the project or working with it:

  • “The aim is to unify the tens of thousands of different repositories for datasets online. We want to make the data discoverable, but keep it where it is.” Natasha Noy, a research scientist at Google AU who helped create Dataset Search[2].

  • “This type of search has long been the dream for many researchers in the open data and science communities.” Ed Kearns, chief data office at NOAA[2].

Further Information


  1. 1.0 1.1 1.2 1.3 Vincent, J. (2018). Google Launches New Search Engine to Help Scientists Find the Datasets They Need: Dataset Search Could Be a Scientist’s Best Friend. Retrieved From:
  2. 2.0 2.1 2.2 2.3 2.4 2.5 Lahoti, S. (2018). Google Launches a Dataset Search Engine for Finding Datasets on the Internet. Retrieved From:
  3. 3.0 3.1 3.2 3.3 3.4 Mehta, I. (2018). Google’s New Search Engine Reveals Public Datasets for Research and Journalism Projects. Retrieved From:
  4. Schwarz, B. (2018). Google Dataset Search: A New Search Service to Find Data from Sciences, Government, Some News Organizations – Using Dataset Schema, You Can Mark Up Your Data to get it Included in the New Google Dataset Search Engine. Retrieved From: