ICT4SD / Science_Technology_Search

Build a searchable collection of science and technology knowledge useful to implement the Sustainable Development Goals.
https://ict4sd.github.io/2016/09/21/Project_ST_SEARCH/
GNU General Public License v3.0
1 stars 1 forks source link

Science & Technology Search Engine for Sustainable Development

Build a searchable collection of science and technology knowledge useful to implement the Sustainable Development Goals.

The internet has billions of pages. This project aims to build a curated collection of content relevant to sustainable development, keeping and categorizing content from high quality sources. Such as:

For example, if you were to search for "child nutrition tools" the results set would be expected to include for example:

Getting started

Navigating the vast amount of content abailable on the Internet is daunting, a way to get started could be to narrow the collection to include only websites belonging to this narrow list of owners:

Classification of content

In addition to being able to find content based on keywords, it would be useful to be able to narrow down search results by the type of content found. These are some ideas for classification:

Examples

1) Sentence tagging and visualization

2) Sentence tagging and search

3) This is an example of how documents might look like in the search engine server:

{
  "_index" : "sti_search",  
  "_type" : "site",  
  "_id" : "0",  
  "_score" : 1.0,  
  "_source" : {  
    "Country Origin" : [ "India" ],  
    "Entity Origin" : [ "Permanent Mission to the UN", "Ministry of Foreign Affairs" ],  
    "Domain Origin" : [ ".gov.in"],  
    "Title" : [ "Analysis: What this technology means for the future of the console health" ],  
    "Content" : [ "Company co-founder and current board member is going to offload some of his stock. The CEO and is the companys largest individual shareholder, said Thursday that he plans to sell a new device for poverty reduction." ],  
    "url" : "https://example.un.org/12345",  
    "Language" : [ "English", "French" ],  
    "Resource Type" : [ "News", "Projects" ],  
    "Resource Format" : [ "Webpage", "Video" ],  
    "SDG Goal" : [ "Health", "Poverty" ],  
    "Indexed Date" : "2015-31-12"  
  }  

You can see a demo interactive search engine server here: (Note: fake content!!) http://132.148.64.70:9200/_search?pretty=true&q=:

NOTE: Since the international community has agreed on the 17 Sustainable Development Goals for the year 2030, it would be ideal to have a categorization of content according to these goals as well. See the goals below:

SDG poster