ECMWFCode4Earth / challenges_2024

Discover the ECMWF Code for Earth 2024 challenges
46 stars 4 forks source link

Challenge 24 - Knowledge Graph Generation for Enhanced Chatbot and Scientific Literature Synthesis #7

Open RubenRT7 opened 4 months ago

RubenRT7 commented 4 months ago

Challenge 24 - Knowledge Graph Generation for Enhanced Chatbot and Scientific Literature Synthesis

Stream 2 - Machine Learning for Earth Sciences applications

Goal

In this challenge, participants will build a tool to enhance the existing ECMWF chatbot*. The goal is to create a system that can understand scientific texts about weather and use that information to make interactive graphs to facilitate better understanding and exploration of weather-related concepts and phenomena.

Mentors and skills


Challenge description

Knowledge graphs help by structuring knowledge, making it easier to understand and use. By adding knowledge graphs to chatbots and search engines, user experience can be enhanced. Additionally, a tool capable of generating interactive content graphs could allow simply the process of synthesizing scientific literature, making it easier to explore and analyze connections between ideas.

Challenge Tasks

Dataset Participants will be provided with datasets containing scientific literature, weather data, domain-specific ontologies, and any other relevant sources, including preliminary entity dataset extracted from the chatbot development.

Ben-EJ commented 3 months ago

Hello, could you clarify the following questions please. What are the challenges users face regarding user experience interacting with the chatbot, and how can a knowledge graph address them effectively? Who is going to use the chatbot, is it going to be the public or researchers, for example? Thank you.

lincent commented 3 months ago

Hello, just trying to work out what information will end up in the knowledge graph, can you give examples of what you mean by weather data? is it numerical data in forecast GRIB/netCDF format or more plain language based?

anaprietonem commented 3 months ago

Hello, could you please clarify the following questions please. What are the challenges users face regarding user experience interacting with the chatbot, and how can a knowledge graph address them effectively? Who is going to use the chatbot, is it going to be the public or researchers, for example? Thank you.

Hello, thanks for your question. Regarding the user experience what we would like as a potential outcome of this challenge is that chatbot could also point users to the knowledge graph so users could also explore the information in an interactive way. But just to flag that the integration between chatbot-knowledge graph would be explored if there is enough time. We would like to keep the main focus around building an interactive knowledge graph and if useful reusing some tools for NLP that were used to implement the chatbot. The chatbot is public and can be found here https://chat.ecmwf.int/

anaprietonem commented 3 months ago

Hello, just trying to work out what information will end up in the knowledge graph, can you give examples of what you mean by weather data? is it numerical data in forecast GRIB/netCDF format or more plain language based?

Thanks for the question! With weather data we were referring to more plain language based

tjohnson-scottlogic commented 3 months ago

Hi, we’d like to try the process of acquiring data, processing data, generating the knowledge graph and then visualising that graph with a simple set of initial data. Could you recommend what we’d start with? Ideally the simplest possible that allows us to test the process. Would it be possible to see that data before we submit our proposal?

anaprietonem commented 3 months ago

Hi, we’d like to try the process of acquiring data, processing data, generating the knowledge graph and then visualising that graph with a simple set of initial data. Could you recommend what we’d start with? Ideally the simplest possible that allows us to test the process. Would it be possible to see that data before we submit our proposal?

Hello, unfortunately I don't think it's possible to share the data ahead of the proposal stage. However you can find examples of the technical texts related to NWP under https://www.ecmwf.int/en/publications/technical-memoranda. Other option of plausible data could be getting arxiv papers related to 'AI-based weather forecasting'. Currently, https://www.connectedpapers.com/ provides generation of knowledge graphs for topics like this but their graphs are based on citations and not on the entities/context of the papers

Screenshot 2024-03-25 at 16 47 38
Viet1004 commented 3 months ago

Hello. I have a question. We have a couple of people, but some of them are interested in some projects but not others. And they can have multiple interests. My question is, can we participate in multiple proposal rounds? And if one person applies for multiple proposals and these proposals pass (supposedly), can they withdraw from the project without affecting other teammates?

trakasa commented 3 months ago

Hi @Viet1004 many thanks for your interest in this challenge, respectively in the Code for Earth.

Generally participants can submit a proposal for more than one challenge. Also individuals can be members of different teams submitting a proposal. Please note, you have to submit your proposal by 9. April, then we enter the selection process, there won't be other "proposal rounds".

Withdrawing from a team, if you are team member in various selected teams, is possible. As you wrote, this should of course not affect the team in working and providing the solution described in their proposal.

We highly recommend though that you carefully check the challenges you are interested in, get in touch with the mentors for any related questions and chose the one(s) you have really a vetted interest in.

Last but not least, be sure you have checked the eligibility rules, Article 3 in the Terms&Conditions.

I hope this answers your questions.

Bye, Athina