Ailixir is an application that utilises LLMs and custom user input to generate AI agent prototypes specialised in fields such as health, economics, physics etc. The prototypes enable the user, which is an entrepreneur-developer, to compare the results produced by different LLMs.
Develop a web scraper to extract recipe content from the Nutritionfacts.org website. This scraper will use the existing orchestrator infrastructure for task scheduling, execution and monitoring. The scraper will collect data including recipe names, ingredients, instructions, and nutritional information. The data will be stored in JSON format in a MongoDB database hosted on Google Cloud Servers. The extracted data will be utilized in creating context for custom agents specialized in nutrition advice.
User Story
As a common-user,
I want access to nutritional information from reliable sources provided by my customised agent
so that I can make informed lifestyle decisions.
Acceptance Criteria
[x] The scraper is able to access and parse Nutritionfacts.org.
[x] Recipe names, ingredients, instructions, and nutritional information are extracted.
[x] Data is stored in JSON format in the database.
[ ] The scraper uses the orchestrator for task scheduling, execution, and monitoring.
[ ] Error handling and logging are implemented for scraper failures.
Definition of Done
[ ] The feature has been fully implemented.
[ ] The feature has been manually tested and works as expected without critical bugs.
[ ] The feature code is documented with clear explanations of its functionality and usage.
[ ] The feature code has been reviewed and approved by at least one team member.
[ ] The feature branches have been merged into the main branch and closed.
[ ] The feature utility, function and usage have been documented in the respective project wiki on github.
Domain
app frontend, app backend
Description
Develop a web scraper to extract recipe content from the Nutritionfacts.org website. This scraper will use the existing orchestrator infrastructure for task scheduling, execution and monitoring. The scraper will collect data including recipe names, ingredients, instructions, and nutritional information. The data will be stored in JSON format in a MongoDB database hosted on Google Cloud Servers. The extracted data will be utilized in creating context for custom agents specialized in nutrition advice.
User Story
Acceptance Criteria
Definition of Done