Closed amindadgar closed 6 months ago
The recent updates focus on enhancing the ETL processes across Discord and other platforms. Changes involve refactoring code for improved data retrieval using new utility classes, updating function parameters for better processing, and enhancing integration testing for reliability.
File Path | Change Summary |
---|---|
dags/hivemind_discord_etl.py |
Refactored code to use ModulesDiscord class for community data retrieval. |
dags/hivemind_etl_helpers/discord_mongo_vector_store_etl.py |
Enhanced process_discord_guild_mongo function with new parameters. |
dags/hivemind_etl_helpers/src/db/discord/discord_raw_message_to_document.py |
Added selected_channels parameter for flexibility in message retrieval. |
dags/hivemind_etl_helpers/src/db/discord/fetch_raw_messages.py |
Updated functions to filter messages based on selected_channels . |
dags/hivemind_etl_helpers/tests/integration/test_discord_fetch_raw_messages.py |
Modified function calls for selected_channels and from_date . |
dags/hivemind_etl_helpers/tests/integration/test_discord_prepare_document_from_db.py |
Added selected_channels and from_date in function calls. |
dags/hivemind_etl_helpers/tests/integration/test_pg_vector_access_with_discord.py |
Modified _create_and_save_doc method for discord_raw_to_documents . |
🐰✨ A rabbit in the code's domain,
Hops through changes, not in vain.
Discord, Discourse, data anew,
Parameters added, functions grew.
In the digital realm, a dance so sweet,
Changes embraced, in data's heartbeat. 🌟🐾
ModulesDiscord
for Discord data retrieval, and updating test cases to reflect these changes.
Summary by CodeRabbit
New Features
Refactor