DeDolphins / DataHorse

Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs.
https://space.datahorse.ai/
MIT License
242 stars 22 forks source link

Inconsistence result for the same query #5

Closed poneoneo closed 1 month ago

poneoneo commented 2 months ago

I decided to integrate datahorse into my project as an ai-agent. So for each query a csv file is provided to data-horse but sometime for the same query i'm often facing different scenario:

bedhind the scene i decided to add "always return the result as dataframe" to reduce the occurence amount of all errors above mentionned.

So how to fix result or plot render or that datahorse try to import not available library?

SsebowaDisan commented 2 months ago

Thank you for highlighting these concerns. We understand the challenges you are facing, and we want to address each issue:

Sohammhatre10 commented 1 month ago

@SsebowaDisan @poneoneo I reckon setting the seed will generate similar or in some cases the same result which may help in solving the issue here for the llama-8b-8192 Reference used - https://medium.com/@kyfex/groqs-chat-settings-dfbb601efdca Should I try on the changes needed?

SsebowaDisan commented 1 month ago

Hello @Sohammhatre10 , Thank you for the suggestion about setting the seed! It does sound like a great approach for ensuring more consistent results. I appreciate your willingness to contribute to the project.

If you’d like to help with this, feel free to dive into the implementation based on the reference you shared.

Looking forward to collaborating with you on this!

Best regards, Disan

poneoneo commented 1 month ago

I think it's could be great to provide endpoints for user and allow them to change it based on their needs

Sohammhatre10 commented 1 month ago

@poneoneo That'll be great for personalization.

Sohammhatre10 commented 1 month ago

Have tried to clear this using a different caching mechanism and seeding technique. Works well here https://github.com/DeDolphins/DataHorse/pull/9 @SsebowaDisan @poneoneo Hope this solves the issue!