Open Jeffwan opened 1 year ago
Hi! Thanks for reporting this issue.
There are a couple of likely reasons for this issue:
- I noticed you are using GPT-3.5-turbo-16k, which sometimes doesn't follow our system prompt perfectly.
- We implement Kaggle search on top of the official Kaggle API. However, I noticed the API doesn't always return what you would expect. Here is a minimal script to illustrate:
from kaggle.api.kaggle_api_extended import KaggleApi
api = KaggleApi()
api.authenticate()
keywords = "NBA play performance statistics"
results = api.dataset_list(search=keywords, page=1, max_size=20000, file_type="csv")
print(results)
The result from the API is in fact an empty list:
[]
However, the same search on kaggle.com does return results.
So, we work around such empty results by falling back to the default datasets (keyword=""), which may leave the returned datasets unrelated to your original request. It is a bit brute-force, but it ensures that some results are returned after a single API call. 😅
Thanks for the explanation: it might be better to just say "I could not find any dataset related to xxx" if the API returns an empty list.
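A minimal sketch of that suggestion, in case it helps whoever picks this up. The names `search_datasets` and `search_fn` are hypothetical and not taken from the project's code; the search call is passed in as a function so the empty-result handling can be tried without Kaggle credentials.

```python
from typing import Callable, List, Tuple


def search_datasets(search_fn: Callable[[str], List], keywords: str) -> Tuple[List, str]:
    """Run a search and say so plainly when nothing matches.

    `search_fn` is a hypothetical hook: any callable that takes the
    keywords and returns a list of datasets.
    """
    results = search_fn(keywords)
    if not results:
        # No fallback to default datasets: report the empty result instead.
        return [], f"I could not find any dataset related to {keywords!r}."
    return results, f"Found {len(results)} dataset(s) related to {keywords!r}."


# With the Kaggle API from the script above, the hook could look like this
# (assumes `api` is an authenticated KaggleApi instance):
#   search_fn = lambda kw: api.dataset_list(
#       search=kw, page=1, max_size=20000, file_type="csv")
```

With this shape, the assistant would only show dataset cards when the list is non-empty, and would otherwise pass the message back to the user.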
@harrywang Thanks for pointing that out! Would you be interested in opening a small pull request to fix this and become a contributor?
I am asking for NBA-related datasets but get unrelated cards, such as world population, shown instead. However, it does show:
You will find a variety of datasets related to NBA play performance statistics that you can explore.
Can someone take a look at the issue?