From our conversation, we are not as concerned with finding IR-related datasets - but rather ones that will work well for our purposes. (Mostly around the type of variables they have).
This issue is going to be my place to talk about some datasets that we can use. As per Evan's suggestion, I have been looking at the UC Irvine Machine Learning Repository which has a large number of datasets organized by domain and kind of problem.
One potential dataset is the Bike Sharing Dataset of Bike rentals in Washington DC. It has calendar and weather information. and could be used to look at differences in the number of rentals by the characteristics of the day.
From our conversation, we are not as concerned with finding IR-related datasets - but rather ones that will work well for our purposes. (Mostly around the type of variables they have).
This issue is going to be my place to talk about some datasets that we can use. As per Evan's suggestion, I have been looking at the UC Irvine Machine Learning Repository which has a large number of datasets organized by domain and kind of problem.
One potential dataset is the Bike Sharing Dataset of Bike rentals in Washington DC. It has calendar and weather information. and could be used to look at differences in the number of rentals by the characteristics of the day.