asah / footprint2009dev

original dev repo for AllForGood.org
http://AllForGood.org/

Showing lots of dups -- servenet #224

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
I noticed we're now showing dups from servenet.org

Example:
http://footprint2009dev.appspot.com/search#q=%20the&num=10&start=11&vol_loc=37.415%2C-122.059&timeperiod=everything&cache=1

The result below is shown many times on page 1, 2, etc:

Get involved at the YMCA in your neighborhood!
California 94020 United States - Present
Together, the nations more than 2,540 YMCAs are the largest not-for-profit 
community service organizations in America, working to meet the health and 
social service needs of 18.9 millions men, women and children worldwide of 
people of all faiths, races, abilities, ages and income. ...
http://servenet.org - Like - Add to calendar

Original issue reported on code.google.com by kar...@gmail.com on 14 May 2009 at 7:06

GoogleCodeExporter commented 9 years ago
i'm on it-- have it repro'd

Original comment by adam.sah on 14 May 2009 at 7:44

GoogleCodeExporter commented 9 years ago
Karen-- each listing has a different location. The current design
states that we dedup on date/time but not on location.  Want to change
that?  Michael had an interesting suggestion, which is to merge listings
on "nearby" locations (algorithmically, fuzz out the latlong, then
check for equality-- not perfect, but good enough + super easy)

Original comment by adam.sah on 14 May 2009 at 8:28
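
A minimal sketch of the "fuzz out the latlong, then check for equality" idea from the comment above. This is an illustration only, not the project's code: the Listing record, its field names, the dedup() helper, and the choice of rounding to one decimal place are all assumptions made for the example.

```python
from typing import Iterable, List, NamedTuple, Tuple


class Listing(NamedTuple):
    # Hypothetical record; field names are illustrative, not the real schema.
    title: str
    lat: float
    lng: float


def fuzzed_location(lat: float, lng: float, places: int = 1) -> Tuple[float, float]:
    """Round coordinates so that nearby points compare equal."""
    return (round(lat, places), round(lng, places))


def dedup(listings: Iterable[Listing], places: int = 1) -> List[Listing]:
    """Keep the first listing seen for each (title, fuzzed location) key."""
    seen = set()
    kept = []
    for item in listings:
        key = (item.title, fuzzed_location(item.lat, item.lng, places))
        if key not in seen:
            seen.add(key)
            kept.append(item)
    return kept


if __name__ == "__main__":
    results = [
        Listing("Get involved at the YMCA", 37.415, -122.059),
        Listing("Get involved at the YMCA", 37.42, -122.06),   # nearby: merged
        Listing("Get involved at the YMCA", 34.05, -118.24),   # far away: kept
    ]
    for item in dedup(results):
        print(item)
```

As the comment says, this is not perfect: two points that sit on opposite sides of a rounding boundary stay distinct even when they are very close together.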

GoogleCodeExporter commented 9 years ago
Yikes, that's an ugly set of results.  
It does seem correct to merge on location, in addition to date.  The 
"has_more_results" attribute will need to be renamed "has_more_dates".

You think this is low-risk to implement?

Original comment by paul.rademacher on 14 May 2009 at 9:24

GoogleCodeExporter commented 9 years ago
Good catch.  In general, we should show city name and not just state.  I
probably would have caught this as not an exact dup.

If we fix this for Monday, would we suppress all the dups in the UI?  I think
long-term we might want to do something like web search, where you can click
to see similar results... (UI/functionality needs to be figured out)

Original comment by kar...@gmail.com on 14 May 2009 at 9:25

GoogleCodeExporter commented 9 years ago
yeah, given the complexity of this code, I think I'd rather punt until
immediately post-launch.  is that OK?

Michael and I attempted to do the websearch-style UI with "more..." in earlier
versions, which means that some of this code still exists, but also means that
things are messy to change at the last minute.

Original comment by adam.sah on 14 May 2009 at 9:32

GoogleCodeExporter commented 9 years ago
feed removed from v1 launch

Original comment by adam.sah on 28 May 2009 at 5:06

GoogleCodeExporter commented 9 years ago
erring on the side of adding searchqual label (for searchqual prioritization)

Original comment by adam.sah on 3 Jun 2009 at 9:54

GoogleCodeExporter commented 9 years ago
feed re-added to launch, i.e. priority increased

Original comment by adam.sah on 4 Jun 2009 at 1:13

GoogleCodeExporter commented 9 years ago

Original comment by adam.sah on 5 Jun 2009 at 2:45

GoogleCodeExporter commented 9 years ago
these are still important, but moving out from 1.5 i.e. super nice to have
for 1.5 but not necessarily realistic

Original comment by adam.sah on 11 Jun 2009 at 6:54

GoogleCodeExporter commented 9 years ago

Original comment by adam.sah on 28 Jun 2009 at 4:25