asah / footprint2009dev

original dev repo for AllForGood.org
http://AllForGood.org/

servenet (and americorps?): we're not mapping their orgnames correctly (was: "Test nonprofit" in the feeds) #205

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Aaron-- can you use FP to track down the provider, check their raw data to confirm, then get them to change this?

field c:org_name:string:
 15034 American Red Cross - Mid Coast Chapter
  9729 Test nonprofit                             <===
  2568 Communities Altering The Strays (CATS)
  1996 

(fwiw, I'm grabbing most of the others, which you can't do...)

Original issue reported on code.google.com by adam.sah on 13 May 2009 at 2:05

GoogleCodeExporter commented 9 years ago
It appears to be servenet, as this search

http://footprint2009dev.appspot.com/search#q=%22Test%20nonprofit%22&num=10&start=11&timeperiod=everything&cache=1

results in only servenet habitat entries.

I couldn't find "test nonprofit" in an older version of servenet's huge (non-fpxml) feed. I'm downloading the current one now. Adam, can you verify that your custom parser doesn't introduce it?

Original comment by aaro...@gmail.com on 13 May 2009 at 2:19

GoogleCodeExporter commented 9 years ago
downloaded latest servenet, and

grep "test nonprofit" SERVEnetOpportunities001.xml

returns nothing... the trail goes cold. If you give me another bread crumb, I'll keep going.

Original comment by aaro...@gmail.com on 13 May 2009 at 2:42

GoogleCodeExporter commented 9 years ago
always use grep -i (case insensitive) ;-)
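For readers following along, here is a minimal Python sketch of why the earlier search came up empty: the feed spells the name "Test nonprofit" with a capital T, so a case-sensitive match on "test nonprofit" finds nothing. The helper name `grep_i` and the two sample lines are made up for illustration and are not part of the project's code.

```python
import re


def grep_i(pattern, lines):
    """Return the lines containing `pattern` as a literal string, ignoring case."""
    # re.escape treats the pattern as fixed text rather than a regex.
    rx = re.compile(re.escape(pattern), re.IGNORECASE)
    return [line for line in lines if rx.search(line)]


# Hypothetical sample lines standing in for the real feed.
sample_lines = [
    "<Name>Test nonprofit</Name>",
    "<Name>Campus Compact</Name>",
]

# Case-sensitive substring test, comparable to plain `grep "test nonprofit"`.
case_sensitive_hits = [l for l in sample_lines if "test nonprofit" in l]

# Case-insensitive test, comparable to `grep -i "test nonprofit"`.
case_insensitive_hits = grep_i("test nonprofit", sample_lines)

print(len(case_sensitive_hits), len(case_insensitive_hits))
```

The case-sensitive list is empty while the case-insensitive one contains the "Test nonprofit" line, which matches what happened with the real feed.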

I only found 1 record...

<VolunteerOpportunity>
  <LocalID>2298</LocalID>
  <Categories><Category><CategoryID>4</CategoryID></Category></Categories>
  <OpportunityType><OpportunityTypeID>2</OpportunityTypeID></OpportunityType>
  <Title>Tutoring in Math</Title>
  <DetailURL>http://servenet.org/MyiBelong/Edit/Calendar/tabid/104/ctl/Details/mid/379/ItemId/2298/Default.aspx</DetailURL>
  <Description>We are offering opportunities in you area.&lt;BR&gt; &lt;b&gt;Maximum Volunteers: [1]&lt;/b&gt;&lt;BR&gt; &lt;b&gt;Available Days:&lt;/b&gt; Monday, Thursday</Description>
  <LogoURL>http://servenet.org/images/servenet_logo.gif</LogoURL>
  <LocationClassifications><LocationClassification><LocationClassificationID>1</LocationClassificationID></LocationClassification></LocationClassifications>
  <Locations><Location><Address1>Test</Address1><City>Test</City><StateOrProvince>District of Columbia</StateOrProvince><ZipOrPostalCode>20009</ZipOrPostalCode><Country>United States</Country></Location></Locations>
  <Audiences><Audience><AudiencesID>3</AudiencesID></Audience></Audiences>
  <SponsoringOrganizations><SponsoringOrganization><Name>Test nonprofit</Name><Address1>Test </Address1><City>Test</City><StateOrProvince>District of Columbia</StateOrProvince><ZipOrPostalCode>20009</ZipOrPostalCode><Country>United States</Country><Phone>9999999999</Phone><URL /><LogoURL>http://servenet.org/images/servenet_logo.gif</LogoURL></SponsoringOrganization></SponsoringOrganizations>
</VolunteerOpportunity>

looks like a bug in the pipeline... I'll fix.

sargent~/pkg/footprint2009dev/datahub>cat servenet.xml | perl -ne 's@</VolunteerOpportunity>@</VolunteerOpportunity>\n@g;print;' | perl -ne 'm@<SponsoringOrganization>(.+?)</SponsoringOrganization>@;print "$1\n";' | uniq -c|head -10
    267 <Name /><Address1 /><City /><StateOrProvince /><ZipOrPostalCode /><Country /><Phone /><URL /><LogoURL>http://servenet.org/images/servenet_logo.gif</LogoURL>
      7 <Name>SERVEnet</Name><Address1>1101 15th St NW Suite 200</Address1><City>Washington</City><StateOrProvince>District of Columbia</StateOrProvince><ZipOrPostalCode>20005</ZipOrPostalCode><Country>United States</Country><Phone>2022962992</Phone><URL>www.ysa.org</URL><LogoURL>http://servenet.org/images/servenet_logo.gif</LogoURL>
    166 <Name>The West Virginia Commission for National and Community Service</Name><Address1>601 Delaware Avenue P.O. Box 11778</Address1><City>Charleston</City><StateOrProvince>West Virginia</StateOrProvince><ZipOrPostalCode>25302</ZipOrPostalCode><Country>United States</Country><Phone>304-558-0111, or 800/WV-HELPS</Phone><URL>www.wvwise.org</URL><LogoURL>http://servenet.org/images/servenet_logo.gif</LogoURL>
    733 <Name>Campus Compact</Name><Address1>Box 1975 Brown University</Address1><City>Providence</City><StateOrProvince>Rhode Island</StateOrProvince><ZipOrPostalCode>02912</ZipOrPostalCode><Country>United States</Country><Phone>(401) 863-1119</Phone><URL>www.compact.org</URL><LogoURL>http://servenet.org/images/servenet_logo.gif</LogoURL>
    148 <Name>Communities in Schools</Name><Address1>277 S. Washington Street Suite 210</Address1><City>Alexandria</City><StateOrProvince>Virginia</StateOrProvince><ZipOrPostalCode>22314</ZipOrPostalCode><Country>United States</Country><Phone>7035198999</Phone><URL>www.cisnet.org</URL><LogoURL>http://servenet.org/images/servenet_logo.gif</LogoURL>
     28 <Name>CityCares</Name><Address1>1605 Peachtree St., Suite 100 </Address1><City>Atlanta</City><StateOrProvince>Georgia</StateOrProvince><ZipOrPostalCode>30309</ZipOrPostalCode><Country>United States</Country><Phone>404-875-7334</Phone><URL>www.citycares.org</URL><LogoURL>http://servenet.org/images/servenet_logo.gif</LogoURL>
    181 <Name>Girls Incorporated National Headquarters</Name><Address1>441 West Michigan Street </Address1><City>Indianapolis</City><StateOrProvince>Indiana</StateOrProvince><ZipOrPostalCode>46202</ZipOrPostalCode><Country>United States</Country><Phone>317-634-7546, x31</Phone><URL>www.girlsinc.org</URL><LogoURL>http://servenet.org/images/servenet_logo.gif</LogoURL>
    151 <Name>Campus Outreach Opportunity League</Name><Address1>37 Temple Place Suite 401</Address1><City>Boston</City><StateOrProvince>Massachusetts</StateOrProvince><ZipOrPostalCode>02111</ZipOrPostalCode><Country>United States</Country><Phone>6176952665</Phone><URL>www.cool2serve.org/index2.htm</URL><LogoURL>http://servenet.org/images/servenet_logo.gif</LogoURL>
    901 <Name>The Corporation for National Service</Name><Address1>1201 New York Avenue, NW </Address1><City>Washington</City><StateOrProvince>District of Columbia</StateOrProvince><ZipOrPostalCode>20525</ZipOrPostalCode><Country>United States</Country><Phone>2026065000</Phone><URL>www.seniorcorps.org</URL><LogoURL>http://servenet.org/images/servenet_logo.gif</LogoURL>
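
The same kind of check can be sketched in Python by tallying the `<Name>` of every `<SponsoringOrganization>`. This is only an illustration under stated assumptions: the function name `count_sponsor_names`, the `<Feed>` wrapper element, and the four sample records are hypothetical, and the real feed's root element is not shown in this thread. Unlike the pipeline above, where `uniq -c` without a preceding `sort` counts runs of consecutive identical lines, this version produces one total per name.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def count_sponsor_names(xml_text):
    """Tally SponsoringOrganization/Name values across the whole document."""
    root = ET.fromstring(xml_text)
    counts = Counter()
    for org in root.iter("SponsoringOrganization"):
        # findtext returns None if <Name> is missing and "" if it is empty;
        # both cases are bucketed under the empty string.
        name = (org.findtext("Name") or "").strip()
        counts[name] += 1
    return counts


# Hand-made sample; the <Feed> wrapper and these records are hypothetical.
SAMPLE = """<Feed>
<VolunteerOpportunity><SponsoringOrganizations><SponsoringOrganization><Name>Test nonprofit</Name></SponsoringOrganization></SponsoringOrganizations></VolunteerOpportunity>
<VolunteerOpportunity><SponsoringOrganizations><SponsoringOrganization><Name /></SponsoringOrganization></SponsoringOrganizations></VolunteerOpportunity>
<VolunteerOpportunity><SponsoringOrganizations><SponsoringOrganization><Name>Campus Compact</Name></SponsoringOrganization></SponsoringOrganizations></VolunteerOpportunity>
<VolunteerOpportunity><SponsoringOrganizations><SponsoringOrganization><Name>Campus Compact</Name></SponsoringOrganization></SponsoringOrganizations></VolunteerOpportunity>
</Feed>"""

sponsor_counts = count_sponsor_names(SAMPLE)
for name, n in sponsor_counts.most_common():
    print(f"{n:7d} {name or '(empty Name)'}")
```

Records with an empty `<Name />`, like the 267-count line in the output above, land in their own bucket, which makes missing organization names easy to spot.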

Original comment by adam.sah on 13 May 2009 at 4:11

GoogleCodeExporter commented 9 years ago

Original comment by adam.sah on 13 May 2009 at 8:35

GoogleCodeExporter commented 9 years ago

Original comment by adam.sah on 16 May 2009 at 7:34

GoogleCodeExporter commented 9 years ago

Original comment by adam.sah on 26 May 2009 at 5:28

GoogleCodeExporter commented 9 years ago

Original comment by adam.sah on 26 May 2009 at 7:49

GoogleCodeExporter commented 9 years ago
fixed in r697

Original comment by adam.sah on 4 Jun 2009 at 1:10