vivelohoy / vivelohoy-3.0

The next generation site for Hoy Chicago.
http://www.vivelohoy.com/
5 stars 4 forks source link

Audit Image metadata from wire services #112

Closed luciovilla closed 10 years ago

luciovilla commented 10 years ago

Chicago Tribune

Metadata it comes with How WordPress uses metadata
Creator: Heather Charles
Creator’s Job Title: Chicago Tribune Photo by
City: Chicago
State/Province: IL - Illinois
Country: USA - United States
Description: Drinker Biddle & Reath LLP summer associate Iman Boundaoui at work in Chicago, Tuesday, July 22, 2014. Boundaoui is fasting for Ramadan during her work day she normally has coffee, tea and water while she is at work. (Heather Charles/The Chicago Tribune) B583882175Z.1 ...OUTSIDE TRIBUNE CO.- NO MAGS, NO SALES, NO INTERNET, NO TV, NEW YORK TIMES OUT, CHICAGO OUT, NO DIGITAL MANIPULATION. Description
Description Writer: B583882175Z.1
Title: CT ct-Blue-sky-Ramadan-July204.jpg Image Title
Job Identifier: CHI1407221320227042
Instructions: ....OUTSIDE TRIBUNE CO.- NO MAGS, NO SALES, NO INTERNET, NO TV, NEW YORK TIMES OUT, CHICAGO OUT, NO DIGITAL MANIPULATION…
Credit Line: Chicago Tribune
Source: Chicago Tribune

Also comes with camera data: camera, lens, focal length, exposure

Getty

Metadata it comes with How WordPress uses metadata
Creator: Andrew H. Walker
Creator’s Job Title: Staff
City: Miami Beach
State/Province: USA - United States
Headline: Around Mercedes-Benz Fashion Week Swim 2015 - Day 3 Image Title
Description: ORG XMIT: 492896431 MIAMI BEACH, FL - JULY 19: A drone camera flies during Mercedes-Benz Fashion Week Swim 2015 at The Raleigh on July 19, 2014 in Miami Beach, Florida. (Photo by Andrew H. Walker/Getty Images for Mercedes-Benz Fashion Week) Description
Description writer: ed
Title: GET 452409646
Job Identifier: CHI1407200025274764
Credit Line: (Credit too long, see caption)
Source: Getty Images North America
Copyright Notice: 2014 Getty Images

Also comes with camera data: camera, lens, focal length, exposure

Reuters

Metadata it comes with How WordPress uses metadata
Creator: Jorge Cabrera
City: Tegucigalpa
Country: HND - Honduras
Headline: Victoria Cordova and her daughter Genesis Zepeda, both recently deported from the U.S., hold hands during an interview with Reuters at their home in Tegucigalpa Image Title
Description: ORG XMIT: TBR06 Victoria Cordova and her daughter Genesis Zepeda, both recently deported from the U.S., hold hands during an interview with Reuters at their home at the impoverished 21 de Marzo neighbourhood in Tegucigalpa July 15, 2014. When 9-year-old Genesis stepped off a plane in Honduras after being deported from the United States, she was excited at the thought of seeing her cousins. For her mother, Victoria Cordova, the homecoming was terrifying: she fears being killed if she does not repay money she owes the wife of a local gang leader. Cordova had used the money to pay a smuggler to get her and Genesis to the United States. But after a grueling 2,500 km (1,600 mile) overland trek, the pair were caught entering Texas in June, sent to a detention center and then flown home this week as part of a U.S. effort to speed up the expulsion of thousands of illegal migrants, many of them children. Picture taken July 15, 2014. To match Insight story USA-IMMIGRATION/DEPORT REUTERS/Jorge Cabrera (HONDURAS - Tags: SOCIETY IMMIGRATION) Description
Description Writer: TB/JK
Title: REU USA-IMMIGRATION/DEPORT
Job Identifier: CHI1407181450352881
Credit Line: Reuters
Source: X03200

The images sampled did not come with exif data for camera/lens

nrrb commented 10 years ago

image

nrrb commented 10 years ago

Audit of metadata broken down by XMP, EXIF, and IPTC on 12 sample images from Getty, Reuters, Tribune, and Hoy (Roger Morales) in this Google spreadsheet: https://docs.google.com/spreadsheets/d/1l5hsxJLPg6aTkd_UngJBGaxeTo-Mis3v0x5RdtIscpg/edit?usp=sharing

Code I used to generate this: https://gist.github.com/tothebeat/9212e4e95485a2d533ba

I used the utility exiftool to extract this: http://www.sno.phy.queensu.ca/~phil/exiftool/

thefuturewasnow commented 10 years ago

Looks like we're not going to be able to leverage metadata for wire photos, which is ok. unfortunate, but ok as we do have other fields that provide plenty of context.

nrrb commented 10 years ago

Some discussion on this metadata stripping:

http://www.naturalexposures.com/reuters-strips-all-metadata-from-your-photos/

This one from 2007 says that Reuters stripped EXIF metadata because one of their customers complained that the presence of the EXIF metadata interfered with their workflow: http://www.controlledvocabulary.com/imagedatabases/phmdc_2007a.html

Would still be worth talking to them to ask if we can get it some other (free) way.