kaihendry / greptweet

Sane twitter backup and search
https://greptweet.com/

Some URLs not in XML #16

Closed JamieKitson closed 11 years ago

JamieKitson commented 12 years ago

For some reason, some URLs are not shown in Twitter's XML, for example:

http://api.twitter.com/1/statuses/user_timeline.xml?screen_name=jamiekitson&count=1&include_rts=1&include_entities=1&max_id=99571062211231744

The only way to catch these would be to extract every URL directly from the status text. I think this would be trickier than it sounds. However, if we did that, we could at least forget the XML complications of looking for URLs in retweets and pictures/media.
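
A rough sketch of what "catching all URLs from statuses" might look like, assuming the status text is already available as a plain string. The pattern, the `TRAILING` set, and the `extract_urls` name are all hypothetical, not anything greptweet actually uses, and the trailing-punctuation trimming is exactly the sort of thing that makes this trickier than it sounds (it would wrongly trim a URL that legitimately ends in `)`):

```python
import re

# Hypothetical pattern: an http(s) URL runs until the next whitespace.
URL_RE = re.compile(r"https?://\S+", re.IGNORECASE)

# Punctuation that often trails a URL in prose but is rarely part of it.
TRAILING = ".,;:!?)\"'"


def extract_urls(status_text):
    """Return the http(s) URLs found in a status, in order of appearance."""
    urls = []
    for match in URL_RE.finditer(status_text):
        # Trim sentence punctuation stuck to the end of the match.
        urls.append(match.group(0).rstrip(TRAILING))
    return urls
```

Something like `extract_urls(text)` could then be run over every status regardless of whether it is a retweet or carries media, at the cost of missing links written without a scheme (e.g. a bare `example.com`).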

JamieKitson commented 12 years ago

Virtually all of mine seem to come from the Mongol Rally SMS service.

JamieKitson commented 12 years ago

Although there's this one too:

http://api.twitter.com/1/statuses/user_timeline.xml?screen_name=jamiekitson&count=1&include_rts=1&include_entities=1&max_id=10232775241

I guess this was maybe before Twitter started shortening links themselves...?

kaihendry commented 11 years ago

API 1.1 uses JSON, and I think we managed to get all URLs out of the XML in the end.
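
For anyone landing here later, a minimal sketch of pulling URLs out of a 1.1-style JSON status, assuming the status was fetched with entities included and already decoded into a dict. The `urls_from_status` helper and the sample status are made up for illustration; only the `entities`, `urls`, `media`, `url` and `expanded_url` field names come from the API's status format:

```python
import json


def urls_from_status(status):
    """Collect URLs from the `entities` of one decoded status dict."""
    entities = status.get("entities", {})
    found = []
    for key in ("urls", "media"):
        for item in entities.get(key, []):
            # Prefer the expanded form; fall back to the shortened URL.
            url = item.get("expanded_url") or item.get("url")
            if url:
                found.append(url)
    return found


# Made-up sample shaped like a 1.1 status; not real API output.
SAMPLE = json.loads("""
{
  "text": "Hello https://t.co/abc123",
  "entities": {
    "urls": [
      {"url": "https://t.co/abc123", "expanded_url": "http://example.com/post"}
    ]
  }
}
""")
```

Statuses whose links never made it into `entities` (like the SMS ones above) would still come back empty here, so this does not replace scanning the text itself.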