torch2424 / link-drops

A collection of your links that you find across the internet, made with the help of @julianpoy
http://linkdrops.com/#/
Apache License 2.0
2 stars 1 forks source link

Get Titles Ourselves #35

Closed torch2424 closed 8 years ago

torch2424 commented 8 years ago

Yes scraping walks a thin line. But it will save us alot of money from embedly!

Currently No Embed is our main title grabber, with embedly being the backup. we should be the backup just incase! and to keep costs low!

torch2424 commented 8 years ago

We may also look into finding other providers to make embedly a third option!

torch2424 commented 8 years ago

This is a good alternative for returning favicons

http://stackoverflow.com/questions/9856544/google-api-url-to-site-title

Or Julain suggested just going to the page before any slahes and like Google.com/favicon.gif and simply use that

torch2424 commented 8 years ago

Great for grabbing a pages title:

http://decenturl.com/tools

julianpoy commented 8 years ago

if you look at the reply on the link you posted (http://stackoverflow.com/questions/9856544/google-api-url-to-site-title), you will see that the very first reply (not answer) actually mentions my solution.

Keep in mind that you can grab the favicon yourself by accessing it directly at www.whatevertheURLis.com/favicon.ico. I think the Google favicon tool was meant more for conversion. It outputs a png which can be used in img tags. – ShiggityShiggityShwa

Seeing as we don't need conversion, there should be no reason not to grab it ourselves.

julianpoy commented 8 years ago

Ill handle this entire issue.

torch2424 commented 8 years ago

@julianpoy Okay cool! you can handle the favicon. And I can handle the title? OR you wanna do all dat

julianpoy commented 8 years ago

I will handle all of it. I wrote a service to do the titles manually. Try it out! http://dev.kondeo.com/api/page-scraper.php?q=

torch2424 commented 8 years ago

DUDE I FUKCING LOVE YOU!!!! :D