PokemonGoers / PokeData

In this project you will scrape as much data as you can get about the *actual* sightings of Pokemons. As it turns out, players all around the world have started reporting sightings of Pokemons and are logging them into a central repository (i.e. a database). We want to get this data so we can train our machine learning models. You will of course need to come up with other data sources, not only for sightings but also for other relevant details that can be used later on as features for our machine learning algorithm (see Project B). Additional features could be the air temperature at the timestamp of the sighting, or whether the location is close to water, buildings or parks. Consult a Pokemon Go expert if you have one around you, and come up with as many features as possible that describe the place, time and name of a sighted Pokemon. Another feature that you will implement is a Twitter listener: you will use the Twitter streaming API (https://dev.twitter.com/streaming/public) to listen on a specific topic (for example, the #foundPokemon hashtag). When a new tweet with that hashtag is written, an event will be fired in your application that checks the details of the tweet, e.g. location, user, time stamp. Additionally, you will try to parse formatted text from the tweets to construct a new “seen” record that will then be added to the database. Some of the attributes of the record will be the Pokemon's name, location and the time stamp. Additional data sources (here is one: https://pkmngowiki.com/wiki/Pok%C3%A9mon) will also need to be integrated to give us more information about Pokemons, e.g. what they are, what their relationships are, what they can transform into, which attacks they can perform etc.
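The tweet-parsing step described above could look roughly like the sketch below. It is only an illustration: the description does not define a tweet format, so the `#foundPokemon <name> at <lat>,<lon>` layout, the `Sighting` record and the `parse_sighting` helper are all assumptions made for this example. It also does not talk to the Twitter API; it only covers turning an already received tweet text and timestamp into a “seen” record.

```python
import re
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional

# Hypothetical tweet layout (an assumption, not defined by the project):
#   "#foundPokemon <name> at <lat>,<lon>"
SIGHTING_PATTERN = re.compile(
    r"#foundPokemon\s+(?P<name>[A-Za-z][A-Za-z.' -]*?)\s+at\s+"
    r"(?P<lat>-?\d+(?:\.\d+)?)\s*,\s*(?P<lon>-?\d+(?:\.\d+)?)",
    re.IGNORECASE,
)


@dataclass
class Sighting:
    """Hypothetical "seen" record: name, location and time stamp."""
    name: str
    latitude: float
    longitude: float
    seen_at: datetime


def parse_sighting(text: str, created_at: datetime) -> Optional[Sighting]:
    """Return a Sighting if the tweet text matches the assumed layout, else None."""
    match = SIGHTING_PATTERN.search(text)
    if match is None:
        return None
    lat = float(match.group("lat"))
    lon = float(match.group("lon"))
    # Reject coordinates that cannot be a valid latitude/longitude pair.
    if not (-90 <= lat <= 90 and -180 <= lon <= 180):
        return None
    return Sighting(match.group("name").strip(), lat, lon, created_at)


if __name__ == "__main__":
    tweet_time = datetime(2016, 8, 1, 12, 0, tzinfo=timezone.utc)
    print(parse_sighting("#foundPokemon Pikachu at 48.1351,11.5820", tweet_time))
```

Tweets that do not match the assumed layout simply yield `None`, so the listener can skip them instead of writing partial records to the database.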
Apache License 2.0

Additional data: More images / image formats #160

Closed AlexanderLill closed 8 years ago

AlexanderLill commented 8 years ago

Hello everyone,

It would be very helpful to have a few more image formats per Pokemon for our website / app. We are mainly talking about different formats, not necessarily different image content :)

Particularly the following would be helpful:

If there are any questions, feel free to ask. I think the ones without animation are the most important right now ( @johartl )

jonas-he commented 8 years ago

What about these for the big ones with transparency: http://nintendo.wikia.com/wiki/Category:First_generation_Pok%C3%A9mon_images We have to make sure they are under a free license, right? I don't know about these ones ...

AlexanderLill commented 8 years ago

The Service includes access to audio and/or visual content (including videos, photographs, music, graphics, logos, etc.), which has been licensed for use and exhibition herein from third parties (“Media Content”). Such Media Content may be accessible via a player embedded on the Site, which links to and plays content hosted on other sites or services (“Media Player”). Without limiting the application of any other terms of this Agreement, you also agree that you will not do, or attempt to do, any of the following:

- attempt to download, redistribute, alter, delete, or insert advertising or promotional materials within any Media Content or in connection with any Media Player;
- use any Media Content or Media Player for any purpose other than non-commercial, personal consumption and viewing such Media Content;
- attempt to circumvent, disable or otherwise interfere with security-related features of the Service or any Media Player, including any features that prevent or restrict use or copying of any Media Content;
- attempt to modify, enhance, edit, translate, adapt, alter, reverse engineer, decompile, disassemble, or create derivative works based upon any Media Player or Media Content;
- attempt to license, transfer, convey or assign to any third party any rights to use or exhibit any Media Player or Media Content, including to market, promote, display, re-distribute or re-transmit any Media Player or Media Content;
- reformat, optimize or customize any Media Player or Media Content for display, distribution or transmission via any platforms, protocols or delivery mechanisms other than as expressly permitted by and within the Services;

@jonas-he maybe we can find another source, or we ask them by mail whether it's okay for us to use them non-commercially? ( @gyachdav )

jonas-he commented 8 years ago

@AlexanderLill for these images it says that Nintendo holds the copyright but that they can be used under "fair use" (e.g. http://nintendo.wikia.com/wiki/File:Tauros.png). As wikia.com is not related to Nintendo, uses these images and apparently doesn't have problems with Nintendo, I think we might use them as well. However, I don't want to have the last word on this @gyachdav

jonas-he commented 8 years ago

another source, also fair use: https://github.com/jnovack/pokemon-svg

jonas-he commented 8 years ago

@Lugitan maybe you want to join in on the conversation and give a status update if you have one.

samitsv commented 8 years ago

@jonas-he I had a discussion with @Lugitan as well this morning, so using the images should be fine as long as we credit Nintendo and the source. I will be using the existing GIF and PNG images from http://veekun.com/dex/downloads and the SVGs from https://github.com/jnovack/pokemon-svg/tree/master/svg. This should cover all the required images.

samitsv commented 8 years ago

@sacdallago I now want to add SVG, PNG and GIF images to my pokemonicons collection in mlab, but I am not able to because of the quota. What should I do? This is required to support the new image requirements.

samitsv commented 8 years ago

But some images might be missing on request (due to our mlab database limitation); they will be updated.

jonas-he commented 8 years ago

@samitsv yeah, there are some missing (there are 106 out of 151). I would delete some of the 2.5 million sightings, and then maybe you can try inserting the images again.
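A quick way to see exactly which entries still need images is to compare the IDs that are already stored against the full range. This is just a sketch under assumptions: it supposes the collection is keyed by Pokedex numbers 1 to 151, and `missing_ids` as well as the sample input are made up for illustration, not taken from the actual database.

```python
from typing import Iterable, List

# Total number of Pokemon mentioned in this thread.
FIRST_GENERATION_COUNT = 151


def missing_ids(present: Iterable[int], total: int = FIRST_GENERATION_COUNT) -> List[int]:
    """Return the IDs in 1..total that do not appear in `present`, in ascending order."""
    present_set = set(present)
    return [pokemon_id for pokemon_id in range(1, total + 1) if pokemon_id not in present_set]


if __name__ == "__main__":
    # Made-up example: IDs 1, 2 and 4 have images, out of a total of 5.
    print(missing_ids([1, 2, 4], total=5))  # prints [3, 5]
```

Feeding it the IDs read from the image collection would give the list of images to insert once space has been freed.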

sacdallago commented 8 years ago

@samitsv the RostLab mongo machine is accessible only from the RostLab server, which will hold the app at some stage. Meaning: I don't have a fix for the mlab quota problem, at least not for development. For production, yes.