Open yehonadav opened 5 years ago
my branch is ready for review
but don't delete it after review please
i need a python program that uses the requests module to crawl html data
my program asks for words, and url addresses and counts word occurence for each url my file structure:
git_learn
|
yehonadav
|
crawler
|-README.md # how to file
|-requirements.txt # external dependencies file
|-__init__.py # main program function file
|-__main__.py # the file running the main program
|-menu.py # file holding a string parameter of the start menu of the program
|-count_url_words.py # function with input: urls: dict, words: list, updates each url with words and number of occurences
|-get_urls.py # function to ask user url addresses and return a dictionary
|-get_words.py # function to ask words and return a list of words
|-show_counters.py # function to show each word occurences for each url
https://github.com/yehonadav/learn_git/blob/yehonadav/images/crawler.JPG
creating 3 programs