smhenderson89 / FloridaMan

A Python web scraper for a database of "Florida Man" news stories
jinja python scraper sqlite


FloridaMan - A Florida News Story Google Search Web Scraper

https://floridaman.onrender.com/today
Table of Contents
  1. Project Demo
  2. About The Project
  3. Getting Started
  4. Usage
  5. Roadmap
  6. Contributing
  7. Acknowledgments

Project Demo


Project Inspiration

This website is my final project for the CS50 edX course (2020). My goal was to create a website that shows a user their "Florida Man Horoscope", which I had learned about years earlier. You create a Florida Man Horoscope by typing your date of birth plus "Florida Man" into Google; the headlines that come back are your horoscope. (Here's an example.)

Origin of the Meme: Know Your Meme

Wikipedia: Link
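The horoscope lookup described above can be sketched in a few lines of Python. This is only an illustration, not the project's code: the function name `horoscope_query` is hypothetical, and the exact phrase format (month name and day, with the birth year ignored) is an assumption about how the search is usually typed.

```python
from datetime import date


def horoscope_query(birthday: date) -> str:
    """Build a search phrase such as 'Florida Man March 14' (hypothetical helper)."""
    # Assumption: only the month and day matter; the birth year is ignored.
    month_name = birthday.strftime("%B")  # full month name in the default locale
    return f"Florida Man {month_name} {birthday.day}"


if __name__ == "__main__":
    print(horoscope_query(date(1990, 3, 14)))
```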

To accomplish this, I built a web scraper that searches for 'Florida Man' headlines and saves the results for every day of the year (including February 29 for leap years). I put those results into a database, and any user can query them through the website.
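As a rough sketch of the storage side of that workflow, the snippet below enumerates all 366 calendar days, saves headlines per day into SQLite (which the repository topics list), and looks them up by date. The table name `headlines`, its columns, and every function name here are assumptions made for illustration, not the project's actual schema, and the scraping step itself is left out; the stored strings are placeholders rather than real headlines.

```python
import sqlite3
from datetime import date, timedelta


def all_days_of_year():
    """Return (month, day) pairs for all 366 calendar days."""
    start = date(2020, 1, 1)  # 2020 is a leap year, so February 29 is included
    days = (start + timedelta(days=i) for i in range(366))
    return [(d.month, d.day) for d in days]


def create_store(conn):
    """Create the hypothetical headlines table if it does not exist yet."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS headlines ("
        "month INTEGER NOT NULL, day INTEGER NOT NULL, headline TEXT NOT NULL)"
    )


def save_headlines(conn, month, day, headlines):
    """Store the headlines found for one calendar day."""
    conn.executemany(
        "INSERT INTO headlines (month, day, headline) VALUES (?, ?, ?)",
        [(month, day, text) for text in headlines],
    )
    conn.commit()


def headlines_for(conn, month, day):
    """Return the stored headlines for one calendar day, in insertion order."""
    rows = conn.execute(
        "SELECT headline FROM headlines WHERE month = ? AND day = ? ORDER BY rowid",
        (month, day),
    ).fetchall()
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")  # in-memory database for the demo
    create_store(conn)
    for month, day in all_days_of_year():
        # Placeholder text stands in for whatever a scraper would return.
        save_headlines(conn, month, day, [f"placeholder headline for {month}/{day}"])
    print(headlines_for(conn, 2, 29))
```

Keying rows by month and day rather than by a full date keeps the lookup independent of the birth year, which matches how the horoscope is described.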

(back to top)

Built With

(back to top)

Getting Started

To get a local copy up and running, follow these steps.

Prerequisites

Installation

  1. Clone the repo
    gh repo clone smhenderson89/FloridaMan
  2. Set up and activate a virtual environment:
    python3 -m venv venv
    . venv/bin/activate
  3. Install all pip package requirements:

    Windows

    pip install -r requirements.txt

    Mac

    pip3 install -r requirements.txt
  4. Set the environment variable:
    export FLASK_APP=app.py
  5. Launch the website:
    python3 app.py
  6. Open the site in your browser; it should be running at localhost:5000

(back to top)

Roadmap

I'm pretty happy with how the site turned out. I have some stretch goal ideas (a Twitter bot that posts a new news story every day, a word cloud), but for now it was a great learning experience.

See the open issues for a full list of proposed features (and known issues).

(back to top)

Contact

Scott Henderson (GitHub: https://github.com/smhenderson89)

(back to top)

Acknowledgments

(back to top)