Clueless-Community / scrape-up

A web-scraping-based Python package that enables you to scrape data from various platforms like GitHub, Twitter, Instagram, or any useful website.
https://pypi.org/project/scrape-up/
MIT License
247 stars 250 forks

Feat: Scraping PubMed Data using BioPython and Beautiful Soup #937

Open VarsaGupta opened 1 month ago

VarsaGupta commented 1 month ago

Describe the feature

Problem: PubMed offers crucial insights in the life sciences, but extracting its data with general-purpose libraries such as Beautiful Soup and Selenium can be slow, particularly for large sets of research papers.

Solution: BioPython offers a faster and more efficient alternative. With functionality tailored to biological data analysis, it streamlines retrieving PubMed data, letting researchers access the results in a few seconds.

My approach: I will provide Python scripts using both Beautiful Soup and BioPython for scraping large sets of PubMed research papers, showcasing the efficiency of BioPython.
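As a rough illustration of the Beautiful Soup side of this comparison, here is a minimal sketch that parses PubMed-style search-result markup. It is not the script from the pull request: the function name `parse_search_results`, the embedded `SAMPLE_HTML`, and the CSS class names (`full-docsum`, `docsum-title`, `docsum-authors`, `docsum-pmid`) are assumptions modelled on PubMed's search page and may not match the live site.

```python
from bs4 import BeautifulSoup

# Hypothetical sample modelled on a PubMed search-results page.
# The class names below are assumptions and may differ from the live markup.
SAMPLE_HTML = """
<article class="full-docsum">
  <a class="docsum-title" href="/12345678/">
    Example article title.
  </a>
  <span class="docsum-authors full-authors">Doe J, Roe R.</span>
  <span class="docsum-pmid">12345678</span>
</article>
<article class="full-docsum">
  <a class="docsum-title" href="/23456789/">Another <b>example</b> title.</a>
  <span class="docsum-authors full-authors">Smith A.</span>
  <span class="docsum-pmid">23456789</span>
</article>
"""


def parse_search_results(html):
    """Return a list of dicts (pmid, title, authors) parsed from results HTML."""
    # "html.parser" is the parser bundled with Python, so no extra install is needed.
    soup = BeautifulSoup(html, "html.parser")
    results = []
    for item in soup.find_all("article", class_="full-docsum"):
        title = item.find("a", class_="docsum-title")
        authors = item.find("span", class_="docsum-authors")
        pmid = item.find("span", class_="docsum-pmid")
        results.append({
            "pmid": pmid.get_text(strip=True) if pmid else None,
            # Collapse line breaks and runs of spaces inside the title.
            "title": " ".join(title.get_text().split()) if title else None,
            "authors": authors.get_text(strip=True) if authors else None,
        })
    return results


if __name__ == "__main__":
    for row in parse_search_results(SAMPLE_HTML):
        print(row)
```

To run this against a live page, the HTML would first have to be fetched with an HTTP client, which is left out here so the sketch stays self-contained.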

kashishrajputt commented 1 month ago

I would like to work on this issue. Please assign it to me.

nikhil25803 commented 1 month ago

Hey @VarsaGupta, the reason we use Beautiful Soup is that it is lightweight; packages like BioPython are much heavier. If you want to try this without BioPython, you can continue with the issue; otherwise, you can close it.

VarsaGupta commented 1 month ago

@nikhil25803, okay, I will give it a try using Beautiful Soup. Could you please assign it a label?

VarsaGupta commented 1 month ago

Hey @nikhil25803, can you please look into my pull request?