bhargavnova / python-helper-modules

MIT License

Extract JSON Data from HTML `<script>` Tags #35

Closed bhargavnova closed 9 months ago

bhargavnova commented 9 months ago

Objective

Develop a Python script or module that extracts JSON (JavaScript Object Notation) data embedded within HTML `<script>` tags. The tool should give a convenient way to parse and retrieve JSON data from web pages for further processing or analysis.

Features

Example Usage

```python
import json_extractor

# Extract JSON data from an HTML file
json_data = json_extractor.extract_from_html('sample.html')

# Extract JSON data from a URL
json_data = json_extractor.extract_from_url('https://example.com')
```
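For anyone picking this up, here is one possible shape for such a module, as a minimal standard-library sketch rather than the required design. The issue only names `extract_from_html` and `extract_from_url`; the `extract_from_string` helper, the `_ScriptCollector` class, the list return type, and the rule of keeping only scripts whose whole body parses as a JSON object or array are all assumptions made for illustration.

```python
import json
from html.parser import HTMLParser
from urllib.request import urlopen


class _ScriptCollector(HTMLParser):
    """Hypothetical helper: collects the raw text of every <script> element."""

    def __init__(self):
        super().__init__()
        self._in_script = False
        self._chunks = []
        self.scripts = []

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            self._in_script = True
            self._chunks = []

    def handle_data(self, data):
        # Script content may arrive in several pieces, so buffer it.
        if self._in_script:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_script:
            self.scripts.append("".join(self._chunks))
            self._in_script = False


def extract_from_string(html):
    """Return parsed JSON for each <script> whose entire body is a JSON object or array."""
    collector = _ScriptCollector()
    collector.feed(html)
    collector.close()
    results = []
    for body in collector.scripts:
        try:
            value = json.loads(body)
        except json.JSONDecodeError:
            continue  # ordinary JavaScript or an empty tag, not a JSON payload
        if isinstance(value, (dict, list)):
            results.append(value)
    return results


def extract_from_html(path):
    """Read an HTML file (assumed UTF-8) and extract its embedded JSON."""
    with open(path, encoding="utf-8") as handle:
        return extract_from_string(handle.read())


def extract_from_url(url, timeout=10):
    """Fetch a page and extract its embedded JSON (the timeout value is an assumption)."""
    with urlopen(url, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        html = response.read().decode(charset, errors="replace")
    return extract_from_string(html)
```

This only catches scripts that contain nothing but JSON, such as `<script type="application/ld+json">` blocks. JSON assigned to a JavaScript variable inside a script would need extra handling that this sketch does not attempt.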

Difficulty: Intermediate/Advanced

Tags: Python, JSON, HTML, Web Scraping, Data Extraction

Additional Information

Contribution Guidelines

The updated guidelines can be found here.

Note:

Ninnjah commented 9 months ago

I'll take this.

bhargavnova commented 9 months ago

Sure, @Ninnjah! Thanks for picking this up. Just keep the guidelines in mind.

Happy Coding!