mmshress / INLP-WS23

Group project for the INLP course
0 stars 1 forks source link

Create a Dataset #12

Closed KushalGaywala closed 11 months ago

KushalGaywala commented 11 months ago

Use the CELEX NUMBERS that @KushalGaywala downloaded and @mmshress processed #3

  1. Fetch all the articles then create a dataset #2
  2. Just fetch the whole HTML for the fetching
  3. Find a way to Generalize the HTML tags used
  4. Fetch the important parts from the articles