Closed StatsAI closed 2 weeks ago
Fixed by #3218.
Note that the text extracted by the partitioner for this page is modest (about 25 elements) because most of the content is behind a paywall and generated by JavaScript.
But the "preview" content that is actually present in the HTML is correctly partitioned and the <noscript>
tag that was previously rendered (saying "Please enable JS ...") is no longer present.
Describe the bug Only element returned from partition is (unstructured.documents.html.HTMLTitle, 'Please enable JS and disable any ad blocker')
To Reproduce
Expected behavior Partition results (Title, Narrative Text, etc) should be returned
Environment Info Google Colab