Closed jaswanth-13 closed 3 days ago
i want to give html code as a string , but when i am trying it is giving me error
import requests from docling.document_converter import DocumentConverter doc_converter = DocumentConverter() html_content = request.get('https://en.wikipedia.org/wiki/Cricket').text docling_doc = doc_converter.convert(html_content)
this is giving error
OSError: [Errno 36] File name too long: '<!DOCTYPE html>\n<html class="client-nojs vector-feature-language-in-header-enabled vector-feature-language-in-main-page-header-disabled vector-feature-sticky-header-disabled vector-feature-page-tools-pinned-disabled vector-feature-toc-pinned-clientpref-1 vector-feature-main-menu-pinned-disabled vector-feature-limited-width-clientpref-1 vector-feature-limited-width-content-enabled vector-feature-custom-font-size-clientpref-1 vector-feature-appearance-pinned-clientpref-1 vector-feature-night-mode-enabled skin-theme-clientpref-day vector-toc-available" lang="en" dir="ltr">\n<head>\n<meta charset="UTF-8">\n<title>India - Wikipedia</title>\n<script> ..........
You have two options:
convert()
Question
i want to give html code as a string , but when i am trying it is giving me error
this is giving error