raznem / parsera

Lightweight library for scraping web-sites with LLMs
https://parsera.org
GNU General Public License v2.0
859 stars 55 forks source link

Is docker version in roadmap? #13

Closed winglight closed 4 weeks ago

winglight commented 2 months ago

I tried to install in Ubuntu but failed to playwright install and failed to "pip install parsera" in WSL2.

If there's a docker version, It'll be appreciated.

raznem commented 2 months ago

Hi, @winglight what are you looking for in such a docker? A Python environment with installed Parsera?

winglight commented 2 months ago

Hi, @winglight what are you looking for in such a docker? A Python environment with installed Parsera?

Like I said, I can't intall parsera as the instructions of readme in my pc.

My env: Ubuntu(host and WSL2) + python3.9/3.11.

raznem commented 2 months ago

Since you are facing issues with playwright installation you can try one of Playwright's dockers. I am using mcr.microsoft.com/playwright/python:v1.46.0-jammy. Playwright is installed there, so I only need to install the library.

raznem commented 1 month ago

@winglight today PR with docker from @danyathecoder was merged, now you can run Parsera as a CLI tool or docker container.

winglight commented 1 month ago

@winglight today PR with docker from @danyathecoder was merged, now you can run Parsera as a CLI tool or docker container.

Great job. I'd like to give it a try in a few days.

Toowiredd commented 1 month ago

I had a successful scrape in powershell today mate... still havent had the website work for me though :(

danyathecoder commented 1 month ago

@Toowiredd could you provide more details about your issue?

Toowiredd commented 1 month ago

Emailed you mate