exorde-labs / exorde-client

Exorde participation module (Testnet 2022-2023) - CLI
GNU General Public License v3.0
1.11k stars 86 forks source link

Bounty #1 : Linkedin scraper #28

Open putxe opened 1 year ago

putxe commented 1 year ago

🏆 Rewards: 2000 EXD 🏆

💡 Project Overview

We offer an exciting bounty for developers who can create a Python scraper that fetches relevant posts and comments from LinkedIn.com. The scraper should be able to log in to a LinkedIn account, generate a search URL based on provided keywords, and extract posts and comments posted within the last 120 seconds.

For the complete, detailed technical specifications and information about constraints, rules, interface, parameters, and wanted output format, please refer to our CONTRIBUTING.md file.

🔍 Task Description

Your role, as a Python developer, is to create a powerful scraper that accomplishes the following tasks:

✨ Integration and Authentication

As a developer, it's crucial to integrate the scraper to maximize efficiency and user experience.

🔗 Submission

Let's make the web scraping experience on LinkedIn seamless and productive! 🚀🌟

To participate in this bounty, please create a Pull Request and link it to this issue. We look forward to reviewing your submission and wish you happy coding!

whizsid commented 12 months ago

I am attempting to this.

whizsid commented 11 months ago

@putxe @MathiasExorde Linkedin is asking for a confirmation code from an email after logged using an automated browser. So we have to pass IMAP configurations of the passed e-mail box. Do you have any concerns regarding this?

MathiasExorde commented 11 months ago

Our usual approach has been to inform users about the required action through logs and future push notifications using 'ntfy.sh'. It is the plan for Twitter & other scrapers. Then, we leave it to the users to take the necessary action, which may vary in nature (e.g., email reconfirmation like Twitter or completing a Captcha).

However, for the purposes of this experiment, we are considering the possibility of implementing IMAP and automating the process of reading emails to see if it is feasible. This would allow us to determine whether we should automate email processing or continue with our current method of notifying users about the required action. Feel free to proceed as you see fit! If we notice that this introduces a significant amount of complexity or additional work on your end, we will, of course, take that into consideration