FriendsOfPHP / Goutte

Goutte, a simple PHP Web Scraper
MIT License
9.26k stars 1.01k forks source link

Scraping AliExpress Images #352

Closed JakeHenshall closed 6 years ago

JakeHenshall commented 6 years ago

Hi,

I'm struggling to scrape a full page of Aliexpress image. It gets all of the alt tags, and the first 8 images.

<?php

require 'vendor/autoload.php';

use Goutte\Client;

$url = "https://www.aliexpress.com/af/tie.html?SearchText=tie";

$client = new Client();

$crawler = $client->request('GET', $url);

$output = $crawler->filter('#hs-below-list-items li div div.img.img-border div a img')->each(function ($node) {

    echo '<img src="' . $node->attr('src') . '" alt="' . $node->attr('alt') . '">';

});

var_dump($output);

Is this something todo with AliExpress Lazy Loading in the images possible?

Any Help would be greatly appreciated.

Thanks, Jake.