Implement: `browsingContext.captureScreenshot`

thiagowfx commented 1 year ago

Tracking

[x] Spec subsubsubsection 7.2.3.1
[x] WPT tests: /webdriver/tests/bidi/browsing_context/capture_screenshot/*
[x] Add implementation to Command Processor
[x] Associated CDP command: Page.captureScreenshot -> need viewport
[x] Underneath mechanism: /session/{id}/screenshot (via WebDriver) --> this is merely an implementation detail
[x] Add sample gradient golden page for testing --> use it to compare (base64-wise) with the implementation -> canvas.base64
[ ] Implement subframe captureScreenshot
[ ] Implement scrollIntoView
[x] Closed frames
[x] https://github.com/GoogleChromeLabs/chromium-bidi/pull/580
[x] Height is slightly different in WPT tests. Maybe it's related to the Chromium bar (automated software) or display differences: https://github.com/GoogleChromeLabs/chromium-bidi/issues/547
[ ] Finish milestone: https://github.com/GoogleChromeLabs/chromium-bidi/milestone/4
- [x] ~~iframes: set captureBeyondViewport to true~~
- [x] ~~Tab focus: https://github.com/GoogleChromeLabs/chromium-bidi/pull/796~~
- [x] Tab focus: Focus originally focused tab after taking the screenshot, or change the spec: https://github.com/w3c/webdriver-bidi/issues/171 / https://github.com/w3c/webdriver-bidi/issues/440
- [ ] Throw error on unsupported edge cases, as per the spec: ~~https://github.com/GoogleChromeLabs/chromium-bidi/pull/797~~ / https://github.com/w3c/webdriver-bidi/issues/441
- [ ] Re-enable headful screenshot tests, if at all possible: https://github.com/GoogleChromeLabs/chromium-bidi/pull/813, https://github.com/GoogleChromeLabs/chromium-bidi/pull/823 and https://github.com/web-platform-tests/wpt/pull/40382 https://github.com/GoogleChromeLabs/chromium-bidi/blob/ebe71327d460e763c5f777afeeea1820963320ca/wpt-metadata/mapper/headful/webdriver/tests/bidi/browsing_context/capture_screenshot/frame.py.ini#L2 and https://github.com/GoogleChromeLabs/chromium-bidi/blob/ebe71327d460e763c5f777afeeea1820963320ca/wpt-metadata/mapper/headful/webdriver/tests/bidi/browsing_context/capture_screenshot/capture_screenshot.py.ini#L2
- [ ] Add e2e test for #851 -> setViewport to X , Y go to page that the height is bigger then Y, screenshot should match X , Y
- [ ] https://bugs.chromium.org/p/chromium/issues/detail?id=1277272: Incorrect trace screenshot dimensions with --enable-automation
- [ ] https://bugs.chromium.org/p/chromium/issues/detail?id=32667: Support background screenshots

thiagowfx commented 1 year ago

sadym-chromium commented 1 year ago

WPT headful test is failing. I temporary disabled it in https://github.com/GoogleChromeLabs/chromium-bidi/pull/548

sadym-chromium commented 1 year ago

DevTools has functionality to capture node screenshot. What it does is: Call callFunctionOn(node) with script:

() => {
   const e = this.getBoundingClientRect(),
   t = this.ownerDocument.documentElement.getBoundingClientRect();
   return JSON.stringify({
      x: e.left - t.left,
      y: e.top - t.top,
      width: e.width,
      height: e.height,
      scale: 1.0
  });
}

And call Page.captureScreenshot with the following params:

{
  "format": "png",
  "quality": 100,
  "fromSurface": true,
  "captureBeyondViewport": true,
  "clip": { 
      RECEIVED_VALUES
  }
}

sadym-chromium commented 1 year ago

The comment above could help implementing the nested frame screenshot.

thiagowfx commented 1 year ago

References:

thiagowfx commented 1 year ago

@sadym-chromium:

const e = this.getBoundingClientRect() seems to be

const metrics = await this.#cdpTarget.cdpClient.sendCommand(
  'Page.getLayoutMetrics'
).cssContentSize;

Do you know how to retrieve "t"?

thiagowfx commented 1 year ago

Debugging:

async captureScreenshot(): Promise<BrowsingContext.CaptureScreenshotResult> {
    // XXX: Either make this a proposal in the BiDi spec, or focus the
    // original tab right after the screenshot is taken.
    // The screenshot command gets blocked until we focus the active tab.
    await this.#cdpTarget.cdpClient.sendCommand('Page.bringToFront'); // window.focus() also works

    const docRect = await this.#cdpTarget.cdpClient.sendCommand(
      'Runtime.callFunctionOn',
      {
        functionDeclaration: `() => {
        const docRect = window.documentElement.getBoundingClientRect();
        return JSON.stringify({
          x: docRect.left,
          y: docRect.top,
        });
      }`,
        executionContextId: this.#defaultRealm.executionContextId,
      }
    );
    const {result: docRectResult} = docRect;
    console.log(docRectResult);

    const metrics = await this.#cdpTarget.cdpClient.sendCommand(
      'Page.getLayoutMetrics'
    );
    // or maybe cssLayoutViewport
    const {cssContentSize: viewport} = metrics;

    debugger;

    const [result] = await Promise.all([
      this.#cdpTarget.cdpClient.sendCommand('Page.captureScreenshot', {
        format: 'png', // XXX: add more formats: jpeg,webp, then add quality for jpeg
        captureBeyondViewport: true,
        clip: {...viewport, scale: 1.0},
      }),
    ]);
    return {
      result: {
        data: result.data,
      },
    };
  }

thiagowfx commented 1 year ago

More debugging:

async captureScreenshot(): Promise<BrowsingContext.CaptureScreenshotResult> {
    // XXX: Either make this a proposal in the BiDi spec, or focus the
    // original tab right after the screenshot is taken.
    // The screenshot command gets blocked until we focus the active tab.
    await this.#cdpTarget.cdpClient.sendCommand('Page.bringToFront'); // window.focus() also works

    const docRect = await this.#cdpTarget.cdpClient.sendCommand(
      'Runtime.callFunctionOn',
      {
        functionDeclaration: `() => { return JSON.stringify(globalThis.document.documentElement.getBoundingClientRect()); }`,
        executionContextId: this.#defaultRealm.executionContextId,
      }
    );
    console.log(docRect);

    const metrics = await this.#cdpTarget.cdpClient.sendCommand(
      'Page.getLayoutMetrics'
    );
    // or maybe cssLayoutViewport
    const {cssContentSize: viewport} = metrics;

    debugger;

    const [result] = await Promise.all([
      this.#cdpTarget.cdpClient.sendCommand('Page.captureScreenshot', {
        format: 'png', // XXX: add more formats: jpeg,webp, then add quality for jpeg
        captureBeyondViewport: true,
        clip: {...viewport, scale: 1.0},
      }),
    ]);
    return {
      result: {
        data: result.data,
      },
    };
  }

thiagowfx commented 7 months ago

Implement scrollIntoView

This one can be closed, as it was removed from the BiDi spec.

GoogleChromeLabs / chromium-bidi

Implement: `browsingContext.captureScreenshot` #514

Tracking