Puppeteer: Automating Tasks With Headless Chrome

December 20, 2017
7 min. read

Puppeteer is a project from Chrome's DevTools team that provides a high-level API for automating Chrome in headless mode, that is, Chrome running without a graphical user interface. Headless browsers provide automated control of a web page in an environment similar to regular Chrome, but they are driven from a command-line interface or over network communication.

The idea behind headless browsers like PhantomJS, Headless Chrome or Headless Firefox is to automate tasks like testing and taking screenshots of the pages visited.

As we go through some examples we'll explore the Puppeteer API in some (but not all) detail. For a deep look at the API check the [API docs](https://github.com/GoogleChrome/puppeteer/blob/master/docs/api.md).

## Capturing page screenshots

The first way we'll use Puppeteer is to generate screenshots of a web page or app. We'll take advantage of Puppeteer's predefined device descriptions to ease the workload and generate a PNG screenshot of my personal blog.

The code to do so looks like this:

```javascript
const puppeteer = require('puppeteer');
const devices = require('puppeteer/DeviceDescriptors');

(async () => {
  const browser = await puppeteer.launch();
  const page = await browser.newPage();
  // Use the predefined iPad Pro viewport and user agent
  await page.emulate(devices['iPad Pro']);
  await page.goto('https://rivendellweb.net');
  await page.screenshot({ path: 'full.png' });
  await browser.close();
})();
```

From top to bottom, the script:

1.- Loads the required modules.

2.- Launches the browser with [puppeteer.launch](https://github.com/GoogleChrome/puppeteer/blob/master/docs/api.md#puppeteerlaunchoptions) and stores the instance in a variable. The call takes an optional object used to configure the Chromium instance.

3.- Creates a new page object using [browser.newPage()](https://github.com/GoogleChrome/puppeteer/blob/master/docs/api.md#browsernewpage).

4.- Configures the viewport and user agent the page will use with [page.emulate](https://github.com/GoogleChrome/puppeteer/blob/master/docs/api.md#pageemulateoptions).
Instead of writing the viewport object by hand, you can pass one of the predefined values from `puppeteer/DeviceDescriptors`. The descriptors pre-populate all the viewport values. I normally write the raw viewport values when I need a custom viewport and use the Device Descriptors otherwise.

5.- Tells Puppeteer where to go and when to consider the page loaded using [page.goto](https://github.com/GoogleChrome/puppeteer/blob/master/docs/api.md#pagegotourl-options). It returns a Promise that resolves to the main resource response. In case of multiple redirects, the navigation resolves with the response of the last redirect.

6.- Takes the screenshot with `page.screenshot`, using the options object to configure it.

7.- Runs `browser.close()` to close the browser.

### Capturing full-screen images

There are times when I would like to see all the content of the page, even if it goes beyond the default screen size for the device I've chosen to test with (in this case an iPad Pro in portrait mode).

We can add a second option to the object passed to `page.screenshot` to indicate this. `fullPage` is a boolean value that, when true, takes a screenshot of the full scrollable page. It defaults to false.

```javascript
const puppeteer = require('puppeteer');
const devices = require('puppeteer/DeviceDescriptors');

(async () => {
  const browser = await puppeteer.launch();
  const page = await browser.newPage();
  await page.emulate(devices['iPad Pro']);
  await page.goto('https://rivendellweb.net', {
    waitUntil: 'networkidle2'
  });
  // fullPage captures the whole scrollable page, not just the viewport
  await page.screenshot({ path: 'full.png', fullPage: true });
  await browser.close();
})();
```

### Disabling headless mode

There are times when we need to see what the headless browser is doing, either to troubleshoot or just because we're curious. Puppeteer provides two tools to accomplish this as part of the options for `puppeteer.launch`:

- `headless` is a boolean that controls whether the browser is launched in headless mode.
  Using `false` as the value disables headless mode and lets you see what the browser is doing
- `slowMo` slows down each Puppeteer operation by the specified number of milliseconds. This may let you actually see what the browser is doing, since the process may otherwise be too fast to follow

The revised code looks like this:

```javascript
const puppeteer = require('puppeteer');
const devices = require('puppeteer/DeviceDescriptors');

(async () => {
  const browser = await puppeteer.launch({
    headless: false, // show the browser window
    slowMo: 250,     // delay each operation by 250 ms
  });
  const page = await browser.newPage();
  await page.emulate(devices['iPad Pro']);
  await page.goto('https://rivendellweb.net');
  await page.screenshot({ path: 'full.png' });
  await browser.close();
})();
```

### Device emulation

`DeviceDescriptors` contains a set of predefined device descriptions that make it easier to use Puppeteer without having to manually tweak the configuration. Each description provides the following preconfigured information:

- userAgent string
- viewport
  - width
  - height
  - deviceScaleFactor
  - isMobile
  - hasTouch
  - isLandscape

For the list of supported devices check [DeviceDescriptors.js](https://github.com/GoogleChrome/puppeteer/blob/master/DeviceDescriptors.js) in the Puppeteer GitHub repository.

### When to consider the page loaded?

Particularly when working with lazy-loaded resources, being able to interact with the page doesn't necessarily mean that it has finished loading. There may be videos that are still loading or images whose intersection observers haven't triggered. It's important to be able to tell Puppeteer when we're done.

`waitUntil` is an optional parameter for `page.goto` that accepts a single event string or an array of them. Given an array of one or more event strings, it considers navigation to be successful after all the events have fired.
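The array form can be sketched as a small wrapper. This is a minimal sketch rather than anything from the scripts in this post: `gotoWhenSettled` is a hypothetical name, and the two event names are just one possible combination of the events listed next.

```javascript
// Hypothetical helper (not part of Puppeteer): navigate to a URL and wait
// for more than one event before treating the page as loaded.
async function gotoWhenSettled(page, url) {
  // With an array, navigation only resolves after every listed event has fired.
  return page.goto(url, { waitUntil: ['load', 'networkidle2'] });
}

// Hypothetical usage, assuming `page` came from browser.newPage():
//   await gotoWhenSettled(page, 'https://rivendellweb.net');
```

Passing a single string, as the scripts in this post do, is enough when one event describes "done" well for the page being tested.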
Events can be:

- **load** - consider navigation to be finished when the load event is fired
- **domcontentloaded** - consider navigation to be finished when the DOMContentLoaded event is fired
- **networkidle0** - consider navigation to be finished when there are no network connections for at least 500 ms
- **networkidle2** - consider navigation to be finished when there are no more than 2 network connections for at least 500 ms

```javascript
const puppeteer = require('puppeteer');
const devices = require('puppeteer/DeviceDescriptors');

(async () => {
  const browser = await puppeteer.launch();
  const page = await browser.newPage();
  await page.emulate(devices['iPad Pro']);
  // Treat the page as loaded once at most 2 connections remain for 500 ms
  await page.goto('https://rivendellweb.net', {
    waitUntil: 'networkidle2'
  });
  await page.screenshot({ path: 'full.png' });
  await browser.close();
})();
```

## Changes to package.json

To save myself from typing all the commands to generate screenshots, and to make sure Jest (described in the next section) works as intended, I've added the following blocks to my `package.json` file.

The first block, `scripts`, defines the command that runs with `npm test`, a shorter way of running Jest in verbose mode. The other entries run the screenshot scripts using `npm run` followed by the name of the script.

The second block, `jest`, configures Jest by disabling automock and setting the pattern for test file names (all files that end with `_test.js`).

```
"scripts": {
  "test": "jest --verbose",
  "screenshot": "node screenshot/screenshot.js",
  "screenshot-full": "node screenshot/screenshot-full.js",
  "headfull": "node screenshot/screenshot-headfull.js"
},
"jest": {
  "automock": false,
  "testRegex": "\\_test\\.js$"
}
```

## Page Testing

Expanding on the article at [UI testing with Jest and Puppeteer: an introduction](https://www.valentinog.com/blog/ui-testing-jest-puppetteer/) we'll look at how to test a form and the UI of the page.
The form is available in the repository for this article at [https://github.com/caraya/jest-puppeteer/blob/master/testing/form.html](https://github.com/caraya/jest-puppeteer/blob/master/testing/form.html)

We'll use the following libraries:

- [Jest](https://facebook.github.io/jest/): a testing framework by Facebook. Jest provides a platform for automated testing along with a basic assertion library (Expect)
- [Puppeteer](https://github.com/GoogleChrome/puppeteer): a Node.js library for controlling headless Chrome
- [Faker](https://www.npmjs.com/package/faker): a Node.js library for generating random data like names, phones and addresses

In addition, we'll set up Babel, its `preset-env` preset, and the Babel integration for Jest.

The command to install the required Node modules is:

```bash
npm i -D jest puppeteer faker \
babel-core babel-jest babel-preset-env
```

Installing the required Node packages may take a long time. This is because Puppeteer installs a local version of Chromium (the open source project Chrome is based on) and ties its functionality to the specific version it installs. You can force Puppeteer to use your locally installed version of Chrome or Chromium, but it's not guaranteed to work.

Once the modules are installed, we can start working on our testing script.

First, we import all our module dependencies. We're using ES6 module syntax, which is why we installed Babel and babel-jest.

```javascript
import faker from "faker";
import puppeteer from "puppeteer";
import devices from "puppeteer/DeviceDescriptors";
```

Next we set up the variables and constants we'll use throughout the script.
These variables are:

- `APP` points to the URL for the page we want to test
- `lead` is an object holding randomly generated data created using Faker
- `page` and `browser` are Puppeteer variables we'll use later

```javascript
const APP = "https://caraya.github.io/jest-puppeteer/testing/form.html";

const lead = {
  name: faker.name.firstName(),
  email: faker.internet.email(),
  phone: faker.phone.phoneNumber(),
  message: faker.random.words()
};

let page;
let browser;
```

The next two functions are part of Jest. They run once before and once after all the tests in the file, respectively.

`beforeAll` sets up Puppeteer by launching it, opening a new page, emulating an iPad Pro, going to the page we want to test and waiting until all network connections are finished.

I've chosen to use an iPad Pro as my emulated testing device rather than use the options for `puppeteer.launch()` to generate custom dimensions for the browser. The actual testing device is not important for this test. It may be for yours.

`afterAll` closes the browser.

```javascript
beforeAll(async () => {
  browser = await puppeteer.launch();
  page = await browser.newPage();
  await page.emulate(devices['iPad Pro']);
  await page.goto(APP, {
    waitUntil: 'networkidle0'
  });
});

afterAll(async () => {
  // Wait for the browser to shut down before Jest finishes
  await browser.close();
});
```

The first test uses Puppeteer to navigate and fill out a form.

`page.waitForSelector` waits for the selector to appear in the page. If the selector already exists when the method is called, it returns immediately.

`page.click` fetches an element with the selector, scrolls it into view if needed, and then uses `page.mouse` to click in the center of the element. If there's no element matching the selector, the method throws an error.

`page.type` sends a keydown, keypress/input, and keyup event for each character in the text. In the test below, it fills each field with the corresponding value from our `lead` object generated with Faker.
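Every field in the form gets the same click-then-type pair, so the pattern could be factored out. A minimal sketch, assuming a hypothetical `fillField` helper that is not part of Puppeteer and is not used by the test that follows:

```javascript
// Hypothetical helper (not part of Puppeteer or the original test):
// focuses a field by clicking it, then types the value into it.
async function fillField(page, selector, value) {
  await page.click(selector);       // throws if nothing matches the selector
  await page.type(selector, value); // sends key events for each character
}

// Hypothetical usage inside a test, assuming `page` and `lead` are set up as above:
//   await fillField(page, "input[name=name]", lead.name);
//   await fillField(page, "input[name=email]", lead.email);
```

The test keeps the calls explicit instead, which makes it easy to see which step failed when a selector doesn't match.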
```javascript
describe("Contact form", () => {
  test("lead can submit a contact request", async () => {
    await page.waitForSelector("form");
    await page.click("input[name=name]");
    await page.type("input[name=name]", lead.name);
    await page.click("input[name=email]");
    await page.type("input[name=email]", lead.email);
    await page.click("input[name=tel]");
    await page.type("input[name=tel]", lead.phone);
    await page.click("textarea[name=message]");
    await page.type("textarea[name=message]", lead.message);
    await page.click("input[type=checkbox]");
    // await page.click("button[type=submit]");
  }, 16000); // per-test timeout in milliseconds
});
```

The second test suite is more traditional and uses a combination of Puppeteer and Jest to perform assertion tests. Each test stores the value we want to check in a constant and then uses an expect-style assertion to compare it against the value we want.

The first test checks that the title of the page is correct.

The second test checks that there is at least one element with class `navbar` in the page. `page.$$eval` runs `document.querySelectorAll` inside the page and passes the matching elements to a callback.

The final test checks that there are 6 elements with the `field` class. It uses `page.$$eval` to collect the elements with class `field` and then tests that there are 6 of them.

```javascript
describe("Testing the frontend", () => {
  test("assert that the title is correct", async () => {
    const title = await page.title();
    expect(title).toBe("Demo form");
  });

  test("assert that a div named navbar exists", async () => {
    // $$eval always passes an array, so check that it is not empty
    const navbar = await page.$$eval(".navbar", els => els.length > 0);
    expect(navbar).toBe(true);
  });

  test("assert that there are 6 fields", async () => {
    const fieldCount = await page.$$eval(".field", fields => fields.length);
    expect(fieldCount).toBe(6);
  });
  // Insert more tests starting from here!
});
```

## There is more

If you look at the API docs for Puppeteer you'll see that there's plenty more you can do and more elaborate tests you can write.
We could turn the testing section into a full [Test-Driven Development](https://www.wikiwand.com/en/Test-driven_development) environment by writing the tests first and then the code to make them pass.

Although this is a Chrome-only tool, I'm excited to see what else we can do with it.

## Links and Resources

- [Puppeteer API docs](https://github.com/GoogleChrome/puppeteer/blob/master/docs/api.md)
- [Puppeteer examples](https://github.com/GoogleChrome/puppeteer/tree/master/examples)
- [Making your UI Tests Resilient To Change](https://blog.kentcdodds.com/making-your-ui-tests-resilient-to-change-d37a6ee37269)
- [UI testing with Jest and Puppeteer: an introduction](https://www.valentinog.com/blog/ui-testing-jest-puppetteer/)
- [Getting started with Jest](https://facebook.github.io/jest/docs/en/getting-started.html)