Web Scraper Web Scraper + Google Sheets Google Spreadsheets

 =

KPIBees Google Add-on.

Web Scraping to Google Sheets

We live in a world that’s full data driven. Every decision that we make should rely on data which is available at our finger tips. Almost always the data is available within the countable infinite of web pages out there! Even more so, our data should be well organized and easily accessible! It’s not enough to simply Web Scrape a website, you will also need to get that data in a place where you can easily study and manipulate it – you got it, we’re talking about a Spreadsheet!

KPIBees is different than the other Web Scraper out there, as it delivers the data exactly where you need it – in a Spreadsheet! KPIBees is a Google Sheet Add-on that makes it easier to pull data from various tool/data formats/websites to Google Sheets. You don’t need a chrome extension to web scrape, or an expensive tool like UIPath, a simple spreadsheet add-on that gets the data where you want it will do just fine!

The tool can help you periodically web scrape certain values/texts from a web page or even assist in web scrapping the data that you need for your project.

How to install our free Web Scraper within Google Sheets

One easy way to import JSON to Google Sheets is with the Google Sheets Add-on KPIBees. With just a few clicks, JSONs can be imported into Google Sheets. 

STEP 1

Open KPIBees Gsuite Marketplace listing and install the Google Sheets Add-on.

STEP 2

You will need to provide standard add-on access to KPIBees – don’t worry, we are 100% GDPR compliant. We don’t access/use/resell any of the data.

STEP 3

Open any spreadsheet and go the Add-ons toolbar. Find KPIBees add-on and click on Launch.

Google Sheet Add-on location toolbar.

STEP 4

Click on the Web Scraper Google Sheets connector to create your first query!

Web Scraper connector Google Sheets

STEP 5

Let’s say we want to webscrape the view count of Ed Sheeran’s youtube video.

We need the get the video link and also the XPath that identifies the view count html element.

This is how you can easily retrieve the XPath using Chrome – also check the video for this web scraping example.

Youtube video web scraping

STEP 6

Add the url and the XPath. In this example, we are using the “Get Text Content” retrieval strategy which matches our desired use case.

Web scraper Google Sheets example

STEP 7

Click on create query to pull the data to the selected spreadsheet cell! You can also automate web scrapping by adding triggered refreshing to your spreadsheet.

Automatically web scrape website text

How to Web Scrape table from website

Let’s say you want to web scrape an html table from a website and bring the contents of that table to the spreadsheet. For example, we want to bring the first table in order of occurrence from the Oldest People wikipedia page.

Oldest people wikipedia page

Select the web scraper connector, and the website url, and set the table index that should should be parsed to google sheets. In the Advanced section you can set a lot of others details, such as login parameters, how much time to wait until the page is loaded before beginning to web scrape the content etc. Then click on “Create Query”.

Web scrape table from website

After the query is created, the content is pulled to the selected spreadsheet cell. If you to select another table from page, simply select the order in which it occurs in “Which table to retrieve” and run the query again!

Web scrape table to google sheets

Google Search Console API 

In terms of Web Scraping a website, there are several strategies to pull the data that KPIBees offers:

  • Get Text Content: Using this strategy, you can very accurately web scrape a text value from a web page. You just need to right click an element, choose inspect, identify the html element that contains your content, and copy the XPath values. (check the above video for how to do this)
  • Get Table from page: By choosing this content strategy, you can web scrape a table from a web page. You just need to specify the url and which html table to web scrape in the order of occurrence.
  • Get auto-detected relevant page content: Ok, so this is where things get a little complicated – for us that is! KPIBees uses an AI algorithm to determine the most important parts of the page which should be web scraped. The web scraping algorithm attempts to figure out the most important parts that need to be scraped and returns them to the spreadsheet.
    For example, if there’s a table that is not implemented with regular tags, it will identify it. Of course, the web is vast and there are plenty of anomalies that can be encountered, so this method does not always produce the best result. In case it the output result is not satisfying, you should try to narrow the content by providing an XPath. This will point out to the algorithm that you are interest in content from a specific element.
    If this still doesn’t work, we’re more than glad to help you out, so make sure to contact us!