Type: scraper | Language: Python | Status: Public
18 lines of code. 2,090 rows of data.
Created 1 year, 3 months ago.
Demonstration scraper for the ebook Scraping for Journalists. This third scraper tries to address the error raised by the previous version:

"UnicodeDecodeError: 'utf8' codec can't decode byte 0xa3 in position 1: invalid start byte"

This tells us that the error is caused by the first character in the second row of the column 'Invoice amount'. That character is a pound sign (£), which is stored as the single byte 0xa3 in Latin-1 encoding; on its own that byte is not valid UTF-8, hence "invalid start byte". This scraper removes it.
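
The scraper's own code is not reproduced here, so the following is only a minimal sketch of the kind of fix described above, written for Python 3. It assumes the source data is Latin-1 encoded; the function name `clean_amount` and the sample cell value are hypothetical and chosen purely for illustration.

```python
def clean_amount(raw: bytes) -> str:
    """Return the cell text with any pound sign removed."""
    # Assumption: the source is Latin-1, where byte 0xa3 is the pound sign.
    text = raw.decode("latin-1")
    # Drop the pound sign (U+00A3) and any surrounding whitespace.
    return text.replace("\u00a3", "").strip()


# Hypothetical 'Invoice amount' cell: a pound sign followed by digits.
raw_cell = b"\xa3" + b"1,250.00"

# Decoding the raw bytes as UTF-8 reproduces the kind of error quoted above.
try:
    raw_cell.decode("utf-8")
except UnicodeDecodeError as err:
    print("UTF-8 decoding failed:", err.reason)

# With the pound sign removed, the value is plain ASCII text.
print(clean_amount(raw_cell))
```

Decoding as Latin-1 before stripping the character is one option among several; if the amounts are needed with their currency symbol intact, decoding as Latin-1 and storing the result as Unicode text would avoid the error without discarding anything.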