ScraperWiki

Browse Archived Scrapers

  • Type scraper
    Language python
    Status Public
    Notes Maintenance required

    bjwebb / Luton Museums

    133 lines of code. 8 rows of data.

    Created 3 years, 10 months ago.

    http://www.youth.luton.gov.uk/13.cfm?p=881

  • Type scraper
    Language python
    Status Public
    Notes Maintenance required

    memespring / Greater London Assembly Expenditure

    81 lines of code. 8,969 rows of data.

    Created 3 years, 10 months ago.

    Where the money went!

  • Type scraper
    Language python
    Status Public

    Tom Mortimer-Jones / metadata2

    1 line of code. No rows of data yet.

    Created 3 years, 10 months ago.

  • Type scraper
    Language python
    Status Public

    goatchurch / de montfort courses

    25 lines of code. 716 rows of data.

    Created 3 years, 10 months ago.

    For now, just some of the course costs.

  • Type scraper
    Language python
    Status Public
    Notes Maintenance required

    petri / tanssi.net

    30 lines of code. 1,042 rows of data.

    Created 3 years, 10 months ago.

    Scraper that gets upcoming dance events in Finland from the tanssi.net service.

  • Type scraper
    Language python
    Status Public

    goatchurch / NHS trust locations

    178 lines of code. 6,178 rows of data.

    Created 3 years, 10 months ago.

    to go out

  • Type scraper
    Language python
    Status Public

    jcranch / NHS Primary Care Trusts

    180 lines of code. 27,216 rows of data.

    Created 3 years, 10 months ago.

    Gets information on all NHS Primary Care Trusts and their hospitals, doctors, dentists, pharmacies, opticians and other services. Includes geolocating.

  • Type scraper
    Language python
    Status Public

    Julian_Todd / Tutorial lxml

    35 lines of code. No rows of data yet.

    Created 3 years, 10 months ago.

    Using lxml, an alternative library to BeautifulSoup.

  • Type scraper
    Language python
    Status Public

    Julian_Todd / startup-hello world

    6 lines of code. No rows of data yet.

    Created 3 years, 10 months ago.

    ss

  • Type scraper
    Language python
    Status Public

    Julian_Todd / Python Startup

    20 lines of code. 5 rows of data.

    Created 3 years, 10 months ago.

    Basic scraper as a startup example: it retrieves a page, extracts tags, and saves them to the datastore.

  • Type scraper
    Language python
    Status Public
    Notes Maintenance required

    NickBarnes / 2010 general election results

    64 lines of code. 4,801 rows of data.

    Created 3 years, 10 months ago.

    Scrapes the 2010 general election results from the BBC News website.

  • Type scraper
    Language python
    Status Public

    sebbacon / Marginal constituencies, 2010 UK General Election

    20 lines of code. 5 rows of data.

    Created 3 years, 10 months ago.

    A list of marginal constituencies for the 2010 UK general election (just the names), scraped from Wikipedia.

  • Type scraper
    Language python
    Status Public

    sebbacon / Marginal constituencies, 2010 UK General Election

    28 lines of code. 39 rows of data.

    Created 3 years, 10 months ago.

    A list of marginal constituencies for the 2010 UK general election (just the names), scraped from Wikipedia.

  • Type scraper
    Language python
    Status Public
    Notes Maintenance required

    mowen / Sefton MBC Road Accidents

    67 lines of code. 922 rows of data.

    Created 3 years, 10 months ago.

    Road accidents within Sefton Metropolitan Borough.

  • Type scraper
    Language python
    Status Public

    mowen / Sefton MBC Council Meetings

    84 lines of code. 1,102 rows of data.

    Created 3 years, 10 months ago.

    Scrapes the Sefton Metropolitan Borough modern.gov website to get the latest scheduled council meetings. Should be adaptable to other councils that use modern.gov.

  • Type scraper
    Language python
    Status Public

    pezholio / Walsall Planning Applications

    50 lines of code. No rows of data yet.

    Created 3 years, 11 months ago.

    Weekly list of planning applications from the Walsall Council website.

  • Type scraper
    Language python
    Status Public

    jcranch / 2005 General Election results from Electoral Commission

    121 lines of code. 483 rows of data.

    Created 3 years, 11 months ago.

    Scrapes the Electoral Commission's results data for the 2005 general election to the UK Parliament.

  • Type scraper
    Language python
    Status Public

    AlexHarrowell / Sharjah Airport Movements

    65 lines of code. No rows of data yet.

    Created 3 years, 11 months ago.

    Current state of Sharjah airport

  • Type scraper
    Language python
    Status Public

    stefanw / German MP Income

    93 lines of code. 1,264 rows of data.

    Created 3 years, 11 months ago.

    Extracts MPs' auxiliary incomes from the Bundestag website, relying on D-API for URLs.

  • Type scraper
    Language python
    Status Public

    AlexHarrowell / New Parliamentary Nominations 2010

    71 lines of code. 909 rows of data.

    Created 3 years, 11 months ago.

    Scrapes the Press Association's list of all parliamentary nominations for the General Election, providing candidates and their party affiliation for each constituency.
