#csv

96 guides

Scrape Live Stock Data from Yahoo Finance

Build a Yahoo Finance watchlist scraper in Python: pull current quote snapshots, day ranges, and volume from quote pages, then export a clean CSV using a ProxiesAPI-ready fetch layer.

Web Scraping with Ruby: Nokogiri + HTTParty Tutorial

Learn web scraping with Ruby by pairing HTTParty for fetching and Nokogiri for parsing, then add retries, pagination, and CSV export in a practical end-to-end tutorial.

Web Scraping with R: rvest + httr2 Tutorial

Learn practical web scraping in R with httr2 for requests and rvest for parsing, then export tidy results without switching to Python.

Scrape News Headlines from Google News

Build a practical Google News headline scraper in Python using topic feeds, parse titles, publishers, and links, then export a deduplicated CSV for a lightweight news monitor.

Scrape Hacker News Ask HN Threads with Python

Collect Ask HN titles, points, authors, comment counts, and thread replies into a reusable startup-signal dataset using Python.

Scrape GitHub Trending Developers with Python

Build a daily GitHub trending-developers dataset with Python by extracting names, usernames, avatars, and popular repos from the live page.

Scrape Numbeo Quality of Life Index by City with Python

Extract Numbeo's city-level quality-of-life scores, safety, traffic, pollution, and climate indicators into a clean dataset with Python and ProxiesAPI.

Scrape UK Property Prices from Rightmove

Show how to collect Rightmove listing prices, addresses, agent names, and URLs into a reusable UK property dataset with Python and ProxiesAPI.

Steam Scraper: Extract Prices, Reviews, and Tags with Python

Build a practical Steam scraper that collects store prices, review counts, review summaries, and user-facing tags from search results and app pages.

Scrape Numbeo Crime Index by City with Python + ProxiesAPI

Extract city crime rankings, safety scores, and comparison-ready rows from Numbeo's public rankings table into JSON and CSV.

Scrape GitHub Topic Pages with Python + ProxiesAPI

Collect repository cards, stars, languages, repo URLs, and update timestamps from GitHub topic pages into a niche-watch dataset.

Scrape Numbeo Rent Prices and Cost Breakdown by City

Extract Numbeo city tables for rent and living-cost items, normalize ranges, and build a practical city comparison dataset with a validation screenshot.

Scrape Hacker News Jobs Posts with Python + ProxiesAPI

Turn the HN Jobs feed into a clean dataset of roles, companies, domains, and links from the real jobs page with resilient pagination and a validation screenshot.

Scrape Book Data from Goodreads

Build a Goodreads dataset with book titles, authors, ratings, and review counts from a public list page using Python and an optional ProxiesAPI fetch layer.

Scrape Secondhand Fashion Listings from Vinted

Show how to extract Vinted listing titles, prices, brands, sizes, and image URLs from the public catalog with real selectors and a screenshot.

Scrape Rightmove Sold Prices

Build a sold-price dataset with Rightmove property cards, detail pages, sale dates, and historical prices using real selectors and a screenshot.

Scrape Stack Overflow Newest Questions into CSV with Python

Collect Stack Overflow's newest questions with Python: titles, tags, votes, answers, timestamps, and URLs exported into clean CSV files with an optional ProxiesAPI request layer.

Scrape GitHub Trending Repositories with Python

Build a daily GitHub Trending dataset with Python: collect repository names, languages, star counts, and URLs, then export clean CSV or JSON with an optional ProxiesAPI fetch layer.

Scrape Trustpilot Company Search Results and Ratings with Python

Pull company names, star ratings, review counts, profile URLs, and visible location text from Trustpilot search results. Includes a real screenshot and a Playwright scraper that can run through ProxiesAPI.

Scrape IMDb Top Box Office and Release Data with Python

Collect the live IMDb Top Box Office chart into a clean dataset with title URLs, weekend gross, total gross, and weeks released. Includes a real screenshot and a Playwright scraper wired for ProxiesAPI.

Scrape eBay Listings and Prices

Build an eBay scraper that captures titles, prices, item URLs, and pagination into CSV-ready output.

Scrape eBay Listings and Prices

Build an eBay scraper that captures listing titles, prices, shipping, and item URLs across result pages.

Scrape Stock Prices and Financial Data with Python

Use Python + ProxiesAPI to pull Yahoo Finance quote pages, key stats tables, and historical price rows into CSV without building a heavyweight browser scraper.

Scrape Stock Prices and Financial Data with Python

Build a stock-price dataset from Stooq with Python: fetch symbols, download historical OHLCV CSVs, handle retries/timeouts, and export clean CSV plus SQLite with ProxiesAPI in the network layer.

Scrape Book Reviews and Ratings from Goodreads

Extract Goodreads review text, star ratings, review counts, pagination cursors, and reviewer metadata into a clean book-sentiment dataset.

Scrape Book Data from Goodreads (Titles, Authors, Ratings, and Reviews)

A practical Goodreads scraper in Python: collect book title/author/rating count/review count + key metadata using robust selectors, ProxiesAPI in the fetch layer, and export to JSON/CSV.

Scrape Live Stock Data from Yahoo Finance

Show how to pull live quote fields, daily change, volume, and market-cap data from Yahoo Finance quote pages into a clean CSV.

Scrape Yahoo Finance Analyst Ratings and Price Targets with Python

Extract Yahoo Finance analyst tables into a research-ready CSV: recommendation trend, mean target price, high/low targets, and ticker metadata.

Scrape Steam Upcoming Releases and Launch Dates with Python

Collect Steam coming-soon games, release dates, store URLs, tags, and prices into a launch-watch CSV using Python.

HTML Table Scraping with Python: Parse Tables into CSV Reliably

Learn a reliable workflow for scraping HTML tables with Python, from choosing the right table to fixing messy headers and exporting clean CSV files.

Scrape Wikipedia Category Pages into CSV

Crawl a Wikipedia category tree, collect page titles and URLs, and export a clean CSV with subcategories and article members.

Scrape Crunchbase Company Data

Collect company profile fields from Crunchbase by discovering organization URLs, rendering profile pages, and parsing structured data into CSV.

Scrape Craigslist Listings by Category and City

Show how to pull listing titles, prices, neighborhoods, and posting URLs from Craigslist search pages into a clean dataset.

Scrape Stack Overflow User Profiles and Badges with Python

Extract reputation, badge counts, top tags, and profile metadata from public Stack Overflow user pages into JSON/CSV with robust selectors and a ProxiesAPI-ready fetch layer.

Scrape ESPN Team Schedules and Game Results with Python

Collect upcoming games, completed results, opponents, dates, networks, and home-away splits from ESPN team schedule pages using the serialized page data behind the HTML.

Scrape Marktplaats Seller Listings and Prices with Python

Extract seller inventory, prices, listing URLs, and locations from a live Marktplaats seller page using Python and BeautifulSoup.

Build a Job Board with Data from Indeed

Scrape Indeed job listings (title, company, location, salary, summary) with Python (requests + BeautifulSoup), then save a clean dataset you can render as a simple job board. Includes pagination + ProxiesAPI fetch.

Scrape GitHub Repository Data

Collect GitHub repository metadata, stars, forks, topics, and README-linked context from the public HTML with Python. Includes defensive selectors, CSV export, and a screenshot.

Scrape Financial Data from Yahoo Finance

Extract quote headers, summary statistics, and historical rows from Yahoo Finance into a clean CSV with Python, BeautifulSoup, and a ProxiesAPI-backed fetch layer.

Scrape Rightmove Sold Prices

Walk through building a sold-price dataset from Rightmove with listing details, pagination, and clean CSV export.

Scrape GitHub Pull Requests into a Review Queue (Labels, States, Draft Status)

Build a GitHub pull request queue from public HTML: collect PR titles, numbers, labels, comments, authors, timestamps, and draft status so you can triage reviews without the API.

Scrape Yahoo Finance Earnings Calendar with Python (Dates, EPS Estimates, CSV Export)

Turn Yahoo Finance's earnings calendar into a clean daily dataset you can filter by date, ticker, and surprise expectations.

Scrape IMDb Top 250 into a Weekly Tracker (Rank Changes, Ratings, Votes)

Build a repeatable IMDb Top 250 snapshot pipeline so you can chart rank moves, rating drift, and vote growth over time.

Scrape GitHub Issues (Labels, States, Pagination) Into CSV

Build a practical GitHub Issues scraper in Python: parse issue rows, collect labels + state + dates, follow pagination, and export a triage-ready CSV. Includes screenshot + working code.

Scrape Sports Scores from ESPN with Python (Scoreboard API + Normalized CSV)

Build a reliable ESPN scores scraper: pull scoreboard data for multiple sports, normalize teams/scores/status, and export clean CSV/JSON. Includes a screenshot and a ProxiesAPI-ready fetch layer.

Scrape Game Prices and Reviews from Steam with Python (Search + App Pages)

Build a practical Steam scraper: crawl search results, extract title/appid/price/discount/review summary, then enrich each game from its app page. Includes a screenshot and a ProxiesAPI-ready fetch layer.

Web Scraping with C# and HtmlAgilityPack: A Practical 2026 Tutorial

A from-scratch C# web scraping tutorial using HttpClient + HtmlAgilityPack: requests, parsing, pagination, and exporting to CSV/JSON. Includes reliability patterns and when to add a proxy layer like ProxiesAPI.

Scrape Currency Exchange Rates with Python (Daily FX Dataset) + ProxiesAPI

Build a daily FX-rates dataset: scrape a real rates table, validate values, write CSV/JSONL, and keep it running with retries. Includes a ProxiesAPI-ready network layer and a screenshot of the source page.

Scrape Numbeo City Cost-of-Living Comparisons (2-City Diff Tables) with Python

Extract Numbeo city-vs-city cost of living comparison rows into a clean dataset (item, city1, city2, percent diff). Includes screenshot, URL builder, and robust table parsing.

Scrape Goodreads Author Pages: Books, Series, Ratings (ProxiesAPI + Python)

Extract author profile data plus a clean list of books (title, URL, average rating, rating count) from Goodreads author pages. Includes real selectors, retries, and a screenshot.

Web Scraping with Go (Colly Framework): Complete Guide

Learn web scraping in Go using Colly: selectors, concurrency, rate limits, retries, and exporting to JSON/CSV. Includes a practical ProxiesAPI integration pattern for more reliable crawling.

Scrape Vinted Listings with Python: Search + Pagination + Clean CSV Export

Build a practical Vinted listings scraper: pull search results via Vinted’s internal catalog endpoint, paginate safely, extract price/brand/size/image URLs, and export a clean CSV. Includes a screenshot + ProxiesAPI integration.

Scrape Stack Overflow with Python: Tag Pages + Question Threads + Q/A Export

Build a production-ready Stack Overflow scraper: crawl tag pages, follow question links, extract question + answers + votes, and export JSON/CSV. Includes a screenshot and ProxiesAPI integration hooks.

Web Scraping with TypeScript in 2026: Playwright + Cheerio End-to-End Guide

A practical TypeScript scraping pipeline: Playwright for rendering and navigation, Cheerio for fast parsing, plus retries/backoff, queue design, and export to JSON/CSV. Includes proxy-rotation hooks and honest notes on where ProxiesAPI belongs.

Steam Deal Tracker: Scrape Daily Specials + Price Drops (Python + ProxiesAPI)

Scrape Steam specials/search pages via ProxiesAPI, extract discount + price + appid, and persist a daily snapshot to detect price drops. Includes pagination, CSV export, and a screenshot of the target page.

Scrape Shopee Reviews at Scale: Ratings, Review Text, and Product Metadata

Fetch Shopee product metadata + reviews via ProxiesAPI, paginate ratings safely, and export clean JSON/CSV for analysis. Includes robust URL parsing, retry/backoff, and a screenshot of a real product page.

Scrape Live Stock Data from Yahoo Finance with Python (Quotes + Key Stats)

A resilient Yahoo Finance scraper in Python: fetch quote pages via ProxiesAPI, extract live-ish quote fields + key stats from embedded JSON, handle retries, and export to CSV.

Scrape Book Data from Goodreads with Python (List Pages + Pagination)

Scrape Goodreads list pages for title/author/rating/reviews with Python: fetch via ProxiesAPI, parse real HTML selectors, paginate safely, and export CSV/JSON.

Scrape eBay Listings + Sold Prices with Python (Active + Completed Listings)

Build a small eBay dataset (title, price, condition, shipping) from search results, then pull completed/sold prices from the Sold filter. Includes pagination, CSV export, and ProxiesAPI in the fetch layer.

Scrape Craigslist Listings by Category and City (Python + ProxiesAPI)

Build a Craigslist scraper with pagination + dedupe, capture title/price/location/date, export CSV, and keep the fetch layer resilient with ProxiesAPI. Includes a target-page screenshot.

Python BeautifulSoup Tutorial: Scraping Your First Website (2026)

A beginner-friendly BeautifulSoup tutorial: fetch HTML with requests, parse elements with CSS selectors, handle pagination, avoid common pitfalls, and export results. Includes an honest ProxiesAPI section for when you scale.

Scrape Yahoo Finance Top Gainers/Losers Screener with ProxiesAPI (CSV Export)

Scrape Yahoo Finance movers tables (gainers + losers), extract tickers, prices, % change, and volume using stable data-testid anchors, then export to CSV. Includes selector rationale and a screenshot.

Scrape Trustpilot Category Rankings (Top Companies + Ratings) with ProxiesAPI

Extract top companies in a Trustpilot category (name, website, rating, review count) across pages using stable DOM anchors, then export to CSV. Includes selector rationale and a proof screenshot.

Scrape Steam Game Prices + Reviews (Search Results) with Python + ProxiesAPI

Build a practical Steam search scraper: fetch the real HTML, extract game title/appid/price/discount/review summary, and export clean CSV/JSON. Includes a screenshot and a ProxiesAPI-based fetch layer for stability.

Scrape Marktplaats Search Results (Listings) with Python + ProxiesAPI

Build a practical Marktplaats search scraper: fetch the real HTML, extract listing title/price/location/url, and export CSV. Includes a screenshot and a ProxiesAPI-based fetch layer to keep crawls stable.

Scrape App Store Rankings (Python + ProxiesAPI)

Pull Apple App Store top charts and app metadata reliably, export to CSV, and keep runs stable with retries + ProxiesAPI. Includes a screenshot-backed walkthrough.

Scrape Marktplaats Listings with Python (Search + Pagination + CSV Export)

Extract listing title, price, location, and URL from Marktplaats search results with Python + BeautifulSoup. Includes pagination, CSV export, and a ProxiesAPI fetch wrapper for stability.

Scrape Government Contract Data from SAM.gov (Opportunities + Details)

Build an end-to-end SAM.gov scraper: search opportunities, paginate results, fetch detail pages, normalize fields, and export JSON/CSV using ProxiesAPI. Includes screenshots + robust retry patterns.

Scrape UK Property Prices from Rightmove (Dataset Builder)

Build a sold-price dataset from Rightmove: crawl results, follow listing links, extract key fields, handle retries, and export to CSV using ProxiesAPI.

Scrape Craigslist Listings by Category and City (Python + ProxiesAPI)

Build a Craigslist city+category scraper with pagination, dedupe, and CSV export. Includes selectors, anti-block hygiene, and screenshot proof.

Scrape UK Property Prices from Rightmove (Dataset Builder + Screenshots)

Build a repeatable Rightmove sold-price dataset pipeline in Python: crawl result pages, extract listing URLs, parse sold-price details, and export clean CSV/JSON with retries and politeness.

Scrape IMDb Top 250 Movies into a Dataset (Python + ProxiesAPI)

Extract IMDb Top 250 movies (rank, title, year, rating, vote count) into clean CSV/JSON — with robust parsing, retries, and polite crawling.

Scrape Government Contract Opportunities from SAM.gov (Python + ProxiesAPI)

Build a reliable scraper for SAM.gov contract opportunities: crawl search results, paginate, extract listing cards, fetch detail pages, and export CSV/JSON. Includes retry logic and a screenshot step for proof.

Scrape Government Contract Opportunities from SAM.gov (Python + ProxiesAPI)

Pull contract opportunity listings from SAM.gov into a clean CSV: pagination, robust retries, request headers, and an honest ProxiesAPI integration to reduce throttling.

Scrape Shopee Product Listings with Python (ProxiesAPI)

Fetch Shopee product pages through ProxiesAPI, extract title/price/sold count from HTML, and export results to CSV. Includes a screenshot + a production-ready fetch layer with retries.

Scrape Google Scholar Search Results with Python (Titles, Authors, Citations)

Collect Scholar SERP pages into a clean dataset, handling pagination + lightweight anti-bot tactics.

Scrape Costco Product Prices with Python (Search + Pagination + Product Pages)

Build a repeatable Costco price dataset from search → listings → product pages, with ProxiesAPI + retries.

Scrape Numbeo Cost of Living Data with Python (cities, indices, and tables)

Extract Numbeo cost-of-living tables into a structured dataset (with a screenshot), then export to JSON/CSV using ProxiesAPI-backed requests.

Scrape Flight Prices from Google Flights (Python + ProxiesAPI)

Pull routes + dates, parse price cards reliably, and export a clean dataset with retries + proxy rotation.

Scrape Rightmove Sold Prices (Second Angle): Price History Dataset Builder

Build a clean Rightmove sold-price history dataset with dedupe + incremental updates, plus a screenshot of the sold-price flow and ProxiesAPI-backed fetching.

Scrape Expedia Flight and Hotel Data with Python (Step-by-Step)

A practical Expedia scraper in Python using Playwright: open search results, extract hotel cards (and where flight offers live), paginate safely, and export clean JSON/CSV. Includes ProxiesAPI-friendly network patterns and a screenshot.

How to Scrape Google Finance Data with Python (Quotes, News, and Historical Prices)

Scrape Google Finance quote pages for price, key stats, news headlines, and a simple historical price series with Python. Includes selector-first HTML parsing, CSV export, and block-avoidance tactics (timeouts, retries, and ProxiesAPI-friendly patterns).

Scrape Sports Scores from ESPN (Python + ProxiesAPI)

Fetch ESPN’s scoreboard page, parse games + teams + scores into a clean table, then export CSV/JSON. Includes a screenshot and a resilient parsing strategy.

Scrape Podcast Data from Apple Podcasts: Charts + Episode Metadata (Python + ProxiesAPI)

Scrape Apple Podcasts chart pages, extract show details, then pull episode metadata into a clean dataset. Includes screenshot + robust parsing with fallbacks.

Scrape Currency Exchange Rates (USD/EUR/INR) into a Daily Dataset with Python + ProxiesAPI

Build a daily FX dataset (USD/EUR/INR) by scraping a public rates table into a clean time series CSV, with basic validation, retries/timeouts, and a ProxiesAPI-ready fetch layer. Includes a screenshot of the source page.

Scrape Real Estate Listings from Realtor.com (Python + ProxiesAPI)

Extract listing URLs and key fields (price, beds, baths, address) from Realtor.com search results with pagination, retries, and a ProxiesAPI-backed fetch layer. Includes selectors, CSV export, and a screenshot.

Scrape GitHub Repository Data (Stars, Releases, Issues) with Python + ProxiesAPI

Scrape GitHub repo metadata from HTML (not just the API): stars, forks, latest release, open issues, and pull requests. Includes a ProxiesAPI fetch layer, safe parsing, and CSV export + screenshot.

How to Scrape Craigslist Listings by Category and City (Python + ProxiesAPI)

Pull Craigslist listings for a chosen city + category, normalize fields, follow listing pages for details, and export clean CSV with retries and anti-block tips.

Scrape Wikipedia Article Data at Scale (Tables + Infobox + Links)

Extract structured fields from many Wikipedia pages (infobox + tables + links) with ProxiesAPI + Python, then save to CSV/JSON.

Scrape Weather Data for Any City (Open-Meteo)

Build a lightweight weather dataset pipeline: geocode a city, fetch forecasts from Open-Meteo, add caching + retries, and export clean JSON/CSV.

How to Scrape Business Reviews from Yelp (Python + ProxiesAPI)

Extract Yelp search results and business-page review snippets with Python. Includes pagination, resilient selectors, retries, and a clean JSON/CSV export.

How to Scrape Apartment Listings from Apartments.com (Python + ProxiesAPI)

Scrape Apartments.com listing cards and detail-page fields with Python. Includes pagination, resilient parsing, retries, and clean JSON/CSV exports.

Scrape IMDb Top 250 Movies into a Dataset

Pull rank, title, year, rating, and votes into clean CSV/JSON for analysis with working Python code.

How to Scrape Wikipedia Tables into CSV with Python

Turn messy HTML tables into structured datasets you can analyze with pandas in minutes.

How to Scrape Trustpilot Reviews for Any Company

Pull ratings, dates, reviewer names, and review text into a clean CSV for reputation monitoring.

How to Scrape GitHub Trending with Python (and Export to CSV/JSON)

A practical GitHub Trending scraper: fetch the Trending page, extract repo names + language + stars, and export a clean dataset.