Accessing backend system…

We're sorry, but your session has expired due to inactivity. Please use your browser to refresh this page and log in to our system again.

Message goes here.

Message goes here.

Message goes here.

LOGIN / REGISTER
VIEW BASKET
SEARCH:
 
php|architect logo
 
SERVICES
  • MAGAZINE
  • PHP|TEK 2012
  • CODEWORKS 2011/12 TOUR
  • BOOKS
  • TRAINING
  • ADVERTISE
 
CHANNELS
  • NEWS
  • PODCAST
  • DEVELOPMENT
  • OPINION
  • WRITE

Buy Digital
Buy Print

ISBN 9780981034515
Pages 192
Author Matthew Turland
Print $39.99
Digital $36.99

php|architect's Guide to Web Scraping with PHP

Despite all the advancements in web APIs and interoperability, it’s inevitable that, at some point in your career, you will have to “scrape” content from a website that was not built with web services in mind. And, despite its sometimes less-than-stellar reputation, web scraping is usually an entire legitimate activity—for example, to capture data from an old version of a website for insertion into a modern CMS.

This book, written by scraping expert Matthew Turland, covers web scraping techniques and topics that range from the simple to exotic using a variety of technologies and frameworks:

  • Understanding HTTP requests
  • The PHP HTTP streams wrapper
  • cURL
  • pecl_http
  • PEAR:HTTP
  • Zend_Http_Client
  • Building your own scraping library
  • Using Tidy
  • Analyzing code with the DOM, SimpleXML and XMLReader extensions
  • CSS selector libraries
  • PCRE pattern matching
  • Tips and Tricks
  • Multiprocessing / parallel processing
 
 

About us

  • What we do
  • Contact us
  • Write for us

Policies & legal

  • Customer support
  • Privacy policy
  • Refund policy
  • Terms & Conditions

Online Store

  • Magazine
  • Training courses
  • Books

Special sections

  • Codeworks 2011
 

Copyright © 2002-2012 Blue Parabola, L.L.C. — All amounts in USD - WP3