Archive for 18 February, 2010

Spidering hacks

Written for developers, researchers, technical assistants, librarians, and power users, Spidering Hacks provides expert tips on spidering and scraping methodologies. You’ll begin with a crash course in spidering concepts, tools (Perl, LWP, out-of-the-box utilities), and ethics (how to know when you’ve gone too far: what’s acceptable and unacceptable). Next, you’ll collect media files and data from databases. Then you’ll learn how to interpret and understand the data, repurpose it for use in other applications, and even build authorized interfaces to integrate the data into your own content.
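As a rough illustration of the "ethics" point, one common courtesy check is to consult a site's robots.txt before fetching anything. The sketch below is not taken from the book (which works in Perl and LWP); it uses Python's standard `urllib.robotparser`, and the sample rules, the `ExampleSpider` user agent, and the `allowed_paths` helper are all hypothetical names chosen for the example.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, inlined so the sketch runs without network access.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
Crawl-delay: 5
"""


def allowed_paths(robots_txt, user_agent, paths):
    """Return the subset of `paths` that the given robots.txt lets `user_agent` fetch."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [p for p in paths if parser.can_fetch(user_agent, p)]


if __name__ == "__main__":
    candidates = ["/index.html", "/private/report.html", "/media/song.mp3"]
    for path in allowed_paths(SAMPLE_ROBOTS, "ExampleSpider", candidates):
        print("OK to fetch:", path)
```

A real spider would load the rules from the target site (for example with `RobotFileParser.set_url` and `read`) and would also pause between requests; this sketch covers only the permission check.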

Index

Chapter 1 Walking Softly
Chapter 2 Assembling a Toolbox
Chapter 3 Collecting Media Files
Chapter 4 Gleaning Data from Databases
Chapter 5 Maintaining Your Collections
Chapter 6 Giving Back to the World


18 February, 2010 at 0:01

