Validating XML documents with PUBLIC identifiers and catalogs
And indenting them, and changing their encoding...
How sloppy is OK for Google scans?
Something for kids to read on the OLPC XO.
Wired Magazine gives scraping the buzzword treatment but remains clueless about the semantic web and linked data.
The latest issue of Wired has an article with the provocative title “The Data Wars”, about web sites built around data retrieved by “bots” doing “scraping”. I put these terms in quotes because the article twists them a bit to make them and their subjects seem more dramatic, more cutting edge, and (you guessed it) more “Web 2.0”.
A new IBM developerWorks article.
And, out of context, it can mislead.