Making a stand-alone scraper and XML readers seems to be the vogue nowadays. I met news API developer for The New York Times Derek Willis last week, and he brought up the good point that many online developers working in journalism had to use APIs and other querying services for their own publications because they are usually not given admin access to the publication’s electronic database.
Like Derek, I want to help journalists solve redundancies within their information gathering and distribution models. All of the hard data that many of my hard-working colleagues gather (i.e. names, ages, dates, etc.) should be stored at the most granular level within a relational or hierarchical database should be easily reused and accessible to everyone, not just journalists. Practical sorting of individual articles, i.e. by relevance, relies on an article’s meta-data, the most accurate of which is derived from the article’s most minor elements. Continue reading