Hate bloggers?
What I find funny is that even his original LA Times piece (PDF) was pretty dumb. He misunderstood the purpose of Google’s library-scanning thing. It’s not all about the searching. A huge part of it is just the archiving of all of those millions of volumes, copying them into a form that will be easily replicable and will one day fit in the palm of your hand — everybody’s hand. This is a project that has to be undertaken only once, and its fruits will outlive Google, the Internet, blogs, and maybe even humanity as a species.
The digitization of our libraries is perhaps the most exciting project, knowledge-wise, to hit in a long time. It’s the modern-day equivalent of Gutenberg’s press: a technology that democratizes information, takes it out of the centers of learning and puts it into the hands of people. God knows I’d love to still have regular access to the library at the U of C, but that’s impossible for me right now, and it’s impossible for millions of people for all their lives.
Besides, I dunno what kind of scholarly research he does, but when I was in college, I didn’t have time to read an entire book to get “context.” The containing chapter at most, and sometimes just the few pages surrounding it. And it doesn’t seem that many of the academics who actually wrote the articles and books that I cited, read the entire books either. Taking stuff out of context is par for the course in academia. What might have been one author’s afterthought in one chapter of a book becomes a key piece of support in another author’s article.
But those concerns aside, it is a given that making all of these volumes searchable will increase the productivity of researchers by orders of magnitude. The one reasonably in-depth research paper that I wrote in college started out relying on just a few sources, because those were the only ones that the professor and the Ancient Studies librarian (who, besides being a total babe, was extremely knowledgable and actually knew several articles off the top of her head that would be relevant to my paper) knew of. And then I went to the library, copied those articles, checked out those books, read the relevant parts, checked their citations, found those cited books/articles, checked their citations, etc. etc. in an ever-widening circle of information that took weeks before I found every source I ended up using. And it was frustrating because there might have been a very good, relevant article that I simply didn’t find in time.
Google’s search algorithm actually mimics the citations-give-authority model of academic research, and therefore it is perfectly suited to this kind of research. Academic articles have been “hyperlinking” for quite some time, and the levels of inter-linking and robustness of authority on a topic are MUCH simpler to model within this closed system than they are on the Web. It’d be just as easy to find, not only what articles this paper cites, but what papers cite this paper. Seriously, what took me weeks of crawling the library could happen in an afternoon once this system is in place. And the folks who currently don'’t have access to a knowledgable professor or librarian…. they can find the same information I did. Possibly even better information. Er, “knowledge.”
And several of his objections are simply poorly-thought-out. No, nobody wants to print out a 500-page book…. but nobody wants to spend an hour and a half standing in front of a copy machine, paying 5 cents per copy for 15 articles at 40 pages apiece, in what amounts to manual labor as you flip each page, hold it down, press the button, rotate the book, hold it down, press the button, flip the page….. since most journals aren’t check-out-able in most academic libraries. In this case, the higher-tech solution is most welcome.
Plus, I disagree with his assertion that full context is required. Most academic writing isn’t so opaque that a reasonable person can’t understand what’s on page 142 without reading pages 1 through 141. In fact, many books explicitly refer back to important ideas (”As discussed in Chapter 3, …”). We’re not reading James Joyce here. A reasonable level of knowledge about the subject at hand is assumed. If I’m writing a paper about Sparta, I don’t care if the author is using the example of Sparta to prove that political and military hegemony cannot last more than 3 generations… I just care what he says about the particulars of Sparta. And I don’t care about the other Greek city-states and their political systems either, except to the extent that they interacted with Sparta.
The proof is in the pudding. There are already plenty of invaluable search tools used by academic researchers. I know several professional social-science/psychology professors/researchers whose first resource is always Ovid’s PsycINFO search with fulltext abstract, keyword, and topic searches. In the law arena, Lexis-Nexis provides the same service, with millions of opinions, judgments, and articles fully searchable at the click of a mouse. There’s a reason why lawyers pay hundreds or even thousands of dollars a year for this online service: it works, and it’s much better than the non-tech alternative.
Anyway, I guess I can sum up by saying: Michael Gorman has completely missed the boat.
Hmm, I wish there was a way that WordPress would automatically say “This post tracks back to the following: http://drbacchus.com/wordpress/?p=900