Apr
11
The Play’s the Thing…
Filed Under Big Data, Blog, Industry Analysis, internet, Publishing, Reed Elsevier, Search, semantic web, social media, STM, Thomson, Uncategorized, Workflow | Leave a Comment
…by which to test the seriousness of the industry. (Yes, I went to the new Hamlet production at Stratford last week). And this week’s play, acted out to a packed house of industry watchers and market analysts, has been the seduction and vanquishment of the fair Mendeley by all-powerful Elsevier, so rudely forced. Or, if you prefer, the seduction of barbarous Elsevier by maidenly Mendeley. Whatever, here was a deal done for a company with negligible revenues at a price , with earn-outs, of something up to $100 m, according to those ever-present “people familiar with the deal”. And since I have seldom had more requests to explain, here is my take: Mendeley represents the greatest leap forward since Eugene Garfield in representing the worth of a science research article. If it went to Thomson Reuters it would put them back into a game where Elsevier have spent a huge amount, culminating in SciVal, in competitive efforts to diminish them. As in days of yore (who remembers BioMedNet?) the competitive threat potentially posed by Mendeley proved greater than the price misgivings. If it went to Macmillan, who already have an investment in ReadCube, Mendeley’s competitor, it would create another axis of competition which would be unwelcome, given the strides that Macmillan Digital Science made by investing in Altmetric, as well as figstore. Since every article is unique and not a competitor with other articles, the true point of competition in science research publishing now lies in workflow tools which make researchers more productive – and help them to decide what to actually read, and what to reference and visualize. So, continuing my Danish theme, this is a pre-emptive strike, like Nelson destroying the fleet at Copenhagen. Do we know whether Mendeley is the ultimate social tool for tracking who buys and reads what? No, any more than we know whether FaceBook is the player in place for life in social networking. But we do know that more than 2 million active researchers value it immensely, and so it posed a question – and one that for 20 years Elsevier have been adroit in answering.
This begs a few questions. Will Elsevier be able to run it independently enough to re-assure those critics who regard it as more like Caliban than Caesar? And are we being distracted by watching the wrong part of the game with too much intensity? I am a strong supporter of what the Mendeley team have done, but they were let into the marketplace by a chronic publishing failure: the inability of producers to sell to researchers adequately identified PDFs that obeyed agreed industry standards and which would allow a researcher to auto-index his hard disk and find what he had bought. As ever, publishers were complacent about the downstream problems they caused their users. But the real question here is about metadata, and it is a timely reminder of other problems we have never fully solved. When we adopted DOI/Handle technology the publishing community worked, as always, at the lowest common denominator of agreement. The result is a world in which articles are effectively numbered, and CrossRef express that industry cohesion, but we still cannot offer researchers the ability to search consistently over the full range of articles for which they have permissions cleared using their own or even semi-standardized taxonomies. Nature (http://www.nature.com/news/the-future-of-publishing-a-new-page-1.12665) has done sterling work in the last month on the future of publishing, but simply illustrates to me how inadquately we tackle the last steps – the ones that lead to collaboration and to each player moving forward to create knowledge stores which reflect the real research needs of their users.
I do not mean to say that publishers do not collaborate. They increasingly do, and recent press coverage of Springer and CAS, or the case study of Wiley’s work with the AGU demonstrate this. I have been involved with the TEMIS work on collaboration and have learnt a lot from it. And publicly industry leaders do point to data-led strategies, which I was interested to hear acknowledged in a talk by Steve Smith (CEO, John Wiley) to the AAP/PSP in February, which I moderated. So I was very interested indeed to spend some time with Jason Markos, Director of Knowledge Management and Planning at Wiley, and get a current view on the enrichment picture. The contrast over the past five years is, to someone used to the sometimes somnolent complacency of publishing, quite startling. Now you can have conversations about content enrichment that do begin to embrace both the narrow/deep and the broad/shallow needs of users. If the capacity now available in publishing – coming it must be said from people who entered from outside and have a real technical grasp of knowledge engineering which was not prefaced by life in linear publishing workflow processes – to think about the need to turn away from content architectures predicated by the structure of the article and towards creating entity stores or “knowledge” stores which allow data items from article databases to be searched in conjunction with data drawn from other sources like evidential data then we may indeed be on the way towards a user-driven networked vision of the future of publishing. Learning how to work with knowledge models as a way of expressing the taxonomic values of all of this shows me that we are on a route march that follows the track that has been obvious for a little while now, and which involves adopting the RDF as a basis, and creating triples to anchor our texts in semantically searchable environments. So our new Knowledge engineers will be able to spin out new service environments for increasingly demanding users, and the publishing game will not peter out with the commoditization of the article…
…I left Wiley the other day full of hope, and I still am. But this context is necessary to see that the Mendeley deal, lovely though it is, remains symptomatic of the need to scratch yesterday’s itch. I suspect that the real struggle, already underway, is to persuade researchers that publishers really can add value to data, and that they really do know how to analyse it, structure it, create smart research tools around it and extract real value from users as a reward for this investment and effort. This will need smart industry suppliers as well, and I have learnt a lot from working with MarkLogic and TEMIS in the past year. And most of all it needs the support of CEOs who see beyond maximizing PDF downloads to the strategic crossroads this part of the industry now faces – and beyond.