I have a set of articles in Zotero and pdf format and would like to generate a citation graph from them, ideally with some statistics about how often the articles were cited in total and the possibility to fill in missing links.
Is there any tool that is able to do this?- I've heard that Mendeley is/was capable of extracting references from pdf, but don't know what the status of this feature is. Other suggestions are welcome.
Alternatively, are there tools that just extract references from text, e.g. a collection of regular expressions for different journals? I've got a little experience in Cytoscape plugin development and could code the visualization myself (if anyone is interested in this and would like to help this is also welcome).
There are tools like cbib for parsing references from PDF files. But the real challenge is not that, it is proper record linkage and de-duplication to identify which records point to the same publication. That is an open research problem with few if any easily applicable tools.
I don't know of a tool that does it all. So +1 for your question. I would do it as follows:
If you have access to Thomson's Web of Science (especially the API) you might be able to use that. Unless you want your results to be public I guess. They do not only know how often publications were cited but also from where. So they must already have collected the information you need.