Thanks for the cbib link. For the "real challenge", I think PubMed or Google Scholar would already have solved that for me (?)
I have a set of articles in Zotero and pdf format and would like to generate a citation graph from them, ideally with some statistics about how often the articles were cited in total and the possibility to fill in missing links.
Is there any tool that is able to do this?- I've heard that Mendeley is/was capable of extracting references from pdf, but don't know what the status of this feature is. Other suggestions are welcome.
Alternatively, are there tools that just extract references from text, e.g. a collection of regular expressions for different journals? I've got a little experience in Cytoscape plugin development and could code the visualization myself (if anyone is interested in this and would like to help this is also welcome).
Related: this question and Chris Miller on friendfeed, both without a satisfying answer. Maybe even Maltego could be used for this, but I don't know much about the software.
There are tools like cbib for parsing references from PDF files. But the real challenge is not that, it is proper record linkage and de-duplication to identify which records point to the same publication. That is an open research problem with few if any easily applicable tools.
I don't know of a tool that does it all. So +1 for your question. I would do it as follows: