Previously was using the PHP based PDL package, but that seems to break once the number of articles gets large (once past 51 articles).
Installed R to the web server – we can run it directly using “R –vanilla –slave”, but that seems more than a bit slow – getting results takes a good chunk of time, probably because it costs a few seconds to compute each p-value, and you have to do it for every one of the MeSH terms. Maybe a couple of minutes to process them all. Maybe I should look at figuring out how to batch it all up into one giant computation – maybe make an array that can be read in – which might allow for parallel processing, or at least save on loadup time for R. Otherwise, maybe there’s a lightweight stats package that could be used instead?