New additions to the JURN index today:—
RAG, The (quarterly of the Roman Archaeology Group at UWA)
Aegis Humanities Journal (Otterbein University Humanities Journal)
Metropolitan Museum of Art Bulletin (There doesn’t appear to be an index page)
Feature articles of SAFE : Saving Antiquities for Everyone
My software-assisted checking of JURN’s article-level index is now complete.
The process involved determining if each and every URL in the JURN index was still being “seen” and indexed by Google. In this second and final check, another 180 or so journals have had their URLs re-found and corrected. I also adjusted the relevant Directory URL when it was found to have changed. Another 20 dead or deleted journals have been removed. All this means that every URL in JURN’s main index is currently being indexed by Google.
I then did a quick Linkbot-based re-check of the resulting latest version of the JURN Directory, looking for and correcting any “404 not found” results.
I think I’ve now developed a fairly robust “pincer movement” method that can annually curtail the inevitable link-rot: i) the article-level URL-checking; ii) spam-word searches in JURN; and iii) the checks on the English-language Directory for “404” / redirected home-pages.
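For anyone wanting to try step iii) without Linkbot, here is a minimal Python sketch of the same idea. It is not what Linkbot does internally; the function names classify and check and the four result labels are my own assumptions for illustration.

```python
import urllib.error
import urllib.request


def classify(requested_url: str, status: int, final_url: str) -> str:
    """Label one Directory link as '404', 'error', 'redirected' or 'ok'."""
    if status == 404:
        return "404"
    if status >= 400:
        return "error"
    # Ignore a trailing slash when deciding whether the page has moved.
    if final_url.rstrip("/") != requested_url.rstrip("/"):
        return "redirected"
    return "ok"


def check(url: str, timeout: float = 10.0) -> str:
    """Fetch a home-page URL and classify the outcome (hypothetical helper)."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            # urlopen follows redirects, so geturl() is the final address.
            return classify(url, response.status, response.geturl())
    except urllib.error.HTTPError as err:
        return classify(url, err.code, url)
    except OSError:
        # DNS failures, refused connections and timeouts end up here.
        return "unreachable"
```

Running check() over each home-page URL in the Directory and keeping anything that is not "ok" would give a list to fix by hand. A "redirected" result only means the final address differs from the listed one, so it still needs a human look.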
Three new ejournal additions to JURN, found during the big link-checking:—
Electronic Journal of Mithraic Studies (Ancient Roman Mithraism)
Classics@ : an online journal (journal of the Center for Hellenic Studies, Harvard. Added to the JURN Directory, but articles can’t be indexed by JURN — they have one of the most hideous URL structures I’ve ever seen, and on top of that there also seems to be per-session URL-shaping to prevent linking to individual pages. It’s Classics, guys — web traffic isn’t likely to be that heavy…)
APS Proceedings Online (Proceedings of the American Philosophical Society)
Warpspire on URL Design…
A URL is an agreement
A URL is an agreement to serve something from a predictable location for as long as possible. Once your first visitor hits a URL you’ve implicitly entered into an agreement that if they bookmark the page or hit refresh, they’ll see the same thing.
Added four new philosophy ejournals to JURN:—
Dialegesthai : revista telematica di filosofia (Has some English articles)
Removed African Philosophy – now requires a subscription.
Added Internet Encyclopedia of Philosophy, The (IEP)
I’m pleased to say that I’ve found a robust way to auto-check if Google is still “seeing” content at the article-level URLs indexed by JURN. It’s a software-based solution, and is basically ‘dark side’ SEO software that I’ve turned to the good side. It auto-prepends the site: modifier to each of the URLs contained in the JURN index, and then checks if those URLs are actually indexed by Google. It then logs any wholly un-indexed URLs. It just chugs away in the background and is deliberately very slow, so as not to trigger flood-control blocking measures. But it’s certainly better than doing the checking by hand.
If you have such a list you want to check, it’s probably best to remove or cut back any URLs containing multiple wildcards such as /*/*/. Google has also been known to choke on URLs containing question-marks (it can see them as evidence of someone trying a scripting exploit on Google), although I haven’t seen this happen during the checking. But if you’re doing the checking in blocks of 200, it’s not difficult to correct those sorts of URLs first.
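The preparation steps can be sketched in a few lines of Python. This is only an illustration, not the SEO software itself. All the function names are made up, and is_indexed stands in for whatever tool actually asks Google whether a site: query returns results. Cutting a multi-wildcard pattern back to its fixed prefix is one assumed reading of "cut back".

```python
import time
from typing import Callable, Iterator, List

BLOCK_SIZE = 200  # the block size mentioned in the post


def cut_back(pattern: str) -> str:
    """Drop the scheme, and trim multi-wildcard patterns to their fixed prefix."""
    for scheme in ("http://", "https://"):
        if pattern.startswith(scheme):
            pattern = pattern[len(scheme):]
    if pattern.count("*") > 1:
        pattern = pattern.split("*", 1)[0]
    return pattern


def has_question_mark(pattern: str) -> bool:
    """Flag patterns worth correcting by hand before the run."""
    return "?" in pattern


def site_query(pattern: str) -> str:
    """Prepend the site: modifier to a cleaned-up pattern."""
    return "site:" + cut_back(pattern)


def blocks(patterns: List[str], size: int = BLOCK_SIZE) -> Iterator[List[str]]:
    """Yield the list in consecutive blocks of at most `size` patterns."""
    for start in range(0, len(patterns), size):
        yield patterns[start:start + size]


def unindexed(patterns: List[str],
              is_indexed: Callable[[str], bool],
              pause_seconds: float = 0.0) -> List[str]:
    """Return the patterns whose site: query is reported as not indexed."""
    missing = []
    for pattern in patterns:
        if not is_indexed(site_query(pattern)):
            missing.append(pattern)
        time.sleep(pause_seconds)  # go slowly to avoid flood-control blocks
    return missing
```

In practice pause_seconds would be set generously, since the whole point is to avoid tripping the flood-control measures.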
Spamming Google Scholar. Very possible, or so it seems…
“…we conducted several tests on Google Scholar. The results show that academic search engine spam is indeed – and with little effort – possible: We increased rankings of academic articles on Google Scholar by manipulating their citation counts; Google Scholar indexed invisible text we added to some articles, making papers appear for keyword searches the articles were not relevant for; Google Scholar indexed some nonsensical articles we randomly created with the paper generator SciGen; and Google Scholar linked to manipulated versions of research papers that contained a Viagra advertisement.”
Beel, J. (2010)
Academic Search Engine Spam and Google Scholar’s Resilience Against it.
Journal of Electronic Publishing 13 (3), December 2010.
Bing now supports the site: search modifier. Example usage:
A new Google search modifier… AROUND.
apples AROUND(3) pears
…gives results that contain the word “apples” within three words of “pears”.
[ Hat-tip: Researchbuzz ]
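To make the "within three words" idea concrete, here is a small Python sketch of a proximity test. It only illustrates the concept and says nothing about how Google actually implements AROUND. The function name within is made up, and counting distance as the difference in word positions is an assumption.

```python
import re


def within(text: str, first: str, second: str, n: int) -> bool:
    """True if `first` occurs at most n word positions away from `second`."""
    words = re.findall(r"\w+", text.lower())
    first_positions = [i for i, w in enumerate(words) if w == first.lower()]
    second_positions = [i for i, w in enumerate(words) if w == second.lower()]
    return any(abs(i - j) <= n
               for i in first_positions
               for j in second_positions)
```

Under that assumption, within("apples and ripe pears", "apples", "pears", 3) is true, while a sentence with the two words six positions apart is not a match.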
Removed The Other Journal at Mars Hill Graduate School from JURN. It shows as infected with an online pharmacy bot in Google, and then on visiting and dropping the NoScript block the site attempts to download an infection onto a visitor’s computer…
Removed the Athens Arts Review. Now squatted by a pill-pushing online pharmacy site.
Added to the JURN site-index today:—
Chronicles of Oklahoma (Oklahoma history – full-text 1923 – 1962, thereafter TOCs only)
Trabalhos de Arqueologia (Portuguese, some articles in English – e.g.: “Portuguese-derived ship design methods in southern India?”)
Societas Magica Newsletter (Scholarly study of historical magic, with academic contributors and substantial articles)
I seem to have missed out on mentioning a couple of recent articles on the state of ejournals in China:
1. An article from Nature, on China’s severe problems with academic journals…
“in a Correspondence to Nature last week, Yuehong Zhang of the Journal of Zhejiang University–Science reported that a staggering 31% of the papers submitted to that campus journal contained plagiarized material (Nature 467, 153; 2010).”
2. And a long article in the New York Times…
“The Lancet, the British medical journal, warned that faked or plagiarized research posed a threat to President Hu Jintao’s vow to make China a “research superpower” by 2020.”
“a recent government study in which a third of the 6,000 scientists at six of the nation’s top institutions admitted they had engaged in plagiarism or the outright fabrication of research data.”
As far as I know, no mainland Chinese journals are accessible via JURN, since the Chinese state requires them all to be kept on a central server in page-scanned image form only (i.e.: no Googleable text).
Three new titles added to the JURN index today:—
Interpretation : a journal of bible and theology (book reviews are free)
Postgraduate Journal of Aesthetics (2004-2009. Seems to have a three-issue rolling subscription wall).
Free sample chapters from the books of publisher Boydell and Brewer (mostly TOC and introduction)
Google has implemented a new filter that allows the filtering of search results by ‘reading level’. It’s accessed via the Advanced Search page, thus…
In a search for the term “reading level”, with the Reading Level set to Advanced, I still had a basic About.com page in the first page of results, as well as this blatant SEO spam page as result No.8.
A search for ‘tolkien + symbols’ showed better results, with a solid and useful first two pages of results. These were not that much different from the standard search, though, except that using Advanced Reading Level blocked a result from the scumbag SEO spam domain directhit.com on the second page of plain results.
Added two ejournals today:—
Ars Technica has a long review of the Mendeley academic research management tool.