• Directory
  • FAQ: about JURN
  • Group tests
  • Guide to academic search
  • JURN’s donationware
  • openEco: nature titles indexed

News from JURN

~ search tool for open access content

News from JURN

Category Archives: My general observations

JURN fixed and repaired

27 Sunday Sep 2020

Posted by David Haden in Ecology additions, My general observations, New titles added to JURN

≈ 1 Comment

JURN is now as up-to-date as it can be, ready for the “back to university” crowd. I’ve completed a link-check of the full URL base, checking for continued presence of an indexed URL path in Google Search. The full path is checked, not just the top domain (e.g. foobar.foo/foo-foo/journal_of_foo/articles/ and not just foobar.foo). This checking process has been slow, taking about 18 months, on and off.

Of course, a few URLs may still have newly broken in the meanwhile. But the core URL base is kept fresh by a regular check of the key home-page URLs, as organised and listed at the JURN Directory of arts & humanities journals (English-language journals only). This Directory was link-checked and updated in mid September 2020. Also recently link-checked, back in July 2020, was JURN’s openEco Directory of over 800 journal titles variously related to the study of wildlife, ecology etc. Please update any local copies you may be keeping.

It’s not Funai

10 Thursday Sep 2020

Posted by David Haden in My general observations

≈ Leave a comment

It appears that someone is using Nigeria’s Federal University Research Archive (FURA) to run one of those “produce a poisoned spam-PDF for every book-title in the world” robo-scams, and is using the repository server to spam these online with a high search ranking. See the results via a Google Search for…

site:dspace.funai.edu.ng horror

For now I’m keeping dspace.funai.edu.ng in my GRAFT (‘search across all the world’s academic repositories’), with the suggestion that GRAFT users simply add -dspace.funai.edu.ng to the end of their search to clear this spam. Hopefully the problem will be sorted out, and the server cleaned up, relatively soon. The server is not in the JURN index.

Duck flies

29 Saturday Aug 2020

Posted by David Haden in My general observations

≈ Leave a comment

This is interesting. For once, DuckDuckGo’s “last week” search is now better than Google Search’s “last week” search. I have the same URL blocklist running on both, so there’s no difference there. But having run a regular search on Google, I found myself going through the exact same with the Duck and saying… “why did Google Search not get that, and that, and that…” And most of that was currently-dated blog or magazine posts, so there was no sense of items being dredged from the past and presented as if new. Of course, it could be a simple mis-match in terms of timeliness, and I just caught Google in a late-August “techies have gone to the beach” slacker mode. But I suspect not.

The borrowed page

06 Saturday Jun 2020

Posted by David Haden in My general observations

≈ Leave a comment

I see that the Archive.org now has a ‘no-Borrow preview page’ on search-landing pages for its ‘Borrow’ books. They’ve also announced that their extended “Borrow” feature is to come to an end, as was always to be the case after the emergency period had passed.

One wonders if this new ‘page preview’, similar in nature to Google Books, is about bringing Google in on the new lawsuit from publishers? ‘If this goes down, so does Google Books?’ Just my guess.

On auto-downloading open access books

21 Tuesday Apr 2020

Posted by David Haden in How to improve academic search, My general observations, Spotted in the news

≈ Leave a comment

Martin Paul Eve has a new post on Zotero and auto-downloading open access books…

all I really wanted was to be able to embed an ISBN and a citation_pdf_url and have Zotero do the lookup and save the file. However, out of the box there is no easy way to do this.

His test book is quite interesting, his own new Close Reading with Computers: Textual Scholarship, Computational Formalism, and David Mitchell’s Cloud Atlas (April 2020), which applies textual computing to the science-fiction-philosophy novel Cloud Atlas.

I don’t know about or use the current version of Zotero, so I’m unsure what advantages it confers. I assume Eve intended to find a way to automatically harvest all CC-SA books in PDF, and build a local collection for automated analysis.

But I see his book is already on the OA book aggregator catalogue OAPEN. Theoretically then, since OAPEN is comprehensive and timely, one could have a harvester look at all the pages hanging off library.oapen.org/handle/ and save out only those pages with the required permissive CC “Rights” label on them. These pages each have a uniform PDF link URL in their HTML, in the form of library.oapen.org/bitstream/ and these could be easily extracted to a list. One would end up with a set of PDF links for a linkbot, ready to download to a local folder for computational analysis. I presume that’s what Eve intended to have Zotero do.

One would need to reference the OAPEN record page first, in the way I’ve suggested, since the PDF itself can have different or non-uniform or contradictory licence information. For instance in its interior Eve’s book is labelled as both “©” … “No part of this book may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying and recording, or in any information storage or retrieval system without the prior written permission of Stanford University Press.” and also “Creative Commons Attribution-ShareAlike 4.0”.

How many items on OAPEN have a creativecommons.org/licenses/by-sa/ “Rights” label at present, as Martin’s book does? A Google site: search suggests around 650 titles. Half an hour of my filtering the OAPEN CSV suggests it’s actually just over 3,000 under some form of permissive CC that permits commercial use. That’s still a manageable harvest at present. But as the supply of OA books and monographs grows rapidly, the likely result of various OA mandates in the near-future, it might be a useful time-saver for text-miners and digital humanists if OAPEN were to maintain a single torrent of all the PDFs. Inside which a half dozen folders would neatly organise the books by CC licence type. Such a one-click solution might save a lot of faffing around with digging into and filtering their XML and CSV feeds, wrangling with harvester scripts and timeouts, or trying to wrestle with third-party services such as Zotero. A torrent could also save OAPEN’s bandwidth.

Dead Carrot

24 Sunday Nov 2019

Posted by David Haden in My general observations, Spotted in the news

≈ Leave a comment

The Carrot2 search-engine appears to be kaput. It’s been down for days now, unable to show results. Multiple different browers, on different systems, even different VPNs, give no results. A query can be input, but… no results. Even a TOR browser can’t get it to respond. Such a pity, it was so promising.

The worthless Peter Lowe ad-blocking list

05 Saturday Oct 2019

Posted by David Haden in My general observations, Ooops!

≈ 1 Comment

The Peter Lowe ad-blocking list is obviously now worthless, due to its over-reach and scattergun blocking of all sorts of legitimate things. Back in June I found it blocking Harvard. I’ve since found all sorts of similar blocks on things that should not be blocked. I’ve unsubscribed my browser’s ad-blocker addon from the Peter Lowe list, and I suggest that you also consider doing so.

Carrot2 search – a new script for multi-column results

04 Friday Oct 2019

Posted by David Haden in JURN tips and tricks, My general observations, Spotted in the news

≈ 1 Comment

The Carrot2 search-engine has changed both URL and layout. It was at search.carrot2.org/stable/search and it’s now just at the search.carrot2.org URL. I guess the public-facing search may have come out of beta? This is what their new layout looks like…

Not good, on a widescreen desktop monitor.

Which means I’ve made another custom CSS for it. This slices the Carrot into a multi-column layout suited to a widescreen 1920px monitor. It works in the Stylus browser addon, and you need to tell the script to target the search.carrot2.org site

Yum, crunchy Carrot! All the z-depths are set up nicely, so you can still click on/in the filters and search box.

I like to read the URLs in search-results and so I’ve turned them dark green and wrapped them to make this possible. The results look good with URLs that line-wrap by up to three lines. But it’s unavoidable that some very long URLs will wrap over four lines, and will thus spill over the element below.

In most cases there should be no need for any scrolling.

It only works in Day mode, with the Carrot’s new toggle-able Night and Day mode…

If you want a night mode, you’ll have to manually change the colour chips to a charcoal black on…

body, body.light {

and

div.ResultList

The other drawback is that whereas the old multi-column fix showed about 25-30 results, now we’re down to about 16. If you want it up to 20+ add the following code…


.ResultClusters {
display: none;
}

There’s also the lack of a “more…” button, but this usefully forces the user to use the Carrot’s innovative faceting systems over in the left-hand pane.


To install my fix, simply go to Carrot2, then left-click on your icon for the Stylus browser addon and click “Write style for…”.

Then paste in this…


/* ==== CARROT2 - Multi-Columns v.02 Oct 2019 ==== */
/* run this Stylus script on search.carrot2.org */
div.document div.url {
overflow: hidden;
color: #3a7730;
font-size: 110%;
}
body, body.light {
background-color: #ece5db;
}
.ResultList > div > a > span.url {
color: #3f7126;
font-size: 80%;
font-weight: bold;
white-space: pre; /* CSS 2.0 */
white-space: pre-wrap; /* CSS 2.1 */
white-space: pre-line; /* CSS 3.0 */
white-space: -pre-wrap; /* Opera 4-6 */
white-space: -o-pre-wrap; /* Opera 7 */
white-space: -moz-pre-wrap; /* Mozilla */
white-space: -hp-pre-wrap; /* HP Printers */
word-wrap: break-word; /* IE 5+ */
}
a {
font-size: 100%;
}
.SearchForm {
z-index:20;
}
div.ResultList {
z-index:4;
position:absolute;
background-color: #ece5db;
padding-top: 180px;
top:0;
bottom:0;
right:0;
column-count: 5;
width: 70%
}
.ResultList > div > a >div {
font-size: 80%;
}
div.sources {
padding-top: 25px;
}
div.ClusterList {
background-color: #ece5db;
padding-top: 10px;
column-count: 2;
}
div.clusters-tabs {
width: 40%
}
div.clusters {
z-index:4;
width: 50%
}

Tested in Opera, which is a browser that runs on Chrome. It will probably work with other CSS style injectors.

Be warned that Carrot2 will perma-block an entire IP address (in the case of BT in the UK, or VPNs, that can mean hundreds of thousands of users) if it detects “excessive traffic”.

WordPress to ebook – without plugins

16 Monday Sep 2019

Posted by David Haden in My general observations

≈ Leave a comment

Needed: an easy three-click “WordPress to ebook” service that works on any free WordPress.com blog account. Such a thing doesn’t currently seem to exist. I can only find ebook plugins for install on third-party hosted installs of WordPress, or which need WordPress.com’s £240 per year Business account upgrade (which allows use of plugins).

Zinepal used to be a service that would do it from an RSS feed, but has been withdrawn. The similar ebookGlue is dead. I would have thought this would be an in-demand service for the millions of free WordPress blogs. But no, it seems not. Ideally such a service would allow you to mark only your wanted posts by previewing a snippet as well as the title, then let you re-shuffle them into themed chapters and sections, before outputting to an editable ebook-friendly format. I’m assuming a non-fiction blog here, with perhaps 3,000 posts.

What it doesn’t need to be is something just pushes the whole blog into an ePub and calls it done.

Update: Solved. WordPress2Doc – a free ebook converter for free WordPress.com blogs.

Button-holed

28 Tuesday May 2019

Posted by David Haden in My general observations

≈ Leave a comment

The Google CSE team is obviously updating with some tweaked code, on the front-end of the Custom Search Engines (CSEs).

As a result, JURN’s custom “click to search button” has become a small generic-looking one done in .SVG code. This is smaller and tighter than I’d like, and has replaced JURN’s nice custom .JPG button.

This “click to search” button still works fine, so it’s only an aesthetic change…

I’ll leave it a few days before jiggling the CSS to try to get the custom button back, just in case Google decides to revert the changes.

← Older posts
Subscribe: RSS News Feed.
I'm on Patreon!

JURN:

  • JURN : as a browser addon
  • JURN : directory of ejournals
  • JURN : main search-engine
  • JURN : openEco directory
  • JURN : repository search

Related sites:

  • 4 Humanities
  • Accuracy in Academia
  • Addsearch (CSE alternative)
  • Alliance Defending Freedom
  • ALPSP
  • alt.academy
  • AMIR
  • Anterotesis
  • Arcadia project
  • Archive Team
  • Arnetminer
  • Art Historicum (German)
  • AWOL
  • Beall's List (updated at 2018)
  • Beall’s List (old)
  • Beyond Search
  • Bibliographic wilderness
  • Booktwo
  • Campus Reform
  • Charleston Advisor
  • Coalition for Networked Information
  • Common Crawl
  • Communia (public domain watchdog)
  • Copy, Shake, and Paste
  • Cost of Knowledge
  • Council of Editors of Learned Journals
  • Current Cites
  • Dan Cohen
  • Digital Campus
  • Digital Koans
  • Digital Shift
  • Dissernet (Russian anti-plagiarism)
  • DOAB
  • doai.io (alt. DOI resolver)
  • DOAJ
  • Don't Block TOR
  • eFoundations
  • EIFL
  • Electronic Frontier Foundation
  • ELO
  • Embargo Watch
  • ePublishing Trust for Development
  • Facebook: Arab Open Access
  • Facebook: Italian Open Access
  • Facebook: Open Access India
  • Film Studies for Free
  • FIQ
  • FIRE
  • Flaky Academic Conferences
  • Found History
  • Foundation for Individual Rights in Education
  • Free Speech Tech Alliance
  • Free Speech Union (UK)
  • Go to Hellman
  • Google Algorithm
  • Google Scholar Digest
  • Harvard OA Tag Team (discoverability feed)
  • Heterodox Academy
  • Hybrid Publishing
  • Iconclass
  • IFLA Serials blog
  • ImpactStory
  • infoDocket
  • InTech Blog
  • Intl. Journal of the Commons
  • Jinfo (formerly Free Pint)
  • Keepers' Registry
  • Kindle blog
  • L'edition Electronique (French)
  • La Criee : periodiques (French)
  • Leader Statement Database on Free Speech
  • LibX Fulltext Linker
  • Mark Ware
  • Monographer
  • National Association of Scholars
  • National Coalition of Independent Scholars
  • Neil Beagrie
  • OA Lookup : Fees
  • OA Lookup : Policies
  • OA Working Group
  • OASPA
  • Online Searcher
  • Open Academic Society
  • Open Access Bibliography
  • Open Access News (archive)
  • Open Access Newsletter
  • Open Access Week
  • Open and Shut?
  • Open Electronic Publishing
  • Open Folklore
  • Open Humanities list
  • Open Knowledge Maps
  • Open Library of Humanities
  • Periodiques en ligne (French)
  • Peter Murray Rust
  • PKP / OJS
  • Professor Watchlist
  • Project Gutenberg
  • Publishers for Development
  • Publishing Archaeology
  • Quackwatch
  • RBA Blog
  • Reclaim the Net
  • Research Information
  • Research Remix
  • Retraction Watch
  • Right to Research
  • River Valley TV
  • ROARS (Italian)
  • Scholarly Communication News
  • Scholarly Electronic Publishing
  • Scholarship Matters
  • Search Engine News
  • Searchblox
  • Searcher
  • Semantico
  • Serials Cataloger
  • Serials Review
  • Society of Young Publishers
  • SPARC
  • Speech First
  • TaxoDiary (taxonomies news)
  • Taxpayer Access
  • The Keepers
  • The Scholarly Kitchen
  • Thoughts from Carl Grant
  • Unlocking Research
  • Web Scale Discovery
  • Web Scale Discovery mag
  • Zotero blog

Some of the libraries linking to JURN

  • AIATSIS
  • Boston College Libraries
  • Brooklyn Public Library, NY
  • Duke University
  • Florida State Universities
  • Gulf University, Kuwait
  • Kobe University, Japan
  • Northwestern University
  • Rhode Island College
  • San Jose State University
  • UConn Stamford
  • Université Paris
  • University College Cork
  • University of Brittany
  • University of California
  • University of Cambridge (Casimir Lewy Library)
  • University of Cambridge (main)
  • University of Canberra
  • University of Georgia
  • University of Lille
  • University of Rennes
  • University of Tanzania
  • University of Toronto
  • URFIST de Bretagne
  • Washington University
  • West Virginia University
  • York University

Spare BitCoins? Please send donations to JURN via: 17e2KGuyzjzEEE7BsoYTwMo3MtUod6DrjP

Archives

  • January 2021
  • December 2020
  • November 2020
  • October 2020
  • September 2020
  • August 2020
  • July 2020
  • June 2020
  • May 2020
  • April 2020
  • March 2020
  • February 2020
  • January 2020
  • December 2019
  • November 2019
  • October 2019
  • September 2019
  • August 2019
  • July 2019
  • June 2019
  • May 2019
  • April 2019
  • March 2019
  • February 2019
  • January 2019
  • December 2018
  • November 2018
  • October 2018
  • September 2018
  • August 2018
  • July 2018
  • June 2018
  • May 2018
  • April 2018
  • March 2018
  • February 2018
  • January 2018
  • December 2017
  • November 2017
  • October 2017
  • September 2017
  • August 2017
  • July 2017
  • June 2017
  • May 2017
  • April 2017
  • March 2017
  • February 2017
  • January 2017
  • December 2016
  • November 2016
  • October 2016
  • September 2016
  • August 2016
  • July 2016
  • June 2016
  • May 2016
  • April 2016
  • March 2016
  • February 2016
  • January 2016
  • December 2015
  • November 2015
  • October 2015
  • September 2015
  • August 2015
  • July 2015
  • June 2015
  • May 2015
  • April 2015
  • March 2015
  • February 2015
  • January 2015
  • December 2014
  • November 2014
  • October 2014
  • September 2014
  • August 2014
  • July 2014
  • June 2014
  • May 2014
  • April 2014
  • March 2014
  • February 2014
  • January 2014
  • December 2013
  • November 2013
  • October 2013
  • September 2013
  • August 2013
  • July 2013
  • June 2013
  • May 2013
  • April 2013
  • March 2013
  • February 2013
  • January 2013
  • December 2012
  • November 2012
  • October 2012
  • September 2012
  • August 2012
  • June 2012
  • May 2012
  • April 2012
  • March 2012
  • February 2012
  • January 2012
  • December 2011
  • November 2011
  • October 2011
  • September 2011
  • August 2011
  • July 2011
  • June 2011
  • May 2011
  • April 2011
  • March 2011
  • February 2011
  • January 2011
  • December 2010
  • November 2010
  • October 2010
  • September 2010
  • August 2010
  • July 2010
  • June 2010
  • May 2010
  • April 2010
  • March 2010
  • February 2010
  • January 2010
  • December 2009
  • November 2009
  • October 2009
  • September 2009
  • August 2009
  • July 2009
  • June 2009
  • May 2009
  • April 2009
  • March 2009
  • February 2009

Blog at WordPress.com.

Cancel
Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.
To find out more, including how to control cookies, see here: Cookie Policy