• Directory
  • FAQ: about JURN
  • Group tests
  • Guide to academic search
  • JURN’s donationware
  • openEco: nature titles indexed

News from JURN

~ search tool for open access content

News from JURN

Monthly Archives: May 2020

How to print a list of what’s in a Windows folder

30 Saturday May 2020

Posted by David Haden in JURN tips and tricks

≈ Leave a comment

How to print a list of what’s in a Windows folder. No need to wrestle with the Command prompt or Outlook. It can be done with most Web browsers…

1. Open a Web browser (not Internet Explorer). Highlight your folder, and drag it to the browser’s Address Bar (not onto the page itself).

2. The folder structure will then display as a Web page. Copy and paste to the clipboard.

3. If, for some reason, you can copy everything except the actual file-names, then simply print the page to a PDF. Copy-paste them from the PDF.

There is also a Windows freeware option called DirPrintOK. This looks fiendishly complex when you open it up. But it’s actually just working much like any normal Windows Explorer replacement, except it has a handy export option up in its top left corner…

This can save the list of a folder/directory’s files out to a .CSV file, as well as to plain text. Thus, this is the option to choose when you have a huge folder, and you want to divide the file-names from the datestamps and file-size information.

I’m worried about Google Search…

28 Thursday May 2020

Posted by David Haden in JURN's Google watch, Spotted in the news

≈ Leave a comment

Google has announced a new anxiety stress-test. No, it’s not one of their infernal ‘captchas’ on Google Search, guaranteed to send stress levels through the roof. This is a post-lockdown “seven-question survey” about one’s personal anxiety, developed with the National Alliance on Mental Illness. The Googlebots will keep the results private and locked down forever, or so they promise. But I’d be a little worried about that.

Verizon OneSearch

27 Wednesday May 2020

Posted by David Haden in Spotted in the news

≈ Leave a comment

Verizon OneSearch, a new Google competitor which appears to have been launched in February. Initially reported as being Bing-powered.

I had strange results. When first used it, it showed stuff I know Bing doesn’t index. I know what Bing (and by extension, DuckDuckGo) looks like for certain searches, and the OneSearch results were nothing like what I saw. I saw the exact results I’d expect from Google Search.

But on a second try with the same search 30 minutes later, the same search results reverted to looking exactly like the mediocre Bing / DuckDuckGo. Very curious.

Anyway, also of note is its Image search, with Google Images-like filters for size and CC licence…

The big drawback to everyday use is its curious 30 seconds of complete unresponsiveness, which seems to kick in every few minutes. When it’s actually responding, it’s fast.

The Brazilian SciELO is updating its criteria

13 Wednesday May 2020

Posted by David Haden in Spotted in the news

≈ Leave a comment

The Brazilian SciELO is updating its inclusion criteria. There are of course half a dozen other SciELO aggregators around South and Central America, but Brazil’s is the biggest.

According to the English-language summary, to stay in after 2020 the indexed journal must…

* accept for consideration articles “that are posted in a preprint server”

* be “citing and referencing all data, software codes and other content underlying the article’s texts”

* have in place “options for opening peer review”

No. 1 sounds good, and might be usefully extended to blog posts that included part of what later became an article.

No. 2 may be a bit problematic for those who rely on big closed computer-models, but I guess that simply “citing” that the model (presumably) exists may be deemed good enough. But it would be nice if SciELO required that the link should always lead to the full public data or model.

No. 3 appears to leave “opening” curiously unspecified. What options are acceptable, and by what criteria will a journal be judged to have engaged in “opening” its peer review? And how will this impact perfectly valid small single-editor journals in the arts and humanities? In which, for instance, the editor is the world-expert on the niche topic and single-handedly does a ‘light-touch’ peer-review on the year’s articles? Will they be forced to take on a new Peer Review Board, and then run and chase it, or else leave the Brazilian SciELO?

Curatives

13 Wednesday May 2020

Posted by David Haden in Spotted in the news

≈ Leave a comment

A mega-list of all known BBC ‘John Peel Sessions’ online, at May 2020. As you might expect, it’s nearly all at YouTube.

 
In a similar folk-curation (done by ‘curatives’?) vein, Weird Tales: 1923-1954 complete in a handy linked spreadsheet, taking you through to Archive.org.

WordPress2Doc – a free ebook converter for free WordPress.com blogs

13 Wednesday May 2020

Posted by David Haden in JURN tips and tricks

≈ 1 Comment

Update, March 2022: there’s now a better freeware solution.

Hurrah, there is freeware to get a free WordPress site to Word as a .DOCX file, and thence to an ebook. Only the one, and hardly known to search-engines, but it’s for Windows and it works. WordPress2Doc comes from Germany and was last updated by the author, Raffael Herrmann, in December 2017. Downloading is tricky. You need to right-click on the bit.ly download link and “saved linked content”, otherwise nothing happens.

The source code for WordPress2Doc is on Github.

It has some nice features…

* Select only certain posts from your exported .XML feed.

* There’s a post preview, over in a side-panel.

* Images are called and embedded (so long as the source WordPress blog is public, presumably).

* Images are correctly sized for the page (if you just copy-paste straight into Word from your blog post, they’re not).

* The blog’s pages are also exported and can be selected, as well as the posts.

* You can also save straight to a PDF, as well as Word.

* Can save each post as named Word .DOCX file.

There’s a PDF Help guide in the download, but there’s also a good Help video at YouTube. But it’s very simple and easy to use. Once you have your Word file, it’s then relatively easy to save to an ebook.

If you’re still using the old WordPress UI, then you may want my guide to how to export your site as an .XML export. Because using the old interface won’t get you a viable export. Incidentally, it seems WordPress are now chunking the .XML export of large blogs. I had two .XMLs in the backup .ZIP, for my medium-sized test blog.

The only bother for an ebook maker will be re-ordering into themed sections or chapters without a whole lot of tedious select-copy-paste, and WordPress2Doc can’t help even partly with that as it lacks tag-support and list re-ordering before saving the .DOCX file. The best way to tackle the problem is probably to export in themed batches, one per intended chapter or section, though that would entail making ten passes of what might be 4,000 posts. Which is not ideal. But it looks to me like WordPress2Doc works best with smaller blogs anyway.

But note that WordPress can export posts to .XML by category, at least. Also, WordPress2Doc can save multiple selected posts as ‘one file per-post’. You could then sort into chapter folders by dragging and dropping, and then use a Word joiner to join them together.


Update: I don’t like the over-compression of images with this method, and some of the text formatting is also lost. There are further problems, in that the software can’t seem to select all posts and stalls soon after selecting 500 of them…

I’m now coming to the conclusion that one would ideally tag posts with the category “ebook-ch1”, “ebook-ch2” etc in the blog itself, then view that category, then copy-paste the page(s) into (Libre Office) LibreOffice Writer. Unlike Word, this retains formatting, including italics and indented quotes, and also retains picture quality. In almost all cases the pictures are scaled correctly to the page width, which they’re not when using Word to do the same thing. Small pictures such as icons are not scaled. Word cannot do this, even with a macro.

This all assumes, however, that you still have a live blog with its CSS, and not just the .XML export files.

CSS and desist

11 Monday May 2020

Posted by David Haden in JURN tips and tricks

≈ 1 Comment

Ever wonder what a Web page would look like if just the plain HTML were shown, as if it were 1996 again? Ah, but which HTML? There are actually two forms of HTML that reach your browser, and there can be quite a disparity between these two types.

The first is what you see after the raw HTML has been pummelled about by CSS and javascript and the browser’s interpretation. This is often referred to as the ‘DOM’ HTML. This code is what you see and navigate through if you ‘Inspect element’ in your browser, or if you block an element with uBlock’s Element Picker tool.

The second type is the HTML code that gets sent to the browser in the first place, and that original is kept pristine and effectively ‘under’ the Web page. It can be seen via: right-click on page / ‘View source’. This source can then be selected and copied with a Ctrl + A / Ctrl + C keyboard command. Or it can be saved out when you ‘Save page as…’ / ‘Save as HTML only’, and from there you can re-open the saved page in the browser. Some remote CSS, javascript and images may still be called, even then.

A quicker way to ‘see’ this original without its CSS and other ‘remote-code’ flibbertigibbets is to install the add-on disable-HTML in your browser…

The addon is quite simple to use, and though old still works fine. It was somewhat mis-named, as it can robustly block everything except the HTML of the actual ‘page source’. With CSS and javascript blocked, it appears to be blocking the DOM version of the HTML from emerging from the page source. So what you see displayed, on page re-load, is effectively the page source. As such it can be quite handy for the removal of some types of especially tough and obstreperous CSS-and-javascript -driven overlays, in a situation where you don’t much care about the fancy wrapping and just want the words in a readable and/or copy-able form. Such as on the vile overlays of the unherd.com site.

Regex 2020

11 Monday May 2020

Posted by David Haden in JURN tips and tricks, Regex

≈ Leave a comment

I’ve expanded my free PDF file, “Some useful regex commands for Notepad++”, which had been released here in May 2019. Here’s the new one…


Update: now updated again My Little Regex Cookbook, for Notepad++ (September 2020).

AnswerThePublic.com

06 Wednesday May 2020

Posted by David Haden in Spotted in the news

≈ Leave a comment

AnswerThePublic.com is an interesting new search tool. Instead of searching for answers, it tries to pick up the questions being asked about products. It promises “a direct line to your customers’ thoughts”, or at least those customer-users who are savvy enough and non-expert enough to pose a well-formed question in the right place.

The searcher seems to be limited to three searches per day, after which your time is up and you’re shown this slice of cheese…

The results format is quite elegantly graphical and useful, though I can’t screenshot and discuss these here because… I’ve had my three searches. There was some kind of linkback to Google Search on some of the results tabs, which seemed to make it even more useful.

New from Google Research

05 Tuesday May 2020

Posted by David Haden in Academic search, Spotted in the news

≈ Leave a comment

Google Research has launched COVID-19 Research Explorer. This has “a semantic search interface” that enables better search and discovery across “more than 50,000 journal articles and preprints”.

← Older posts
Subscribe: RSS News Feed.
I'm on Patreon!

JURN:

  • JURN : directory of ejournals
  • JURN : main search-engine
  • JURN : openEco directory
  • JURN : repository search

Related sites:

  • 4 Humanities
  • Academic Freedom Alliance
  • Accuracy in Academia
  • Alliance Defending Freedom
  • ALPSP
  • alt.academy
  • AMIR
  • Anterotesis
  • Arcadia project
  • Art Historicum (German)
  • AWOL
  • Beall's List (updated at 2018)
  • Beall’s List (old)
  • Beyond Search
  • Bibliographic wilderness
  • Booktwo
  • Campus Reform
  • Charleston Advisor
  • Coalition for Networked Information
  • Communia (public domain watchdog)
  • Cost of Knowledge
  • Council of Editors of Learned Journals
  • Dan Cohen
  • Digital Koans
  • Digital Shift
  • Dissernet (Russian anti-plagiarism)
  • DOAJ
  • Don't Block TOR
  • eFoundations
  • EIFL
  • Electronic Frontier Foundation
  • ELO
  • Embargo Watch
  • ePublishing Trust for Development
  • Facebook: Arab Open Access
  • Facebook: Italian Open Access
  • Facebook: Open Access India
  • Film Studies for Free
  • FIRE
  • Flaky Academic Conferences
  • Found History
  • Foundation for Individual Rights in Education
  • Free Speech Union (UK)
  • Google Algorithm
  • Heterodox Academy
  • Iconclass
  • IFLA Serials blog
  • ImpactStory
  • infoDocket
  • InTech Blog
  • Jinfo (formerly Free Pint)
  • Kindle blog
  • L'edition Electronique (French)
  • La Criee : periodiques (French)
  • Leader Statement Database on Free Speech
  • National Association of Scholars
  • National Coalition of Independent Scholars
  • Neil Beagrie
  • OA Lookup : Policies
  • OA Working Group
  • OASPA
  • Online Searcher
  • Open Access Bibliography
  • Open Access Week
  • Open and Shut?
  • Open Electronic Publishing
  • Open Folklore
  • Open Knowledge Maps
  • Open Library of Humanities
  • Periodiques en ligne (French)
  • Peter Murray Rust
  • PKP / OJS
  • Project Gutenberg
  • Publishing Archaeology
  • RBA Blog
  • Reclaim the Net
  • Research Information
  • Research Remix
  • Right to Research
  • River Valley TV
  • ROARS (Italian)
  • Scholarly Electronic Publishing
  • Scholarship Matters
  • Searchblox
  • Searcher
  • Serials Cataloger
  • Serials Review
  • Society of Young Publishers
  • Speech First
  • TaxoDiary (taxonomies news)
  • Taxpayer Access
  • Tentaclii
  • The Scholarly Kitchen
  • Thoughts from Carl Grant
  • Web Scale Discovery
  • Zotero blog

Some of the libraries linking to JURN

  • Boston College Libraries
  • Brooklyn Public Library, NY
  • Duke University
  • Kobe University, Japan
  • Rhode Island College
  • San Jose State University
  • UConn Stamford
  • University of California
  • University of Cambridge (Casimir Lewy Library)
  • University of Cambridge (main)
  • University of Canberra
  • University of Toronto
  • Washington University
  • West Virginia University

Spare BitCoins? Please send donations to JURN via: 17e2KGuyzjzEEE7BsoYTwMo3MtUod6DrjP

Archives

  • January 2023
  • December 2022
  • November 2022
  • October 2022
  • September 2022
  • August 2022
  • July 2022
  • June 2022
  • May 2022
  • April 2022
  • March 2022
  • February 2022
  • January 2022
  • December 2021
  • November 2021
  • October 2021
  • September 2021
  • August 2021
  • July 2021
  • June 2021
  • May 2021
  • April 2021
  • March 2021
  • February 2021
  • January 2021
  • December 2020
  • November 2020
  • October 2020
  • September 2020
  • August 2020
  • July 2020
  • June 2020
  • May 2020
  • April 2020
  • March 2020
  • February 2020
  • January 2020
  • December 2019
  • November 2019
  • October 2019
  • September 2019
  • August 2019
  • July 2019
  • June 2019
  • May 2019
  • April 2019
  • March 2019
  • February 2019
  • January 2019
  • December 2018
  • November 2018
  • October 2018
  • September 2018
  • August 2018
  • July 2018
  • June 2018
  • May 2018
  • April 2018
  • March 2018
  • February 2018
  • January 2018
  • December 2017
  • November 2017
  • October 2017
  • September 2017
  • August 2017
  • July 2017
  • June 2017
  • May 2017
  • April 2017
  • March 2017
  • February 2017
  • January 2017
  • December 2016
  • November 2016
  • October 2016
  • September 2016
  • August 2016
  • July 2016
  • June 2016
  • May 2016
  • April 2016
  • March 2016
  • February 2016
  • January 2016
  • December 2015
  • November 2015
  • October 2015
  • September 2015
  • August 2015
  • July 2015
  • June 2015
  • May 2015
  • April 2015
  • March 2015
  • February 2015
  • January 2015
  • December 2014
  • November 2014
  • October 2014
  • September 2014
  • August 2014
  • July 2014
  • June 2014
  • May 2014
  • April 2014
  • March 2014
  • February 2014
  • January 2014
  • December 2013
  • November 2013
  • October 2013
  • September 2013
  • August 2013
  • July 2013
  • June 2013
  • May 2013
  • April 2013
  • March 2013
  • February 2013
  • January 2013
  • December 2012
  • November 2012
  • October 2012
  • September 2012
  • August 2012
  • June 2012
  • May 2012
  • April 2012
  • March 2012
  • February 2012
  • January 2012
  • December 2011
  • November 2011
  • October 2011
  • September 2011
  • August 2011
  • June 2011
  • April 2011
  • March 2011
  • February 2011
  • January 2011
  • December 2010
  • November 2010
  • October 2010
  • September 2010
  • August 2010
  • July 2010
  • June 2010
  • May 2010
  • April 2010
  • March 2010
  • February 2010
  • January 2010
  • December 2009
  • November 2009
  • October 2009
  • September 2009
  • August 2009
  • July 2009
  • June 2009
  • May 2009
  • April 2009
  • March 2009
  • February 2009

Create a free website or blog at WordPress.com.

  • Follow Following
    • News from JURN
    • Join 901 other followers
    • Already have a WordPress.com account? Log in now.
    • News from JURN
    • Customize
    • Follow Following
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar
 

Loading Comments...