Showing content from https://phabricator.wikimedia.org/T301291 below:

⚓ T301291 PDF and Djvu files on Commons failed to be processed (no thumbnails, zero pages) but otherwise valid

Event Timeline Comment Actions

What is this wikimirror.org? Why change links to that?

So this list is exhaustive. I went through all PDF and DjVu files on Wikimedia Commons as of last week, so it is not just a random sample. If we fix these, then all of them will be fixed. :-)

Comment Actions

No, this one seems to be just a slightly broken PDF. I just fixed it.

Comment Actions

That's odd, I created the PDF file from a Word document. (OK, on second thought that's not odd at all :-) ) Thanks!

Comment Actions

So I fixed it using mutool clean. But the ones I listed above cannot be fixed this way, and that is what I am reporting. For those files, mutool clean does not help, the MediaBox values show reasonable page sizes (including for the first page), and even the metadata shows that the page size is available. Example for the first file above:

{
    "name": "pdf-PageSize",
    "value": [
        {
            "name": 0,
            "value": "612 x 792 pts (letter)"
        },
        {
            "name": 1,
            "value": "697 x 855 pts"
        }
    ]
}

But MediaWiki does not show width and height. So something is wrong.
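To make the mismatch concrete, here is a minimal Python sketch that reads the page sizes out of the metadata fragment above. The `"<width> x <height> pts"` string format is taken from that example only; the function name `parse_page_sizes` and the regular expression are my own illustration, not part of MediaWiki.

```python
import re

# Metadata fragment copied from the example above.
metadata = {
    "name": "pdf-PageSize",
    "value": [
        {"name": 0, "value": "612 x 792 pts (letter)"},
        {"name": 1, "value": "697 x 855 pts"},
    ],
}

# Assumed format: "<width> x <height> pts", optionally followed by a paper name.
SIZE_RE = re.compile(r"^\s*(\d+(?:\.\d+)?)\s*x\s*(\d+(?:\.\d+)?)\s*pts\b")


def parse_page_sizes(entry):
    """Return {page_index: (width_pts, height_pts)} for entries that parse."""
    sizes = {}
    for page in entry["value"]:
        match = SIZE_RE.match(page["value"])
        if match is None:
            # Skip strings that do not follow the assumed format.
            continue
        sizes[page["name"]] = (float(match.group(1)), float(match.group(2)))
    return sizes


page_sizes = parse_page_sizes(metadata)
print(page_sizes)
```

If this returns a non-zero size for page 0, the size information is present in the stored metadata even though the file page shows no width and height.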

Comment Actions

@mau If you made this PDF yourself, could I recommend removing the first blank page? Because otherwise the first thumbnail does not show anything.

Comment Actions

@Mitar it's probably even better to replace the first page with the actual cover of the book, indeed. I'll proceed :-)

Comment Actions

I ran into the same problem. I don't know if this can be considered a solution, because these steps have to be done on the server side, but I solved my problem:

  1. step – repair thumbnails for files in core MediaWiki
php maintenance/refreshImageMetadata.php --verbose --mime image/vnd.djvu --force
  2. step – do a null edit of the index pages of Extension:Proofread_Page (needed to update the page count shown on the special page)
php maintenance/refreshLinks.php --namespace 252
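
For anyone who wants to run both steps in order, here is a small Python sketch that wraps the two commands above. The flags are exactly the ones quoted; the helper names `build_refresh_commands` and `run_refresh` are hypothetical, and it assumes the script is started from the MediaWiki root with `php` on the PATH. Namespace 252 is the value from my wiki and may differ on yours. It defaults to a dry run that only prints the commands.

```python
import subprocess


def build_refresh_commands(mime="image/vnd.djvu", index_namespace=252):
    """Build the argv lists for the two maintenance steps, in order."""
    return [
        # Step 1: regenerate stored file metadata for the given MIME type.
        ["php", "maintenance/refreshImageMetadata.php",
         "--verbose", "--mime", mime, "--force"],
        # Step 2: refresh links for the index-page namespace.
        ["php", "maintenance/refreshLinks.php",
         "--namespace", str(index_namespace)],
    ]


def run_refresh(cwd, dry_run=True, **kwargs):
    """Print the commands (dry run) or execute them, stopping on failure."""
    commands = build_refresh_commands(**kwargs)
    for argv in commands:
        if dry_run:
            print(" ".join(argv))
        else:
            # check=True raises CalledProcessError if a step fails,
            # so step 2 never runs after a failed step 1.
            subprocess.run(argv, cwd=cwd, check=True)
    return commands


commands = run_refresh(cwd=".", dry_run=True)
```

Calling `run_refresh(cwd="/path/to/mediawiki", dry_run=False)` would actually execute the steps; the path here is a placeholder.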
