[Bug 1937850] [NEW] the -L / --relative option breaks --accept-regex

Bill Yikes 1937850 at bugs.launchpad.net
Fri Jul 23 16:44:03 UTC 2021


Public bug reported:

This command should, according to the documentation, fetch a few *.pdf files:

$ wget -r --level 1 --adjust-extension --relative --no-clobber --no-directories\
       --domains=ncua.gov --accept-regex 'administrative-orders/.*.pdf'\
       'https://www.ncua.gov/regulation-supervision/enforcement-actions/administrative-orders?page=22&sort=year&dir=desc&sq='

But it downloads none of the *.pdf files.  When the --relative option is
removed, the PDF files are downloaded.  Yet inspecting the top-level HTML
page shows that the links to the PDF files are in fact relative, so
--relative should not be excluding them.
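
One rough way to check the claim that the PDF links are relative is to
list the href values in the page and label each by whether it starts with
an http(s) scheme.  This is only a sketch: classify_links is a hypothetical
helper name, it assumes double-quoted href attributes, and it uses grep
rather than a real HTML parser.

```shell
# classify_links: read HTML on stdin, print one line per href found.
# Assumption: href values are double-quoted. Anything that does not start
# with http:// or https:// is labelled "relative" (this lumps in mailto:
# and similar schemes, which is good enough for a quick look).
classify_links() {
    grep -o 'href="[^"]*"' |
    sed -e 's/^href="//' -e 's/"$//' |
    while IFS= read -r link; do
        case "$link" in
            http://*|https://*) printf 'absolute %s\n' "$link" ;;
            *)                  printf 'relative %s\n' "$link" ;;
        esac
    done
}
```

For example, piping the output of wget -q -O - '<page URL>' into
classify_links and filtering with grep pdf shows how the PDF links are
written in the page.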

** Affects: wget (Ubuntu)
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to wget in Ubuntu.
https://bugs.launchpad.net/bugs/1937850

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/wget/+bug/1937850/+subscriptions



