[Bug 580961] Re: unzip fails to deal correctly with filename encodings
Bug Watch Updater
580961 at bugs.launchpad.net
Sat Apr 16 05:00:36 UTC 2016
Launchpad has imported 36 comments from the remote bug at
https://bugzilla.novell.com/show_bug.cgi?id=540598.
If you reply to an imported comment from within Launchpad, your comment
will be sent to the remote bug automatically. Read more about
Launchpad's inter-bugtracker facilities at
https://help.launchpad.net/InterBugTracking.
------------------------------------------------------------------------
On 2009-09-21T05:38:29+00:00 5-pavel wrote:
Created attachment 319015
An archive file with cyrillic file names included
User-Agent: Mozilla/5.0 (X11; U; Linux i686; ru; rv:1.9.0.13)
Gecko/2009080200 SUSE/3.0.13-0.1.2 Firefox/3.0.13
There are several discussions about the problem concerning cyrillic
filenames in zip archives and unzip package. Unzip out-of-the-box
(compiled from sources) does not choose filenames encoding correctly.
Developers from Ark say me, that the error is completely from info-zip
project (https://bugs.kde.org/show_bug.cgi?id=204984).
There are sime patches to info-zip's unzip package, that makes unzip
extract filenames with correct encoding. But maintainers of info-zip
project rejected these patches (http://www.info-
zip.org/board/board.pl?m-1248086794).
It would be nice to include this package in main openSuSE distribution.
Reproducible: Always
Steps to Reproduce:
1. Create zip-archive, containing files with cyrillic names under Windows.
2. Try to open it with unzip under SuSE
Actual Results:
Filename encoding is incorrect. Example:
pavel at pavel:~/tmp> unzip ReportPacket_DBV90821CJ.zip
Archive: ReportPacket_DBV90821CJ.zip
inflating: ???????? ????? (????????).pdf
inflating: ???????? ????? (??????????).pdf
Expected Results:
Results, produced with natspec patch from sisyphus
pavel at rzn-sepak-bpa:~/backup> pavel at rzn-sepak-bpa:~/temp> unzip ReportPacket_DBV90821CJ.zip
Archive: ReportPacket_DBV90821CJ.zip
inflating: ????????? ????? (??????????).pdf
inflating: ????????? ????? (??????????).pdf
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/0
------------------------------------------------------------------------
On 2010-03-18T11:18:21+00:00 Dvaleev wrote:
We have found solution. But it requires additional libraries to convert
file names on the fly.
The library is librcc especially created for handling non utf encoded
file names.
How we can proceed then? RPM packages are built on OBS and tested.
Should we create submit request?
The librcc and patched unzip are here:
http://download.opensuse.org/repositories/home:/Lazy_Kent/openSUSE_11.2/
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/1
------------------------------------------------------------------------
On 2010-03-18T13:45:55+00:00 Pth-3 wrote:
*** Bug 575715 has been marked as a duplicate of this bug. ***
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/2
------------------------------------------------------------------------
On 2010-04-01T14:20:43+00:00 Anaumov wrote:
Submit request:
https://build.opensuse.org/stage/request/diff/34833
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/3
------------------------------------------------------------------------
On 2010-04-07T12:27:17+00:00 Stian Viskjer wrote:
This is also a problem with the letters 'æ ø å' used in some of the
Scandinavian alphabets.
It's also an issue for tar's created by 7zip on Windows.
Unzip 6.0 and the packages from home:/Lazy_Kent/openSUSE_11.2/ mentioned
in comment 1 doesn't change anything on my system. (11.2 x86_64)
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/4
------------------------------------------------------------------------
On 2010-05-03T17:28:57+00:00 Kyrill Detinov wrote:
I made a submit request to Factory:
https://build.opensuse.org/request/diff/39326
Confirmed, it works at least with Russian, Czech and Slovak.
http://lizards.opensuse.org/2010/04/07/call-for-testing-unzip-feature/
% LANG=cs_CZ.utf8 unzip -l test-cz.zip
Archive: test-cz.zip
Length Date Time Name
-------- ---- ---- ----
117 03-18-10 15:24 aábcčdďeéěfghchiíjklmnňoópqrřsštťuúůvwxyýzžAÁBCČDĎEÉĚFGHCHIÍJKLMNŇOÓPQRŘSŠTŤUÚŮVWXYÝZŽ.txt
-------- -------
117 1 file
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/5
------------------------------------------------------------------------
On 2010-05-04T17:25:46+00:00 Pth-3 wrote:
I won't accept the patch for openSUSE because upstream doesn't accept it
and openSUSE would have to maintain this patch indefinitely. If this or
a similiar patch gets accepted upstream I'll help in backporting it.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/6
------------------------------------------------------------------------
On 2010-05-04T17:38:04+00:00 Anixx wrote:
What about changing the file to Sisyphus' patched version? If openSUSe
cannot maintain it, let's Alt Linux team do the maintenance and regard
them as upstream of a forked version?
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/7
------------------------------------------------------------------------
On 2010-05-04T17:41:08+00:00 Anixx wrote:
Well it is really annoying: nobody can open archives made under Windows.
People of business say Linux is buggy: it even cannot open archives
properly. The same say government officials.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/8
------------------------------------------------------------------------
On 2010-05-09T23:25:33+00:00 Dvaleev wrote:
@Philipp
Chances to push this patch to upstream is very small or even not possible at all. Other distributions tried to accomplish that without success.
The upstream statement is: The trend in IT is to use UTF8.
That's why patch is not accepted.
Then why we can't accept this patch as openSUSE specific to close such
annoying bug? And maintain it until good time comes. openSUSE maintain a
numbers of specific patches for rpm, OpenOffice.org
If you won't maintain patch, please let community to do it.
The patch is small. It introduces new header and changes few strings of main code.
We tested patched unzip for two-three months and it just works. Also we got positive feedback on Czech and Slovak in addition to Russian language.
It also pretty applicable on latest 6.0 unzip version.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/9
------------------------------------------------------------------------
On 2010-05-10T17:29:43+00:00 Pth-3 wrote:
OK, after thinking about this I have added the patch to our unzip and will keep it at least as long as the package builds and the patch doesn't need extra work. Kyrilk, would you be willing to act as co-maintainer? Or to ask more more broadly, would anyone of you be willing to comaintain zip/unzip?
I'll also try to get an update for 11.2 out of the door.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/10
------------------------------------------------------------------------
On 2010-05-10T18:13:48+00:00 Kyrill Detinov wrote:
Philipp, I made sr#39767.
At the moment we have librcc0 in Factory only. So we may build patched unzip against Factory.
I added %if 0%{?suse_version} > 1120 for all the chahges.
> would you be willing to act as co-maintainer?
Yes, I'd like to take this role.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/11
------------------------------------------------------------------------
On 2010-05-20T15:24:26+00:00 Meissner-i wrote:
do we really want to take 2 new libraries for 11.2? not sure.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/19
------------------------------------------------------------------------
On 2010-05-20T16:46:27+00:00 Bruno-ioda-net wrote:
In reply to C12
More & more customers are having incoming zip from differents encodings and it's really a pain to explain, oh this zip should be unzip under window to get the right encoding. We look like charlot.
So as 11.2 as a long life in front of it, yes I'm voting for having it
include as fast as possible. The bug start under 11.2, so I feel it's
better to close it on 11.2, and be sure it was integrated in 11.3
Or (I'm only seeing ma world part :-) ) there's a much complicated
implication, if so it should be explain.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/20
------------------------------------------------------------------------
On 2010-05-25T13:43:06+00:00 Cdengler-z wrote:
I'm not happy about adding two new libraries to a released product, but
in this case I think it should be fine if someone will maintain them.
(+1)
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/24
------------------------------------------------------------------------
On 2010-05-25T13:49:35+00:00 Meissner-i wrote:
so lets do it. :)
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/25
------------------------------------------------------------------------
On 2010-05-25T13:56:21+00:00 Swamp-a wrote:
The SWAMPID for this issue is 33540.
This issue was rated as low.
Please submit fixed packages as soon as possible.
Also create a patchinfo file using this link:
https://swamp.suse.de/webswamp/wf/33540
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/26
------------------------------------------------------------------------
On 2010-05-25T14:00:53+00:00 Cdengler-z wrote:
Update process started ... be so kind and submit fixed sources and a
patchinfo.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/27
------------------------------------------------------------------------
On 2010-05-26T09:24:54+00:00 Pth-3 wrote:
@Marcus: which is the second new library? unzip only needs librcc0.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/31
------------------------------------------------------------------------
On 2010-05-26T09:27:04+00:00 Meissner-i wrote:
librcc however requires librcd
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/32
------------------------------------------------------------------------
On 2010-06-17T12:22:32+00:00 Swamp-a wrote:
Update released for: librcc-devel, librcc0, librcc0-debuginfo, librcc0-debugsource, librcd-devel, librcd0, librcd0-debuginfo, librcd0-debugsource, rcc-runtime, rcc-runtime-debuginfo, unzip, unzip-debuginfo, unzip-debugsource
Products:
openSUSE 11.2 (debug, i586, x86_64)
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/38
------------------------------------------------------------------------
On 2010-06-17T12:24:15+00:00 Cdengler-z wrote:
Update released after a long testing phase in the test update channel.
Closing.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/39
------------------------------------------------------------------------
On 2010-09-22T19:37:56+00:00 Anixx wrote:
Still does not work in File Roller under OpenSUSE 11.3.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/62
------------------------------------------------------------------------
On 2010-09-22T19:39:57+00:00 Anixx wrote:
Created attachment 391041
file with problem
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/63
------------------------------------------------------------------------
On 2010-09-22T19:40:25+00:00 Anixx wrote:
Created attachment 391042
screenshot of file roller
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/64
------------------------------------------------------------------------
On 2010-09-22T19:40:53+00:00 Anixx wrote:
The same file (bug.zip) opens well with Ark from KDE3.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/65
------------------------------------------------------------------------
On 2010-09-22T19:42:18+00:00 Anixx wrote:
Created attachment 391043
the same file opened in Ark/KDE3
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/66
------------------------------------------------------------------------
On 2010-09-23T04:10:46+00:00 Kyrill Detinov wrote:
Works OK.
% unzip -l bug-540598_bug.zip
Archive: bug-540598_bug.zip
Length Date Time Name
-------- ---- ---- ----
72704 09-20-10 23:11 Коммерческое предложение..doc
388608 09-20-10 23:11 прайс на палатки и снаряжение14.09.2010.xls
-------- -------
461312 2 files
Open a bug against File Roller. No problem with unzip.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/67
------------------------------------------------------------------------
On 2010-09-23T04:38:51+00:00 Anixx wrote:
Does File Roller use unzip in this case?
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/68
------------------------------------------------------------------------
On 2010-09-23T14:36:25+00:00 Kyrill Detinov wrote:
It should use unzip. But I found an interesting bugreport:
https://bugzilla.gnome.org/show_bug.cgi?id=611257
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/69
------------------------------------------------------------------------
On 2010-09-23T15:03:16+00:00 Anixx wrote:
Удалил p7zip. Теперь в File Roller все нормально, но встроенный
просмотрщик архивов в КДЕ3 все равно показывает мусор (в Ark все
нормально).
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/70
------------------------------------------------------------------------
On 2010-09-23T15:05:08+00:00 Anixx wrote:
Removed p7zip. Now all OK in File Roller, but embeeded viewer in KDE3
still shows garbage (in Ark all OK).
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/71
------------------------------------------------------------------------
On 2010-09-23T15:05:32+00:00 Anixx wrote:
Created attachment 391237
what I see in embeeded viewer
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/72
------------------------------------------------------------------------
On 2010-09-23T16:43:58+00:00 Kyrill Detinov wrote:
Same here. Krusader 1.90.0 shows file names correctly.
As you know, nobody interested to fix KDE3 bugs.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/73
------------------------------------------------------------------------
On 2010-09-23T17:36:49+00:00 Anixx wrote:
Maybe this bug is fixed in Trinity. If not, it is possible to make a
bugreport.
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/74
------------------------------------------------------------------------
On 2016-04-15T09:54:11+00:00 Bwiedemann wrote:
This is an autogenerated message for OBS integration:
This bug (540598) was mentioned in
https://build.opensuse.org/request/show/39794 Factory / unzip
https://build.opensuse.org/request/show/40783 11.2 / librcd0
https://build.opensuse.org/request/show/40784 11.2 / librcc0
https://build.opensuse.org/request/show/40785 11.2 / unzip
https://build.opensuse.org/request/show/40799 11.2:Test / unzip
Reply at:
https://bugs.launchpad.net/ubuntu/+source/unzip/+bug/580961/comments/197
** Bug watch added: KDE Bug Tracking System #204984
https://bugs.kde.org/show_bug.cgi?id=204984
** Bug watch added: GNOME Bug Tracker #611257
https://bugzilla.gnome.org/show_bug.cgi?id=611257
--
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to unzip in Ubuntu.
https://bugs.launchpad.net/bugs/580961
Title:
unzip fails to deal correctly with filename encodings
Status in File Roller:
Confirmed
Status in Linux Mint:
Triaged
Status in Ubuntu Japanese Kaizen Project:
Fix Committed
Status in unzip:
Unknown
Status in unzip package in Ubuntu:
Fix Released
Status in unzip source package in Precise:
Fix Released
Status in unzip source package in Quantal:
Fix Released
Status in unzip source package in Raring:
Fix Released
Status in unzip package in Debian:
Confirmed
Status in Gentoo Linux:
Won't Fix
Status in unzip package in Mandriva:
Unknown
Status in unzip package in openSUSE:
Fix Released
Bug description:
Binary package hint: unzip
This is a fairly annoying bug that's been around and known at least
since 2005. It's very visible as it will very often make exchange of
zip files with Windows users impossible, for example. As such, it
gathered it's fair share of "me too" and "how dare you haven't fixed
this yet!!111!" comments.
Problem description:
zip/unzip and the specification fall short when dealing with non-ASCII filenames not encoded in UTF-8
test case:
do an "unzip -l" on the file http://tinyurl.com/2aofpxs and witness the question marks
affected programs:
the problem is in unzip itself, but affects GUI like xarchiver, file-roller, etc. that rely on unzip for the decompression
suggested solutions (most are workarounds, not proper fixes):
a) reintroduce patch for codepage-based zip filenames: bug 477755, http://tinyurl.com/2aqdbqg (Ubuntu blueprint)
b) unzip filename according to locale: bug 203609
c) Ubuntu JP has a patch, probably not generally applicable, bug 269482
d) Russian altlinux distro uses natspec lib and patched zip binary
natspec was mentioned in bug 477755 comment #2 and may indeed be a
proper fix, needs closer inspection (I haven't really looked, yet. As
discussed in https://bugzilla.gnome.org/show_bug.cgi?id=306403 there
is no failsafe, straight-forward way to fix this in all cases.
Nonetheless, the current situation can and should be improved.
There's some good ideas floating around. It needs somebody to pull
and wrap them together.
It's unfortunate the FOSS community so far hasn't been able to fix
this rather visible problem. I'm opening this ticket as a master bug
and clean slate to document the issue and current status. Please
don't ruin it by making above-mentioned unhelpful comments, they
actually slow things down! Please don't nominate for a release.
Unless you're a dev and can provide a patch, you should think VERY
carefully to do anything but
1) subscribe yourself to this ticket
2) mark this bug as affecting you
3) tell me via mail about other bugs you think are a duplicate of this one, discussing the same problem
1) to 3) will showcase to the devs how many people are affected and
that is the only real chance we have for somebody to take a serious
look. "Me too" comments do the opposite, so again, please don't do
it.
To manage notifications about this bug go to:
https://bugs.launchpad.net/file-roller/+bug/580961/+subscriptions
More information about the foundations-bugs
mailing list