View Issue Details

IDProjectCategoryView StatusLast Update
0001865Double CommanderFile operationspublic2017-06-20 19:24
ReporterX1353135 Assigned To 
PrioritynormalSeveritymajorReproducibilityalways
Status newResolutionopen 
ProjectionnoneETAnone 
PlatformMS WindowsOSMS WindowsOS Version8.1
Product Version0.7.8Product Build2017/02/24 
Summary0001865: Multi byte character can not be searched by regex correctly
DescriptionThe regex pattern of D.F can not find text of DEF in files of ucs2 little endian encoding regardless of the selecting ucs2le on search dialog box.
-> see search_ucs2le.PNG

The regex pattern of 一.三 can not find text of 一.三 in files of utf8 encoding regardless of the selecting utf8 on search dialog box.
-> search_utf8_mbchar.PNG.
Steps To ReproduceSee attached screenshots
TagsNo tags attached.
Attached Files
files1.txt (1,104 bytes)   
$ find . -type f -print -exec xxd {} \;
./ucs2le.txt
00000000: 4100 4200 4300 0a00 4400 4500 4600 0a00  A.B.C...D.E.F...
00000010: 4700 4800 4900 0a00 004e 8c4e 094e 0a00  G.H.I....N.N.N..
00000020: db56 944e 6d51 0a00 034e 6b51 5d4e 0a00  .V.NmQ...NkQ]N..
00000030: 6d51 944e db56 0a00 f158 105f c253 0a00  mQ.N.V...X._.S..
00000040: f958 308d c153 0a00                      .X0..S..
./utf8.txt
00000000: 4142 430a 4445 460a 4748 490a e4b8 80e4  ABC.DEF.GHI.....
00000010: ba8c e4b8 890a e59b 9be4 ba94 e585 ad0a  ................
00000020: e4b8 83e5 85ab e4b9 9d0a e5a3 b1e5 bc90  ................
00000030: e58f 820a e5a3 b9e8 b4b0 e58f 810a       ..............
./cp936.txt
00000000: 4142 430a 4445 460a 4748 490a d2bb b6fe  ABC.DEF.GHI.....
00000010: c8fd 0acb c4ce e5c1 f90a c6df b0cb bec5  ................
00000020: 0ad2 bcb7 a1c8 fe0a                      ........
./cp932.txt
00000000: 4142 430a 4445 460a 4748 490a 88ea 93f1  ABC.DEF.GHI.....
00000010: 8e4f 0a8e 6c8c dc98 5a0a 8eb5 94aa 8be3  .O..l...Z.......
00000020: 0a88 eb93 f38e 510a                      ......Q.

files1.txt (1,104 bytes)   
files2.PNG (19,647 bytes)   
files2.PNG (19,647 bytes)   
search_ansi.PNG (22,600 bytes)   
search_ansi.PNG (22,600 bytes)   
search_ansi_result.PNG (16,462 bytes)   
search_ansi_result.PNG (16,462 bytes)   
search_ansi_result.PNG (16,462 bytes)   
search_ansi_result.PNG (16,462 bytes)   
search_ucs2le.PNG (22,706 bytes)   
search_ucs2le.PNG (22,706 bytes)   
search_ucs2le_relust.PNG (16,196 bytes)   
search_ucs2le_relust.PNG (16,196 bytes)   
search_utf8_mbchar.PNG (22,671 bytes)   
search_utf8_mbchar.PNG (22,671 bytes)   
search_utf8_mbchar_result.PNG (12,884 bytes)   
search_utf8_mbchar_result.PNG (12,884 bytes)   
search_ucs2le_mbchar.PNG (23,488 bytes)   
search_ucs2le_mbchar.PNG (23,488 bytes)   
search_ucs2le_mbchar_result.PNG (12,884 bytes)   
search_ucs2le_mbchar_result.PNG (12,884 bytes)   
ucs2le.txt (72 bytes)   
ABC
DEF
GHI
N�N	N
�V�NmQ
NkQ]N
mQ�N�V
�X_�S
�X0��S
ucs2le.txt (72 bytes)   
utf8.txt (62 bytes)   
ABC
DEF
GHI
一二三
四五六
七八九
壱弐参
壹贰叁
utf8.txt (62 bytes)   
cp932.txt (40 bytes)   
ABC
DEF
GHI
���O
�l�ܘZ
������
���Q
cp932.txt (40 bytes)   
cp936.txt (40 bytes)   
ABC
DEF
GHI
һ���
����
�߰˾�
Ҽ���
cp936.txt (40 bytes)   
0001865.zip (144,244 bytes)
Fixed in Revision
Operating systemWindows
Widgetset
Architecture32-bit

Activities

X1353135

2017-06-20 19:24

reporter   ~0002288

This is a fix of the bug description:

The regex pattern of D.F can not find text of DEF in files of ucs2 little endian encoding although user selects ucs2le as encoding on the search dialog box.
-> See search_ucs2le.PNG

The regex pattern of 一.三 can not find text of 一二三 in files of utf8 encoding although user selects utf8 as encoding on search dialog box.
-> See search_utf8_mbchar.PNG.

Issue History

Date Modified Username Field Change
2017-06-17 18:26 X1353135 New Issue
2017-06-17 18:26 X1353135 File Added: files1.txt
2017-06-17 18:27 X1353135 File Added: files2.PNG
2017-06-17 18:29 X1353135 File Added: search_ansi.PNG
2017-06-17 18:30 X1353135 File Added: search_ansi_result.PNG
2017-06-17 18:30 X1353135 File Added: search_ansi_result.PNG
2017-06-17 18:32 X1353135 File Added: search_ucs2le.PNG
2017-06-17 18:35 X1353135 File Added: search_ucs2le_relust.PNG
2017-06-17 18:38 X1353135 File Added: search_utf8_mbchar.PNG
2017-06-17 18:44 X1353135 File Added: search_utf8_mbchar_result.PNG
2017-06-17 18:49 X1353135 File Added: search_ucs2le_mbchar.PNG
2017-06-17 18:53 X1353135 File Added: search_ucs2le_mbchar_result.PNG
2017-06-17 18:57 X1353135 File Added: ucs2le.txt
2017-06-17 19:00 X1353135 File Added: utf8.txt
2017-06-17 19:16 X1353135 File Added: cp932.txt
2017-06-17 19:21 X1353135 File Added: cp936.txt
2017-06-17 19:29 X1353135 File Added: 0001865.zip
2017-06-20 19:24 X1353135 Note Added: 0002288