View Issue Details
ID | Project | Category | View Status | Date Submitted | Last Update |
---|---|---|---|---|---|
0001865 | Double Commander | File operations | public | 2017-06-17 18:26 | 2017-06-20 19:24 |
Reporter | X1353135 | Assigned To | |||
Priority | normal | Severity | major | Reproducibility | always |
Status | new | Resolution | open | ||
Projection | none | ETA | none | ||
Platform | MS Windows | OS | MS Windows | OS Version | 8.1 |
Product Version | 0.7.8 | Product Build | 2017/02/24 | ||
Summary | 0001865: Multi byte character can not be searched by regex correctly | ||||
Description | The regex pattern of D.F can not find text of DEF in files of ucs2 little endian encoding regardless of the selecting ucs2le on search dialog box. -> see search_ucs2le.PNG The regex pattern of 一.三 can not find text of 一.三 in files of utf8 encoding regardless of the selecting utf8 on search dialog box. -> search_utf8_mbchar.PNG. | ||||
Steps To Reproduce | See attached screenshots | ||||
Tags | No tags attached. | ||||
Attached Files | files1.txt (1,104 bytes)
$ find . -type f -print -exec xxd {} \; ./ucs2le.txt 00000000: 4100 4200 4300 0a00 4400 4500 4600 0a00 A.B.C...D.E.F... 00000010: 4700 4800 4900 0a00 004e 8c4e 094e 0a00 G.H.I....N.N.N.. 00000020: db56 944e 6d51 0a00 034e 6b51 5d4e 0a00 .V.NmQ...NkQ]N.. 00000030: 6d51 944e db56 0a00 f158 105f c253 0a00 mQ.N.V...X._.S.. 00000040: f958 308d c153 0a00 .X0..S.. ./utf8.txt 00000000: 4142 430a 4445 460a 4748 490a e4b8 80e4 ABC.DEF.GHI..... 00000010: ba8c e4b8 890a e59b 9be4 ba94 e585 ad0a ................ 00000020: e4b8 83e5 85ab e4b9 9d0a e5a3 b1e5 bc90 ................ 00000030: e58f 820a e5a3 b9e8 b4b0 e58f 810a .............. ./cp936.txt 00000000: 4142 430a 4445 460a 4748 490a d2bb b6fe ABC.DEF.GHI..... 00000010: c8fd 0acb c4ce e5c1 f90a c6df b0cb bec5 ................ 00000020: 0ad2 bcb7 a1c8 fe0a ........ ./cp932.txt 00000000: 4142 430a 4445 460a 4748 490a 88ea 93f1 ABC.DEF.GHI..... 00000010: 8e4f 0a8e 6c8c dc98 5a0a 8eb5 94aa 8be3 .O..l...Z....... 00000020: 0a88 eb93 f38e 510a ......Q. | ||||
Fixed in Revision | |||||
Operating system | Windows | ||||
Widgetset | |||||
Architecture | 32-bit | ||||
|
This is a fix of the bug description: The regex pattern of D.F can not find text of DEF in files of ucs2 little endian encoding although user selects ucs2le as encoding on the search dialog box. -> See search_ucs2le.PNG The regex pattern of 一.三 can not find text of 一二三 in files of utf8 encoding although user selects utf8 as encoding on search dialog box. -> See search_utf8_mbchar.PNG. |
Date Modified | Username | Field | Change |
---|---|---|---|
2017-06-17 18:26 | X1353135 | New Issue | |
2017-06-17 18:26 | X1353135 | File Added: files1.txt | |
2017-06-17 18:27 | X1353135 | File Added: files2.PNG | |
2017-06-17 18:29 | X1353135 | File Added: search_ansi.PNG | |
2017-06-17 18:30 | X1353135 | File Added: search_ansi_result.PNG | |
2017-06-17 18:30 | X1353135 | File Added: search_ansi_result.PNG | |
2017-06-17 18:32 | X1353135 | File Added: search_ucs2le.PNG | |
2017-06-17 18:35 | X1353135 | File Added: search_ucs2le_relust.PNG | |
2017-06-17 18:38 | X1353135 | File Added: search_utf8_mbchar.PNG | |
2017-06-17 18:44 | X1353135 | File Added: search_utf8_mbchar_result.PNG | |
2017-06-17 18:49 | X1353135 | File Added: search_ucs2le_mbchar.PNG | |
2017-06-17 18:53 | X1353135 | File Added: search_ucs2le_mbchar_result.PNG | |
2017-06-17 18:57 | X1353135 | File Added: ucs2le.txt | |
2017-06-17 19:00 | X1353135 | File Added: utf8.txt | |
2017-06-17 19:16 | X1353135 | File Added: cp932.txt | |
2017-06-17 19:21 | X1353135 | File Added: cp936.txt | |
2017-06-17 19:29 | X1353135 | File Added: 0001865.zip | |
2017-06-20 19:24 | X1353135 | Note Added: 0002288 |