Tue Feb 19 02:08:20 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbio.c (mb_new_pos_info): Re-allocation of vector of
``mb_pos_info_t'' corrupted applications using ``mb_info_t''
struct returned by this function.
(mb_new_pos_info): ``mb_pos_info_t'' struct was not initialized
Mon Feb 18 03:40:08 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_conv_for_specific_ces): Add check whether ``ces''
Sun Jan 27 02:40:31 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb.h: Definition of ``mb_conv_t'' is modified.
* mb2iso.c (mb_conv_for_specific_ces): New API to implement CES
specific conversion function.
Sat Jan 26 19:53:50 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbfind.c (mb_{mem,str}_to_wstr): New APIs for block conversion
of multi-octet string to wide string.
Mon Jan 14 02:02:43 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* CCS conversion functions are now passed a sequence of wide
characters instead of a single wide character.
Fri Jan 11 02:00:29 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_encode): Read point in buffer of ``mb_info_t''
struct was not updated properly.
Thu Jan 10 22:12:58 2002 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_decode): New API to provide reverse mapping of
* iso2mb.c (mb_encode): New API to encode input from a stream into
wide character sequence or into ``x-moe-internal'' encoded octet
* Bit width of internal wide character is changed to 24.
Following this change, ``x-moe-internal'' has become an encoding
of at most 4 octets.
Thu Dec 13 01:32:57 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* btri.c (main): Add support for case sensitive string table.
Thu Dec 6 00:29:01 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: UTF-16{,BE,LE} are removed from autodetection
candidates (including CESs incompatible with US-ASCII is harmful).
* mbinit.c: UTF-16{,BE,LE} are included in autodetection
* altmalloc.{c,h}: Obsolete files were included.
* iso2mb.c (mb_cs_detector_read, mb_getmem, mb_getline):
Wed Dec 5 02:17:15 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_mkunbound_cs_detector): Clear flag
``MB_FLAG_UNKNOWNCS''.
(mb_cs_detector_read): Clear flag ``MB_FLAG_UNKNOWNCS'' when
(mb_bind_cs_detector): Set flag ``MB_FLAG_UNKNOWNCS''.
Thu Nov 29 14:24:37 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso.pl: Stop permitting line-breaking after a character
other than a kind of space or a CJK character.
Thu Nov 8 14:25:09 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* altmalloc.c (alt_{set_atomic_allocater,call_malloc_atomic}): New
Thu Aug 30 19:32:00 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile: Options ``v'' (to increase verbosity) were removed
from some commands (based on report from Ambrose Li
Fri Jun 15 22:42:24 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* eaw_ucswidth.pl: Data format of EastAsianWidth.txt of Unicode
3.1 was not processed properly.
Wed Jun 13 21:51:31 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbfind.c: New width table ``eaw_a2w_ucswidth'' of Unicode
characters based on EastAsianWidth.txt is introduced, and original
table ``eaw_ucswidth'' is renamed to ``eaw_a2n_ucswidth'' (based
on report from Hideyuki SHIRAI <shirai@rdmg.mgcs.mei.co.jp>).
* eaw_ucswidth.pl: Criterion for whether a codepoint should be
registered was wrong.
Tue Jun 5 01:43:25 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbcesdefs.pl: New CES ``TIS-620''.
Sat Jun 2 00:57:50 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_iso2022_SSL_encoder): Mapping between final octet
of 7bit single shift sequence and intermediate buffer was wrong
(report from Akihiro Arisawa <ari@mbf.sphere.ne.jp>).
Tue May 29 13:50:00 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_fetch_wchar): When compiled with GCC, the body was
not included in libmoe, which might cause compilation failure of
programs using libmoe (report from Kenzi Cano
<kc@furukawa.ch.kagu.sut.ac.jp>).
Mon May 28 02:05:28 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso-lib.pl (make_map): Regexp to check which encoding's
mapping table a file contains was too strict, and failed to
find the windows-1253 <-> Unicode mapping table.
Mon May 21 13:19:14 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Update mapping tables for ``jis0208-to-ucs-extra'' and
``ucs-to-jis0208-extra'' based on the update from Ambrose Li
Sun May 13 13:41:55 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbfind.c (mb_mem_to_wchar): Octet at the beginning of string was
returned unless multiple octet character was detected.
Thu Apr 26 23:07:58 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile: Add description about required changes for Solaris 7
(information from Hideyuki SHIRAI <shirai@rdmg.mgcs.mei.co.jp>).
Wed Apr 25 21:08:08 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_info2mb): Terminating NUL character was not stored.
(mb_info2mb): Characters were not stored into correct place.
Tue Apr 17 00:29:11 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_vmem2iso_setup): ``mb_init_w()'' was used instead
* ucs-to-iso-lib.pl (unihan): Failed to find line for CNS 11643 in
Mon Apr 16 13:44:28 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso-lib.pl (unihan): New mapping table entry generator
using Unihan.txt of Unicode 3.1.
* mb2iso.c (mb_vmem2b64): ``free()'' was used instead of
Thu Apr 12 23:45:16 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_store_esc_for_char_internal): Escape sequence to
identify UTF-16{,BE,LE} was corrupt.
Thu Mar 15 12:16:22 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* wcrx.c (wcrx_compile_group): Structure representing meta
character ``.'' was destroyed once used, which caused segmentation
fault when the structure was used next (report from Kazuhiko
<kazuhiko@ring.gr.jp>).
* uirx.c (uirx_match): NFA remains unchanged when match have
Wed Mar 14 18:00:02 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* uirx.h: Introduce configurable type ``uirx_wc_t'' (after
suggestion by Hironori Sakamoto <h-saka@lsi.nec.co.jp>).
Sun Mar 11 11:56:46 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Add new converters ``jis0208-to-ucs-extra'' and
``ucs-to-jis0208-extra'' using the mapping table from Ambrose Li
* wcrx.c: Character class is compiled into one leaf in parse tree.
Sat Mar 10 12:13:23 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* uirx.c (uirx_match): Member ``posflag'' of structure
``uirx_nfa_t'' was not initialized correctly.
Sun Mar 4 14:06:52 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* wcrx.c: Remove codes using binary tri.
(wcrx_compile_group): Epsilon transition at end of group was not
properly processed when closing parenthesis was explicitly
Wed Feb 28 01:27:11 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbenclib.pl: {En,de}coder table for internal multiple octet
* btri.c (btri_uint_optimize): Calculation of max. key value of
Tue Feb 27 02:59:24 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* wcrx.c (wcrx_compile_class_macro_negate_class_body,
wcrx_compile_class_negate_macro_negate_class_body): Both
``bof_char'' and ``eof_char'' are always excluded.
* Add support for case insensitive regular expression search.
* mbfind.c (mb_mem_to_wchar): Position of first octet was not
Mon Feb 26 23:21:45 2001 Fumitoshi UKAI <ukai@debian.or.jp>
* mb2iso.c (mb_vprintf): Fix parameter type mismatch (``NULL'' vs.
Mon Feb 26 01:06:29 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* btri.c (main): Add new source type 'u' to generate
``butri_uint_tab_t'' table.
* New structure ``btri_uint_opt_tab_t'' to generate packed binary
tri with key of unsigned int type. Members ``count'' and ``max''
of ``btri_uint_tab_t'' are removed.
* Bit width of wide character is expanded to 32 again.
* Layout of structures related to regular expressions is changed
according to when their values are determined --
at matching, at parsing, or never changed.
Sun Feb 25 01:31:33 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* btri.c (str2uintkey): Accept string of C's character literal
* mbconv.c (chomp): Recognize CRLF as end of line as well as LF
and remove them from each line.
Sat Feb 24 22:13:06 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Support for regular expression is re-implemented as generic API.
Thu Feb 22 21:37:05 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Add support for regular expression over unsigned int.
Wed Feb 14 11:21:23 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso-lib.pl (uni2gbk2k): Some mapping of GBK might not be
Tue Feb 13 22:11:46 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* libmoe.shtml: Updated after implementation.
* mbenclib.pl: Character map ``CL'' stops recognizing codes such
as NUL which do not appear in usual text streams.
* Add support for UTF-16.
Sun Feb 11 02:09:08 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: ``cn-gb'' in candidates of CES auto detection is
changed to ``x-gb-18030-2000''.
* mbcesdefs.pl: Add new CESs ``GBK'' and ``GBK2K''.
Sat Feb 10 02:41:12 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Some mapping tables are re-implemented using binary tri again.
* Dense sub trees of binary tri are converted to array.
* ucs-to-iso-lib.pl (uni2jis): Did not register entry for
``HALFWIDTH KATAKANA MIDDLE DOT'' because it had been assumed that
all lines are composed of single tab separated columns.
Wed Feb 7 04:25:56 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Stop special treatment of mapping of JIS double octet
* Some mapping tables are re-implemented using array.
Tue Feb 6 22:18:50 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Fix mapping table of BIG5 into Unicode based on information from
Ambrose Li <acli@ada.dhs.org>.
Sun Feb 4 02:57:45 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Bit width of wide character is restricted to 24.
* Internal multiple octet encoding has been changed to that of
fixed length with name ``x-moe-internal''.
Sat Feb 3 02:23:56 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* btri.c (btri_pack_uint_tab): New API to generate packed binary
tri with key of unsigned int type.
Sat Jan 27 02:03:26 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb.h (MB_NCPROPS): Introduce new macro which shows number of
Fri Jan 26 01:21:11 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_force_flush_buffer): Fix possibility of losing part of
Thu Jan 25 02:49:24 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb.h (mb_mem_to_wchar_internal): Resembled to ``mbtowc()'' in
Tue Jan 23 00:00:51 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Remove codes to check failure of binary search in case the
* Makefile: Introduce extra part of version string to add
auxiliary information such as prerelease version.
* mb.h (mb_store_esc_for_char): Re-implemented as macro.
(mb_fetch_wchar): Defined as inline function if GNU C is used.
* btri.c (btri_map_max_smaller and btri_map_min_larger): More
strict check whether retrieval should be continued.
* Prefix ``ISO2MB_'' is changed to ``MOE_'' or ``MB_''.
Mon Jan 22 22:30:23 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* btri.h: Prototypes of ``btri_add_uint_n_to_1()'' and
``btri_add_uint_n_to_n()'' were missing.
* Simplify decoder from internal encoding.
Sun Jan 21 13:36:34 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* libmoe.shtml: Add description about development version.
* iso2mb.c (mb_cs_try_detect): Destroyed contents in input buffer of
original ``mb_info_t'' structure (report from Kazuhiko
<kazuhiko@ring.gr.jp>).
* mbenclib.pl (make_cm_matrix): Encoder tables were not correctly
made for NON-ISO 2022 encodings (report from Kazuhiko
<kazuhiko@ring.gr.jp>).
Sat Jan 20 02:20:10 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_cs_try_detect): Wide character weight table is
re-implemented using binary tri.
* mbwcweight.pl: Range of katakana codepoints in EUC encoding was
* EUC_JISX0213_PACKED: New CES.
* Binary tri is re-implemented as an ordinary PATRICIA tree.
Thu Jan 18 21:13:16 2001 Kazuhiko <kazuhiko@ring.gr.jp>
* Makefile: Semicolons are added to tail of last statements in
* btri.c: Includes ``errno.h''.
Tue Jan 16 00:37:10 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_conv_from_jisx0213_internal and
mb_conv_to_jisx0213): New converters.
Sun Jan 14 00:44:04 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* btri.c (btri_copy): New implementation.
* btri.h: API names are changed:
btri_uc_cmp() --> btri_uchar_cmp(),
btri_uc_ci_cmp() --> btri_uchar_ci_cmp().
* btri.c: New APIs: btri_uc_cmp(), btri_uc_ci_cmp(), and
* mbfind.c: API names are changed:
mb_wcv_width() --> mb_wmem_width(),
mb_wcstr_width() --> mb_wstr_width().
mb_str_width_n(), mb_wstr_width_n().
Sat Jan 13 03:46:39 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb2iso.c (mb_store_char): Return value was length of decoded
multiple octet form, which should be length of ``fake'' UTF-8
* iso2mb.c (mb_escfun_cs_96): Processing of escape sequence to
designate 96 set was broken.
Fri Jan 12 13:15:39 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Stop implicit invocation of user defined encoding conversion.
* mbcesdefs.pl: Range of acceptable codepoints for ``x-ctext'' was
* mb.h (mb_str_to_wchar_internal): 1st octet of 2 octet UTF-8
encoded character was not processed correctly.
* ucs-to-iso.pl: Initialization data of default converter table
Properties of some punctuation characters in JISC6226 or in
JISX0208 are not recorded.
Thu Jan 11 03:31:38 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_cs_try_detect_from_mem): New API to detect CES from
byte sequence on main memory.
* mbenclib.pl (make_named_encoder): Codepoint of 2^7 set character
was calculated as 2^15 set character.
* mb.h (MB_SBC_LOWER): Was assigned with ``MB_96_LOWER'' which
should be ``MB_96_UPPER''.
Wed Jan 10 19:30:07 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb.h (MB_UTF8_TO_WCHAR): Accept single octet.
(mb_wchar_to_badutf8): Comparison is performed in ``binary search
(mb_str_to_wchar_internal): Wide character is processed by
* btri.c (btri_copy): Was completely broken.
* iso2mb.c (mb_vmem2mb_setup): Order of initialization of members
* mbfind.c (mb_str_to_wchar): ``*p_s'' was not updated correctly.
Wed Jan 10 00:08:04 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Change all the prefix ``mb_bt'' of binary tri related objects to
* mb.h: Prototype of ``mb_flush_auxbuf()'' is added.
Prototype of ``mb_cs_try_detect()'' is added.
Tue Jan 9 21:55:39 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbbtri.h: Prototype of ``mb_btri_copy()'' is added.
* mb2iso.c (mb_flush_auxbuf): Separate codes to flush auxbuf from
Mon Jan 8 13:26:49 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile (INSTLIBEXEC): ``mk_btri'' is installed into
``${DESTDIR}${LIBEXECDIR}''.
* mb.h (MB_SBC_ESC_ENC and MB_DBC_ESC_ENC): Change arithmetic
minus operation to logical and.
Sun Jan 7 03:46:28 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* The function ``mb_char_dec()'' is renamed to
``mb_mem_to_wchar_internal()'' and implemented as macro.
* Some functions are implemented as macros
(``mb_wchar_to_{badutf8,utf8,mbc}()'').
* New APIs (``mb_str_to_wchar()'', ``mb_str_to_prop()'').
* Some APIs are renamed (``mb_find_wchar()'' -->
``mb_mem_to_wchar()'', ``mb_mbc_prop()'' -->
``mb_mem_to_prop()'').
Sat Jan 6 01:15:31 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_escfun_cs_utf8): Was broken.
* mbfind.c (mb_wchar_prop): Wrong property might be returned for
* mb2iso.c (mb_conv_f2h and mb_conv_h2f): New converters between
halfwidth and fullwidth characters.
(mb_store_wchar): Converters of (mb_info_t *) are called before
those of (mb_ces_t *).
* mbbtri.c (mb_bt_search): Went down to right half of each node
even if it had no valid value.
* mb.h (MB_WORD_UCS_DEC): Decoder of UCS was broken.
* mb2iso.c (mb_conv_to_ucs): Return value was garbage when
* iso2mb.c (mb_encode_to_wchar): Conversion is now applied to
characters from input stream.
* Treatment of ASCII was different from what I had intended.
* Change treatment of JIS multiple octet characters.
* mbbtri.c (mb_btri_add_uint_n_to_n and mb_btri_add_uint_n_to_1):
Calculation of key range was wrong.
Fri Jan 5 01:41:01 2001 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbbtri.c (output_bt): Counting of necessary words for leaf node
(mb_btri_uint_optimize): Criterion for whether 2 subnodes may be
joined into one was wrong.
* mbconv.c (opt_help): Shows library version.
* Remove an obsolete header file.
* Improve setup of LD_LIBRARY_PATH in test script.
* Fix some mistakes in documentation.
* Start of new project ``libmoe''.
Thu Dec 7 01:25:02 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* cprop.pl: ``MB_CPROP_MAY_BREAK'' flag was not set for HIRAGANA,
KATAKANA, KANA, FULLWIDTH, or IDEOGRAPHIC punctuation character.
Tue Nov 28 01:51:43 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso-lib.pl (ucs2isov): Add sequence to designate right
half of iso-8859-1 to G1 after another character set is designated.
Mon Nov 27 21:06:36 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* preconv.h and notascii.h: Re-generated.
* ucs-to-iso-lib.pl: Fix some typos.
* ucs-to-iso-lib.pl (ucs2isov): Some fixes to follow changes
to the JIS related mapping table.
Sun Nov 26 00:59:03 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile (big_headers): New target to generate big headers.
* mbfind.c (mb_charprop): Wrong values were returned for
characters registered in the property table.
* mbbtri.pl (bt_optimize): Criterion whether two leaf nodes may be
Sat Nov 25 13:21:04 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* jismap.pl: New Perl script to make a mapping table of JIS
codepoints into registration numbers, separated from
* notascii.pl: New Perl script to make a mapping table of
domestic-ascii characters which are not mapped to U+0021 --
U+007E, separated from ucs-to-iso.pl.
* mbbtri.pl (bt_make_c_header): Insert file name and date at
beginning of file to notify CVS that the file is really modified.
* mbbtri.c (mb_bt_search): Wrong value was generated from multiple
* Makefile: Separate target ``install'' into ``install-bin'',
``install-lib'', and ``install-man''.
Sat Nov 25 13:18:05 2000 Kazuhiko <kazuhiko@ring.gr.jp>
* Makefile (DESTDIR): New variable to override top of install
directories regardless of re-definition of individual ones.
Sat Nov 25 02:08:16 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbbtri.c (mb_bt_search): Wrong value was generated from multiple
* mbbtri.pl (bt_encode): Computation of mask for key was wrong.
* Compress conversion table between Unicode and others.
Thu Nov 23 02:57:44 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso.pl: Mapping table of codepoint into JIS registration
number was corrupt (report from Fumitoshi UKAI
<ukai@debian.or.jp>).
* Makefile: Include files are now installed into
${PREFIX}/include/iso2mb (suggestion from Fumitoshi UKAI
<ukai@debian.or.jp>).
* Mapping tables between Unicode and other kinds of characters are
modified to fix JIS X 0208 round trip problem (report from
Fumitoshi UKAI <ukai@debian.or.jp>).
Mon Nov 20 00:10:25 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: Converter sequence bound to charset using ISO 2022
registered character set include ``mb_conv_allg0sl'' as first
* ucs-to-iso-lib.pl (is94x94): Add special treatment of Hangul.
Sun Nov 19 06:12:05 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ucs-to-iso-lib.pl (is94x94): Return true value for Non-ISO 2022
multi octet character set.
* mbfind.c (mb_charprop): More accurate check whether CJK or not.
* mb.h: ``mb_char_t'' and ``mb_echar_t'' are merged into one
structure ``mb_char_t''.
* mbfind.c (mb_charprop): Property of US-ASCII was wrong.
* mb.h: Members of ``mb_char_t'' and of ``mb_echar_t'' other than
code are defined as ``unsigned char''.
* Makefile (install): Always copy files regardless of modified
* cprop.pl: All ideographic characters (in Unicode sense) may break
Sat Nov 18 00:15:41 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* New types ``mb_char_t'' and ``mb_echar_t'' for characters and
* Add new charset "x-fake-utf-8".
* mbinit.c: Position of entry named ``euc-jisx0213'' of charset
* Add new language ``cjk''.
* kr.c (mb_cs_judge_kr): Range of second octet of Johab encoding
* mbconv.c: New option ``--which'' to display charset name of each
Fri Nov 17 01:19:16 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* kr.c (mb_cs_judge_kr): Weights of characters were wrong.
* mb.h: Stop swapping final octets 0x40 and 0x42 of 94x94 set in
* mbinit.c: Fix order of entries in converter table.
Stop hard-coding number of candidates of charset in autodetector
* kr.c, kr.h, ucs-to-iso-lib.pl, and mb128.h: Add support for
* kr.c (mb_cs_judge_kr): Improve autodetection of encoding.
Thu Nov 16 00:12:35 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbconv.pod: Add description about new converters.
Add description about language specifications.
* cn.c, ja.c, and kr.c: Charset remains ``UNKNOWN'' if no
character with MSB set is present.
* mbinit.c: Number of encoding candidates was wrong for
* cn.c (mb_conv_ucs_to_gb): ``mb_conv_ucs_to_cn'' is divided into
``mb_conv_ucs_to_cn'' and ``mb_conv_ucs_to_gb''.
(mb_conv_euc_tw): Final octet was not set correctly.
* mbinit.c: Add new converter ``euc-tw''.
* cn.c: Rename ``mb_conv_euctw'' to ``mb_conv_euc_tw''.
* Makefile (TAB_Hs): Add ``preconv.h''.
* preconv.h: Generate separately from other table headers.
Wed Nov 15 01:33:09 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* cn.c: Add support for ``EUC-TW''.
(mb_cs_setup_cn): ``utf-8'' was returned for ``x-euc-tw''.
(mb_conv_euctw): Characters in CNS 11643 plane 1 are converted to
* Makefile (kr.o): Add dependency upon ``hangul.h''.
* cn.c (mb_cs_judge_cn): Indexing variable was not updated
* kr.c (mb_cs_judge_kr): Indexing variable was not updated
(mb_johab_dec): Generated wrong code for johab code 0x8441 -
Sun Nov 12 12:57:22 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Improve autodetection of character encoding for cn and kr.
* Improve autodetection of character encoding.
Fri Nov 10 02:15:07 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbfind.c (mb_is_cjk): New API to see if Unicode character is CJK
* cprop.pl: Modified to generate binary search table for CJK.
Thu Nov 9 13:00:31 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile: Fix dependence on some headers.
* Properties of punctuation characters are revised.
Wed Nov 8 21:37:16 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Restructuring of Unicode <-> ISO 2022 mapping mainly to fix
problem that there is no mapping information between Unicode and
JIS C 6226-1978 (report from Kazuhiko <kazuhiko@ring.gr.jp>).
Sat Nov 4 21:12:37 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_fetch_char): There was a possibility of failing to
recognize Unicode after conversion was applied.
Fri Nov 3 00:37:26 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: Converters bound to charsets normalize characters
according to Unicode unification.
* ucs-to-iso-lib.pl (set_map): Generate mapping of Shift_JIS and
Shift_JISX0213 into Unicode.
Thu Nov 2 00:25:31 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbfind.c (mb_mbc_proc): New API to look up character property
* cprop.pl: Add new property for space character.
* mbfind.c (mb_charprop): New API to look up character property.
Wed Nov 1 03:23:08 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Remove codes assuming ``char'' is unsigned.
Tue Oct 31 20:11:37 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb128misc.c (mb_conv_ms_latin1): New converter to convert
characters of code point 0x80-0x9F into Microsoft Windows Codepage
Wed Oct 25 06:03:47 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Makefile (PREFIX): Everything is installed into subdirectories of
the directory specified by this macro.
* iso2mb.c (mb_fetch_char): Fix mis-encoding of single octet
character set when designated G2 or G3 and invoked to GR by
Mon Oct 23 06:57:21 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c (mb_charset_to_esc): Member ``cs2esc'' of ``*info'' is
initialized with ``NULL''.
* iso2mb.c (mb_conv_charset): New converter to apply converters in
structure pointed to by member ``cs2esc'' in ``mb_info_t''.
* iso2mb.c (mb_conv_ascii): Table search is performed only for 94
* Makefile (${Os}): Add missing dependence on ``domestic-ascii.h''.
* domestic-ascii.pl: Broken domestic ascii table was generated due
to a variable scope problem.
===== libiso2mb-0.7.1 <-- libiso2mb-0.7.0 =====
Sat Oct 21 12:57:09 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c (mb_namev_to_flag): Table of conversion names was
specified as binary search table of flag names.
* mbconv.c (main): Output stream ``ostream'' was always overridden
Fri Oct 20 13:47:43 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: Fix mistake in conditional compilation of
``to_mb128misc[]''. Add missing converter for charset ``cn-big5''.
* iso2mb.c (mb_conv_ascii): New converter from domestic ascii to
Thu Oct 19 13:40:02 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: Automatic setup of converters for output stream based
Wed Oct 18 03:40:44 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbconv.pod: Updated after C source.
* ucs.c (mb_ucs_to_isoc): Stop generating non ISO 2022 registered
Tue Oct 17 15:36:53 2000 Kazuhiko <kazuhiko@ring.gr.jp>
* mbconv.c and Makefile: Fix compilation and install problems.
Tue Oct 17 02:44:43 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbio.c: Remove codes assuming ``fpos_t'' is arithmetic.
Fri Oct 13 17:08:12 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Improve JIS X 0213 support.
Thu Oct 12 13:37:46 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbconv.pod: Manual page of sample application is provided.
* mbconv.c: Add new option to display help message.
* mbinit.c: Fix conv_tab, flag_tab, and default_cs2esc.
===== libiso2mb-0.7.0 <-- libiso2mb-0.6.3 =====
Wed Oct 11 23:24:06 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* kr.c: Add support for Johab encoding. Add automatic detection
of charset among EUC-KR, Johab, and UTF-8.
* ja.c (mb_cs_judge_ja): Add support for automatic UTF-8
* cn.c: Add automatic detection of charset among CN-GB, Big Five,
Tue Oct 10 11:30:52 2000 Kiyokazu SUTO <suto@merry.xmath.ous.ac.jp>
* Separate Big Five to CNS 11643 conversion code from
===== libiso2mb-0.6.3 <-- libiso2mb-0.6.2 =====
Mon Oct 9 00:12:14 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mb128misc.c (mb_misc_enc): Fix encoding buf.
* Add support for Microsoft Windows Codepage 1250 -- 1258.
===== libiso2mb-0.6.2 <-- libiso2mb-0.6.1 =====
Sun Oct 8 21:39:50 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_set_localized_ascii_table): New API to disable (or
change behavior of) localized ASCII normalization.
* ucs-to-iso-lib.pl: Add entry for "FULLWIDTH REVERSE SOLIDUS" to
mapping table between Unicode and ISO 2022 registered characters.
* iso2mb.c (mb_nonutf8_enc): Fix bug when encoding 94 set
character into "fake" UTF-8.
===== libiso2mb-0.6.1 <-- libiso2mb-0.6.0 =====
Sun Oct 8 00:50:22 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* ja.c (mb_sjis_dec): Fix bug of Shift_JIS decoder.
* ttywidth.pl: Get data from EastAsianWidth.txt.
===== libiso2mb-0.6.0 <-- libiso2mb-0.5.0 =====
Fri Oct 6 03:33:40 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* iso2mb.c (mb_utf8_escenc): Add new flag to avoid conflict
between 7bit single shift and [kx]term's escape sequence.
Thu Oct 5 16:42:05 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Interpret ISO 2022 escape sequence even in UTF-8 mode.
* mbfind.c: Add new API to set up table of character width.
Wed Oct 4 21:10:49 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* Modular implementation of encodings which are neither UTF-8 nor
===== libiso2mb-0.5.0 <-- libiso2mb-0.4.19 =====
Mon Oct 2 23:36:07 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: Add new APIs to set up charset table dynamically.
Mon Oct 2 13:55:46 2000 Kiyokazu SUTO <suto@ks-and-ks.ne.jp>
* mbinit.c: Add charset ``x-euc-jp''.