elinks

mirror of https://github.com/rkd77/elinks.git synced 2024-11-04 08:17:17 -05:00

History

Kalle Olavi Niemitalo 4a5af7fd26 Bug 381: Store codepage-to-Unicode mappings as dense arrays. Previously, each mapping between a codepage byte and a Unicode character was stored as a struct table_entry, which listed both the byte and the character. This representation may be optimal for sparse mappings, but codepages map almost every possible byte to a character, so it is more efficient to just have an array that lists the Unicode character corresponding to each byte from 0x80 to 0xFF. The bytes are not stored but rather implied by the array index. The tcvn5712 and viscii codepages have a total of four mappings that do not fit in the arrays, so we still use struct table_entry for those. This change also makes cp2u() operate in O(1) time and may speed up other functions as well. The "sed \| while read" concoction in Unicode/gen-cp looks rather unhealthy. It would probably be faster and more readable if rewritten in Perl, but IMO that goes for the previous version as well, so I suppose whoever wrote it had a reason not to use Perl here. Before: text data bss dec hex filename 38948 28528 3311 70787 11483 src/intl/charsets.o 500096 85568 82112 667776 a3080 src/elinks After: text data bss dec hex filename 31558 28528 3311 63397 f7a5 src/intl/charsets.o 492878 85568 82112 660558 a144e src/elinks So the text section shrank by 7390 bytes. Measured on i686-pc-linux-gnu with: --disable-xbel --disable-nls --disable-cookies --disable-formhist --disable-globhist --disable-mailcap --disable-mimetypes --disable-smb --disable-mouse --disable-sysmouse --disable-leds --disable-marks --disable-css --enable-small --enable-utf-8 --without-gpm --without-bzlib --without-idn --without-spidermonkey --without-lua --without-gnutls --without-openssl CFLAGS="-Os -ggdb -Wall"		2006-09-24 16:55:29 +03:00
..
7bit.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
7bitrepl.lnx	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_1.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_2.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_3.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_4.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_5.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_6.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_7.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_8.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_9.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_10.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_13.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_14.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_15.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
8859_16.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
cp437.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
cp737.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
cp850.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
cp852.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
cp866.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
cp1125.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
cp1250.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
cp1251.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
cp1252.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
cp1256.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
cp1257.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
entities.txt	Hmm, seem b.delta decided not to become 0x03B4 like it should	2006-01-10 15:39:11 +01:00
gen	Remove now useless $Id: lines.	2005-10-21 09:14:07 +02:00
gen-7b	Remove now useless $Id: lines.	2005-10-21 09:14:07 +02:00
gen-case	UTF-8: New function unicode_fold_label_case and a related script.	2006-08-06 20:02:42 +00:00
gen-cp	Bug 381: Store codepage-to-Unicode mappings as dense arrays.	2006-09-24 16:55:29 +03:00
gen-ent	Skip entities with unknown unicode (0x????)	2006-01-03 17:12:58 +01:00
index.txt	Internally rename the utf_8 codepage to utf8.	2006-09-17 16:23:17 +03:00
kamen.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
koi8_r.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
koi8_ru.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
koi8_u.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
mac_lat2.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
macroman.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
README	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
tcvn5712.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00
tr7bit.awk	Remove now useless $Id: lines.	2005-10-21 09:14:07 +02:00
utf8.cp	Internally rename the utf_8 codepage to utf8.	2006-09-17 16:23:17 +03:00
viscii.cp	Initial commit of the HEAD branch of the ELinks CVS repository, as of	2005-09-15 15:58:31 +02:00

README

This contains charset definitions and convert tables. If you'll change
anything, run ./gen - it will regenerate the header files and place them inside
src/intl/.