elinks

mirror of https://github.com/rkd77/elinks.git synced 2024-11-04 08:17:17 -05:00

Author	SHA1	Message	Date
Kalle Olavi Niemitalo	d65f41faab	Bug 1069: Add to NEWS	2009-02-28 16:30:55 +02:00
Kalle Olavi Niemitalo	1487d206db	Bug 1069: Revert "1031: JS_SetErrorReporter only once per JSRuntime." This reverts commit `b94657869b`. I don't know where I got the idea that JS_SetErrorReporter affects the entire JSRuntime, rather than only the provided JSContext. The people on #jsapi say it has never worked that way.	2009-02-26 22:56:33 +02:00
Miciah Dashiel Butler Masters	e9370fe5b9	Comment the last change	2009-02-25 01:54:36 +00:00
Miciah Dashiel Butler Masters	f11b2a8f97	utf8_to_jsstring: Don't free mem handed over to JS In utf8_to_jsstring, do not free the string that is passed to JS_NewUCString if the latter is successful; if it is, SpiderMonkey handles the memory from then on. Use libc routines instead of ELinks's routines to allocate and free the string so that ELinks's memory debugging code does not try to keep track of it after it has been handed to SpiderMonkey. This commit fixes a bug introdued in `97d72d15a0`.	2009-02-25 01:45:56 +00:00
Witold Filipczyk	9b9f5d9973	Fixed issue with EL_CONFIG_OPTIONAL_LIBRARY and features.conf Settings of features.conf were ignored by the EL_CONFIG_OPTIONAL_LIBRARY. (cherry picked from commit `3e8f774659`)	2009-02-23 12:17:29 +01:00
Kalle Olavi Niemitalo	39707c4102	Rewrap protocol.http.compression doc to 59 columns This makes the option manager display it much better on an 80x24 terminal. Alternatively, the "\n" newline characters within paragraphs could have been removed entirely. ELinks would then have line-wrapped the text to the appropriate width in the info window of the option manager, but unfortunately not in --config-help.	2009-02-22 21:28:14 +02:00
Kalle Olavi Niemitalo	9a186a45e8	NEWS about decompression	2009-02-22 20:55:06 +02:00
Kalle Olavi Niemitalo	2a7346d371	Tone down the protocol.http.compression warning AFAIK, all bugs in it have been fixed. Some bugs may still be lurking but they are more likely to get caught if compression is enabled. I also replaced COMP_NOTE with static text because xgettext does not support macros in the argument of N_. (cherry picked from commit `3a9b5d091d`)	2009-02-22 20:55:05 +02:00
Petr Baudis	a7f94dbbd1	Introduce protocol.http.compression knob When disabled, no Accept-Encoding header is sent. (cherry picked from commit `d4cec950ec`) Conflicts: src/protocol/http/http.c	2009-02-22 20:17:53 +02:00
Witold Filipczyk	851d35aa1d	CONFIG_LZMA disabled in features.conf.	2009-02-22 16:42:59 +01:00
Witold Filipczyk	66ea059732	Merge branch 'elinks-0.12' of git+ssh://pasky.or.cz/srv/git/elinks into elinks-0.12	2009-02-22 14:04:56 +01:00
Witold Filipczyk	0787b95634	- lzma disabled by default. (cherry picked from commit `56b7d38a28`)	2009-02-22 14:04:06 +01:00
Miciah Dashiel Butler Masters	84259ff26a	Fix crash on search-toggle-regex when RE disabled Check the return value of get_opt_rec on "document.browse.search.regex" before dereferencing it. The option is not there if regular expression support is disabled at build time. This commit fixes a bug introduced in commit b2d51c75ff0d6c52a4f6a2761801beb641cba3a2.	2009-02-22 04:06:51 +00:00
Witold Filipczyk	4a2fd2d964	Fallback to the raw deflate only when nothing was decompressed so far. It lets view the site from bug 1017. (cherry picked from commit `3131de4767`) Conflicts: src/protocol/http/http.c	2009-02-21 14:12:03 +01:00
Witold Filipczyk	53ab6d493e	bug 1068: Decompress data when the socket is closed. The reasons why the decompression failed: - the server gave wrong Content-Length - the socket was closed	2009-02-21 10:36:44 +01:00
Kalle Olavi Niemitalo	c8cee1df61	bug 1067: More verbose NEWS.	2009-02-15 05:02:43 +02:00
Kalle Olavi Niemitalo	d14f65a331	bug 1067: Comments about freeing the DOM document node.	2009-02-15 04:27:39 +02:00
Kalle Olavi Niemitalo	eb820a57a6	bug 1067: Assertions and comments about done_dom_node(). In bug 1067, dom_rss_pop_document() freed a node with done_dom_node() even though call_dom_node_callbacks() was still using that node. This made call_dom_node_callbacks() read a function pointer from beyond the end of an array and call that. Add assertions to detect out-of-range node types, and comments to warn about the bug.	2009-02-15 03:39:00 +02:00
Witold Filipczyk	a7c2f14e6d	bug 1067: the node was freed, but still used.	2009-02-12 09:48:04 +01:00
Kalle Olavi Niemitalo	c53e6335a1	Mention bug 761 in NEWS.	2009-02-09 00:24:13 +02:00
Kalle Olavi Niemitalo	7067fc7af9	Check for JS_ReportAllocationOverflow before using it. Debian libmozjs-dev 1.9.0.4-2 has JS_ReportAllocationOverflow but js-1.7.0 reportedly hasn't. Check at configure time whether that function is available. If not, use JS_ReportOutOfMemory instead. Reported by Witold Filipczyk.	2009-02-08 23:07:22 +02:00
Kalle Olavi Niemitalo	7941c7097a	Bug 1060: Document the need for TRE.	2009-02-08 18:55:15 +02:00
Kalle Olavi Niemitalo	63a362ee53	Bug 1060: Try TRE_LIBS=-ltre if pkg-config tre fails. This works around Debian bug 513055 in libtre-dev.	2009-02-08 18:26:22 +02:00
Witold Filipczyk	664048098a	Bug 1060: #undef HAVE_TRE_REGEX_H only in elinks.h I didn't read the code of the tre library, but I suppose that when sizes of wchar_t and unicode_val_T are equal it will work fine. [ From bug 1060 attachment 508. --KON ]	2009-02-08 18:26:22 +02:00
Witold Filipczyk	c5a7f87c43	Bug 1060: Use libtre for regexp searches. When the user tells ELinks to search for a regexp, ELinks 0.11.0 passes the regexp to regcomp() and the formatted document to regexec(), both in the terminal charset. This works OK for unibyte ASCII-compatible charsets because the regexp metacharacters are all in the ASCII range. And ELinks 0.11.0 doesn't support multibyte or ASCII-incompatible (e.g. EBCDIC) charsets in terminals, so it is no big deal if regexp searches fail in such locales. ELinks 0.12pre1 attempts to support UTF-8 as the terminal charset if CONFIG_UTF8 is defined. Then, struct search contains unicode_val_T c rather than unsigned char c, and get_srch() and add_srch_chr() together save UTF-32 values there if the terminal charset is UTF-8. In plain-text searches, is_in_range_plain() compares those values directly if the search is case sensitive, or folds them to lower case if the search is case insensitive: with towlower() if the terminal charset is UTF-8, or with tolower() otherwise. In regexp searches however, get_search_region_from_search_nodes() still truncates all values to 8 bits in order to generate the string that search_for_pattern() then passes to regexec(). In UTF-8 locales, regexec() expects this string to be in UTF-8 and can't make sense of the truncated characters. There is also a possible conflict in regcomp() if the locale is UTF-8 but the terminal charset is not, or vice versa. Rejected ways of fixing the charset mismatches: * When the terminal charset is UTF-8, recode the formatted document from UTF-32 to UTF-8 for regexp searching. This would work if the terminal and the locale both use UTF-8, or if both use unibyte ASCII-compatible charsets, but not if only one of them uses UTF-8. * Convert both the regexp and the formatted document to the charset of the locale, as that is what regcomp() and regexec() expect. ELinks would have to somehow keep track of which bytes in the converted string correspond to which characters in the document; not entirely trivial because convert_string() can replace a single unconvertible character with a string of ASCII characters. If ELinks were eventually changed to use iconv() for unrecognized charsets, such tracking would become even harder. * Temporarily switch to a locale that uses the charset of the terminal. Unfortunately, it seems there is no portable way to construct a name for such a locale. It is also possible that no suitable locale is available; especially on Windows, whose C library defines MB_LEN_MAX as 2 and thus cannot support UTF-8 locales. Instead, this commit makes ELinks do the regexp matching with regwcomp and regwexec from the TRE library. This way, ELinks can losslessly recode both the pattern and the document to Unicode and rely on the regexp code in TRE decoding them properly, regardless of locale. There are some possible problems though: 1. ELinks stores strings as UTF-32 in arrays of unicode_val_T, but TRE uses wchar_t instead. If wchar_t is UTF-16, as it is on Microsoft Windows, then TRE will misdecode the strings. It wouldn't be too hard to make ELinks convert to UTF-16 in this case, but (a) TRE doesn't currently support UTF-16 either, and it seems possible that wchar_t-independent UTF-32 interfaces will be added to TRE; and (b) there seems to be little interest on using ELinks on Windows anyway. 2. The Citrus Project apparently wanted BSD to use a locale-dependent wchar_t: e.g. UTF-32 in some locales and an ISO 2022 derivative in others. Regexp searches in ELinks now do not support the latter. [ Adapted to elinks-0.12 from bug 1060 attachment 506. Commit message by me. --KON ]	2009-02-08 18:26:22 +02:00
Kalle Olavi Niemitalo	264a66fe4d	bug 153: UTF-8 bookmark.title has been fully implemented. Mention it in NEWS too.	2009-02-08 18:26:21 +02:00
Kalle Olavi Niemitalo	311d95358d	bug 153, 1066: Convert bookmarks to/from UTF-8 when searching.	2009-02-08 18:26:21 +02:00
Kalle Olavi Niemitalo	8c0fa7f09c	bug 153, 1066: Convert strings to edit-bookmark dialog from UTF-8.	2009-02-08 18:26:21 +02:00
Kalle Olavi Niemitalo	5a29dbc4a1	bug 153, 1066: Convert strings to bookmark info dialog from UTF-8.	2009-02-08 18:26:20 +02:00
Kalle Olavi Niemitalo	b3acd2a5bc	bug 153: Convert titles in bookmark manager from UTF-8.	2009-02-08 18:26:20 +02:00
Kalle Olavi Niemitalo	b3f9d48bba	bug 153, 1066: Convert strings from add-bookmark dialogs to UTF-8. In src/bookmarks/dialogs.c, do_add_bookmark() gets the title and URL in the terminal charset and needs to know which one that is. When a bookmark is being added, save the struct terminal * to dialog.udata2 and read the charset from there. When a bookmark is being edited, dialog.udata2 is needed for the struct bookmark , but there we always have the parent struct dialog_data in dialog.udata and can get the terminal from that.	2009-02-08 18:26:19 +02:00
Kalle Olavi Niemitalo	b432b735e4	bug 1066: Attempt to convert -remote addBookmark(URL) to UTF-8. Currently, it is not clear which codepage is used in struri(). Assume it is the system codepage.	2009-02-08 18:26:19 +02:00
Kalle Olavi Niemitalo	99d1269bc5	bug 153, 1066: Convert session-snapshot bookmarks to/from UTF-8. These functions now expect or return strings in UTF-8: delete_folder_by_name (sneak in a const, too), bookmark_terminal_tabs, open_bookmark_folder, and get_auto_save_bookmark_foldername_utf8 (new function).	2009-02-08 18:26:19 +02:00
Kalle Olavi Niemitalo	11acd03eb2	Use update_bookmark() in SMJS bookmark object. When setting the title or URL of a bookmark from SMJS user scripting, use update_bookmark() instead of writing directly to struct bookmark. It triggers the bookmark-update event and sets the bookmarks_dirty flag.	2009-02-08 18:26:18 +02:00
Kalle Olavi Niemitalo	97d72d15a0	bug 153, 1066: Convert properties of SMJS bookmark to/from UTF-8. SpiderMonkey uses UTF-16 and the strings in struct bookmark are in UTF-8. Previously, the conversions behaved as if the strings had been in ISO-8859-1. SpiderMonkey also supports JS_SetCStringsAreUTF8(), which would make the existing functions convert between UTF-16 and UTF-8, but that effect is global so I dare not enable it yet. Besides, I don't know if that function works in all the SpiderMonkey versions that ELinks claims to work with.	2009-02-08 18:26:18 +02:00
Kalle Olavi Niemitalo	03b112796d	bug 153, 1066: Add codepage parameter to update_bookmark(). This also makes the bookmark-update event carry strings in UTF-8. The only current consumer of that event is bookmark_change_hook(), which ignores the strings, so no changes are needed there.	2009-02-08 18:26:18 +02:00
Kalle Olavi Niemitalo	73f925ce21	bug 153, 1066: Convert XBEL bookmarks to/from UTF-8. When the file is being read, Expat provides the strings to ELinks in UTF-8, so ELinks can put them in struct bookmark without conversions. Make sure gettext returns any placeholder strings in UTF-8, too. Replace '\r' with ' ' in bookmark titles and URLs. When the file is being written, put encoding="UTF-8" in the XML declaration, and then write out the strings from struct bookmark without character set conversions. Do replace some characters with entity references though, by calling add_html_to_string().	2009-02-08 18:26:04 +02:00
Kalle Olavi Niemitalo	8c0ae2a215	bug 153, 1066: Convert ~/.elinks/bookmarks to/from UTF-8. The ~/.elinks/bookmarks file is in the system charset, for compatibility with earlier ELinks releases, but internally the strings are in UTF-8.	2009-01-24 14:38:59 +02:00
Kalle Olavi Niemitalo	1cb81679f4	bug 153, 1066: Add add_bookmark_cp().	2009-01-24 12:18:28 +02:00
Kalle Olavi Niemitalo	d1f2f8df80	bug 153, 1066: init_bookmark() and add_bookmark() expect UTF-8. Comment changes only.	2009-01-24 12:17:48 +02:00
Kalle Olavi Niemitalo	37de386051	bug 153, 1066: Document that bookmarks should be UTF-8. Comment changes only.	2009-01-24 12:12:45 +02:00
Kalle Olavi Niemitalo	9088f11c64	Make encode_utf8() extern even without CONFIG_UTF8. It will soon be needed for conversions from UTF-16 to UTF-8.	2009-01-04 16:55:24 +02:00
Kalle Olavi Niemitalo	a82a5cc6d5	XBEL bug 761: Distinguish between names and values of attributes. When ELinks is parsing an XML element in from an XBEL bookmark file, it collects the attributes of the element to the current_node->attrs list. Previously, struct attributes had room for one string only: the last element of current_node->attrs was the name of the first attribute, and it was preceded by the value of the first attribute, the name of the second attribute, the value of the second attribute, and so on. However, when get_attribute_value() was looking for a given name, it compared the values as well. So, if you had for example <bookmark id="href" href="http://elinks.cz/">, then get_attribute_value("href") would incorrectly return "href". To fix this confusion, store values in the new member attributes.value, rather than in attributes.name.	2009-01-04 15:15:21 +02:00
Kalle Olavi Niemitalo	30dbe6a2f8	Use get_terminal_codepage in handle_interlink_event. This should have been in an earlier commit but I somehow missed it. Related to bug 1064 but does not change visible behaviour yet.	2009-01-01 22:59:11 +00:00
Kalle Olavi Niemitalo	e5722ad0d9	Bug 1061: Correctly truncate UTF-8 titles in the tab bar.	2009-01-01 20:01:50 +00:00
Kalle Olavi Niemitalo	8d19b87cb1	Bug 885: Truncate title at 600 bytes, not 1024. Although xterm allows 1024 bytes, GNU Screen apparently has a lower limit.	2009-01-01 19:54:35 +00:00
Kalle Olavi Niemitalo	29c34df62e	Fix assertion failure if IMG/@usemap refers to a different file. Change test/imgmap2.html so it can be used for testing this too. Debian Iceweasel 3.0.4 does not appear to support such external client-side image maps. Well, that's one place where ELinks is superior, I guess. There might be a security problem though if ELinks were to let scripts of the referring page examine the links in the image map.	2009-01-01 19:12:41 +00:00
Kalle Olavi Niemitalo	dc41f0bd4c	test: Don't refer to deleted files from imgmap.html. align.html and poocs.net.html have been deleted. Point the links to href_tests.html and nbsp.html instead.	2009-01-01 18:36:34 +00:00
Kalle Olavi Niemitalo	5be3f71ddd	Add test/image.png and use it in test/imgmap.html. This makes the image-map test work sensibly in graphical browsers too.	2009-01-01 18:35:11 +00:00
Kalle Olavi Niemitalo	b6dfdf86a6	Bug 885: Proper charset support in xterm window title When ELinks runs in an X11 terminal emulator (e.g. xterm), or in GNU Screen, it tries to update the title of the window to match the title of the current document. To do this, ELinks sends an "OSC 1 ; Pt BEL" sequence to the terminal. Unfortunately, xterm expects the Pt string to be in the ISO-8859-1 charset, making it impossible to display e.g. Cyrillic characters. In xterm patch #210 (2006-03-12) however, there is a menu item and a resource that can make xterm take the Pt string in UTF-8 instead, allowing characters from all around the world. The downside is that ELinks apparently cannot ask xterm whether the setting is on or off; so add a terminal._template_.latin1_title option to ELinks and let the user edit that instead. Complete list of changes: - Add the terminal._template_.latin1_title option. But do not add that to the terminal options window because it's already rather crowded there. - In set_window_title(), take a new codepage argument. Use it to decode the title into Unicode characters, and remove only actual control characters. For example, CP437 has graphical characters in the 0x80...0x9F range, so don't remove those, even though ISO-8859-1 has control characters in the same range. Likewise, don't misinterpret single bytes of UTF-8 characters as control characters. - In set_window_title(), do not truncate the title to the width of the window. The font is likely to be different and proportional anyway. But do truncate before 1024 bytes, an xterm limit. - In struct itrm, add a title_codepage member to remember which charset the master said it was going to use in the terminal window title. Initialize title_codepage in handle_trm(), update it in dispatch_special() if the master sends the new request TERM_FN_TITLE_CODEPAGE, and use it in most set_window_title() calls; but not in the one that sets $TERM as the title, because that string was not received from the master and should consist of ASCII characters only. - In set_terminal_title(), convert the caller-provided title to ISO-8859-1 or UTF-8 if appropriate, and report the codepage to the slave with the new TERM_FN_TITLE_CODEPAGE request. The conversion can run out of memory, so return a success/error flag, rather than void. In display_window_title(), check this result and don't update caches on error. - Add a NEWS entry for all of this.	2009-01-01 16:17:03 +00:00

1 2 3 4 5 ...

3223 Commits