elinks

mirror of https://github.com/rkd77/elinks.git synced 2024-12-04 14:46:47 -05:00

Author	SHA1	Message	Date
Kalle Olavi Niemitalo	6032bc730a	Disable resuming download of formatted document If the user chose File -> Save formatted document and typed the name of an existing file, ELinks offered to resume downloading the file. There are a few problems with that: * save_formatted_finish does not actually support resuming. It would instead overwrite the beginning of the file and not truncate it. * When save_formatted calls create_download_file, cdf_hop->data ends up pointing to struct document. If the user then chooses to resume, lun_resume would read *(int *)cdf_hop->data, hoping to get cmdw_hop.magic or codw_hop.magic. struct document does not begin with any such magic value. * Because ELinks already has the formatted document in memory, resuming saves neither time nor I/O. So don't show the "Resume download of the original file" button in this situation.	2009-07-14 10:27:09 +03:00
Yuriy M. Kaminskiy	e5f4c59a20	Fixes failure to search for more than one double-width character.	2009-06-29 23:33:28 +03:00
Kalle Olavi Niemitalo	e452420d5f	Debian bug 534835: Don't assert ecmascript_reset_state succeeds After the recent ecmascript_get_interpreter change, I got an assertion failure in render_document, which calls ecmascript_reset_state and then asserts that it has set vs->ecmascript != NULL. ecmascript_reset_state cannot guarantee that because there might not even be enough free memory for mem_calloc(1, sizeof(struct ecmascript_interpreter)). So, replace the assertion in render_document with error handling, and likewise in call_onsubmit_and_submit.	2009-06-28 11:17:06 +03:00
Kalle Olavi Niemitalo	11c0cb859b	Debian bug 534835: Check _get_interpreter return values This should fix a crash in: at /home/Kalle/src/elinks-0.12/src/ecmascript/spidermonkey.c:251 at /home/Kalle/src/elinks-0.12/src/ecmascript/ecmascript.c:104 at /home/Kalle/src/elinks-0.12/src/viewer/text/vs.c:64 It seems that spidermonkey_get_interpreter failed and returned NULL to ecmascript_get_interpreter, which did not check the return value and behaved as if the ECMAScript interpreter had been properly initialized. This caused destroy_vs to call ecmascript_put_interpreter, but backend_data which should have been a JSContext was NULL, causing a crash in SpiderMonkey. An alternative fix might be to make spidermonkey_put_interpreter skip the JS_DestroyContext call if ctx is NULL. However, I think it is better to make sure ecmascript_get_interpreter returns NULL if spidermonkey_get_interpreter fails, so that vs->ecmascript is left NULL and there's no chance that some other code might try to dereference the (JSContext *) NULL.	2009-06-28 00:18:05 +03:00
Kalle Olavi Niemitalo	10c07f9933	Debian bug 534835: Check some SpiderMonkey return values Perhaps because of bug 981, if one opened hundreds of pages with elinks --remote openURL(...), then ELinks 0.11.4 could crash with a SIGSEGV in JS_InitClass called from spidermonkey_get_interpreter. SpiderMonkey ran out of memory and began returning NULL and JS_FALSE but ELinks didn't notice them and pressed on. Add some checks to avoid the crash, although the underlying out-of-memory error remains.	2009-06-27 19:48:56 +03:00
Kalle Olavi Niemitalo	645e9f22fe	dump: Trim spaces only in color mode 0 or -1 The old code failed to write pending spaces before changing the background color. That seems hard to fix without duplicating code, and ELinks pads dumped lines to the requested width in these color modes anyway, so this commit just makes ELinks write all spaces immediately when colors are being used. Try the following command before and after this commit: elinks --no-home --eval "set document.colors.use_document_colors = 2" \ --dump-color-mode 1 --dump test/color.html	2009-06-21 19:35:50 +03:00
Kalle Olavi Niemitalo	9bc79e4ecf	dump: Define DUMP_COLOR_MODE_NONE	2009-06-21 19:35:50 +03:00
Kalle Olavi Niemitalo	773549180d	dump: Use dump functions in add_document_to_string Now that struct dump_output supports appending to a string, add_document_to_string() can just use that feature, instead of duplicating the code.	2009-06-21 19:35:50 +03:00
Kalle Olavi Niemitalo	f64463a780	dump: Let struct dump_output append to a string struct dump_output can now be initialized in such a way that data written to it will be appended to a struct string. Nothing uses this feature yet.	2009-06-21 19:35:50 +03:00
Kalle Olavi Niemitalo	04dabd5bf1	dump: Move the buffer into new struct dump_output	2009-06-21 19:35:49 +03:00
Kalle Olavi Niemitalo	2f0cefffb5	dump: Replace control characters with spaces In DUMP_FUNCTION_SPECIALIZED, use isscreensafe_ucs (for UTF-8) or isscreensafe (for unibyte) to detect control characters, and replace them with spaces. add_document_to_string already did the same.	2009-06-21 19:35:49 +03:00
Kalle Olavi Niemitalo	ee182ced2b	dump: Unify detection of fullwidth characters In DUMP_FUNCTION_SPECIALIZED (used by elinks --dump), detect the second cell of double-cell (aka fullwidth) characters by comparing to UCS_NO_CHAR, like add_document_to_string does. Don't use unicode_to_cell for this any more. Also, ignore the colors and attributes of the second cell; don't output any escape sequences for them.	2009-06-21 19:35:49 +03:00
Kalle Olavi Niemitalo	f0c88e1960	dump: One #if for declarations and another for statements	2009-06-21 19:35:49 +03:00
Kalle Olavi Niemitalo	417dcba57f	dump: Rename result variables to error Because 0 in them means OK and nonzero (currently -1) means an error.	2009-06-19 12:48:55 +03:00
Kalle Olavi Niemitalo	dedd01c970	dump: More const This is especially useful for showing that neither dump_truecolor_utf8 nor dump_truecolor_unibyte modifies its static color[] variable and it therefore does not matter whether those functions use the same array or not.	2009-06-09 03:48:40 +03:00
Kalle Olavi Niemitalo	bdcbb9f667	dump: Move local variable to reduce nesting	2009-06-09 03:39:23 +03:00
Kalle Olavi Niemitalo	79ea8d087d	dump: Elide trailing spaces in UTF-8 mode too	2009-06-09 03:39:17 +03:00
Kalle Olavi Niemitalo	952c6fa8aa	bug 1080: Fold UTF-8 and unibyte dumping together With all the comments and macros needed for this, the source files don't become much shorter, but anyway I hope they'll be easier to maintain this way.	2009-06-09 01:17:06 +03:00
Kalle Olavi Niemitalo	596a7cbd9b	bug 1080: Implement color modes for UTF-8 dumping	2009-06-09 00:07:38 +03:00
Kalle Olavi Niemitalo	35a091e8f0	bug 1080: Move common code to dump_references() This code was included in four variants of dump_to_file(). Move it to a new function dump_references() and make dump_to_file() then call that. This makes the code size a little smaller. The time cost will be negligible.	2009-06-09 00:07:38 +03:00
Kalle Olavi Niemitalo	200e36c002	bug 1080: Fold dump_color_mode* functions together Instead of having four separate function definitions, have just one sprinkled with #ifdefs, and #include that four times. The purpose being to make it clearer which parts of these functions are identical and which ones differ. As a side effect, this change makes ELinks ignore --dump-color-mode when dumping in UTF-8. Colourful UTF-8 dumping has not been implemented and the fallback is now different from before.	2009-06-09 00:06:10 +03:00
Kalle Olavi Niemitalo	da4bd42e43	bug 1017: Disable protocol.http.compression by default To work around buggy servers until bug 1017 has actually been fixed, i.e., ELinks reports decompression errors to the user.	2009-06-07 12:49:41 +03:00
Kalle Olavi Niemitalo	681e377027	Debian bug 528661: Check for gnutls_priority_set_direct Avoid compilation error with GNUTLS 1.2.9: /home/Kalle/src/elinks-0.12/src/network/ssl/ssl.c:258: error: implicit declaration of function ‘gnutls_priority_set_direct’ If the function is not available, use gnutls_set_default_priority instead. Perhaps it'll work with bugzilla.novell.com, perhaps not.	2009-05-30 14:34:01 +03:00
Witold Filipczyk	864fa0b56a	Debian bug 528661: Disable some TLS extensions on GNUTLS. - gnutls_handshake_set_private_extensions: Do not enable private cipher suites that might not be supported by anything other than GNUTLS. The GNUTLS 2.8.0 documentation notes that enabling these extensions can cause interoperability problems. - gnutls_set_default_priority: Explicitly disable OpenPGP certificates. - gnutls_certificate_type_set_priority: Do not enable OpenPGP certificates. The GNUTLS 2.8.0 documentation notes that OpenPGP certificate support requires libgnutls-extra. Because libgnutls-extra 2.2.0 and later are under GPLv3-or-later and thus not GPLv2 compatible, ELinks doesn't use libgnutls-extra, so OpenPGP certificates didn't work anyway. - gnutls_server_name_set: Do not tell the server the hostname from the URL. This was supposed to let the server choose the appropriate certificate for each name-based virtual host, but ELinks actually always sent just "localhost", so it didn't work anyway. This will have to be revisited when ELinks is changed to actually verify the subject name from the server's certificate (ELinks bug 1024). These changes should help ELinks negotiate SSL with bugzilla.novell.com. [NEWS and commit message by me. --KON]	2009-05-30 11:21:17 +03:00
Miciah Dashiel Butler Masters	1eebbb9ede	Bug 765: use ses_load to load old tab's document Yet another valiant whack at the beast. This one violates abstractions a little less deeply, so maybe it will work better. The last attempt caused a crash when a tab was cloned after the tab's loading had been aborted. (cherry picked from commit `76377d9714`)	2009-05-27 22:15:23 +03:00
Miciah Dashiel Butler Masters	f5103d0cc0	Bug 765: use load_uri to load old tab's document Kalle reported that after commit `5c96d430c9`, ELinks would crash if the document in the old tab was still loading when a new tab was opened. The problem was that the new session's download.data pointer was not updated to point to the session as doc_loading_callback expects. Instead of just calling render_document_frames, set up the download and call load_uri. (cherry picked from commit `d6116ca83a`)	2009-05-27 22:14:19 +03:00
Miciah Dashiel Butler Masters	f4a231cb9a	Bug 765: Bypass checks on base tab's view state when copying to a new tab In setup_session, use copy_location, add_to_history, and render_document_frames instead of goto_uri and copy_vs to copy the base tab's view state. By avoiding goto_uri, setup_session now bypasses MIME checks, form post confirmations, malicious URL checks, and so on when copying the base tab's current location and view state to the new tab, so the new tab should get exactly what was loaded in the base tab. This fixes bug 765: Opening a new tab can ask about the document of the previous tab. (cherry picked from commit `5c96d430c9`) Conflicts: src/session/session.c: Both elinks-0.12 and master had the ses->doc_view->vs = vs assignment, but only elinks-0.12 had vs->doc_view = ses->doc_view as well. Also, struct connection_state had been added after the original patch.	2009-05-27 22:05:22 +03:00
Kalle Olavi Niemitalo	b6aca8d9a7	Add tests for utf8_step_forward I am not hooking these to "make test", for two reasons: 1. utf8_step_forward is inside #ifdef CONFIG_UTF8 and I don't see how to make tests conditional on such options. 2. test/libtest.sh was copied from Git, which is under GPLv2-only. Adding more dependencies on it could make ELinks more difficult to relicense under GPLv2-or-later.	2009-05-27 01:11:03 +03:00
Kalle Olavi Niemitalo	5aae1b81cc	Define die() with __attribute__((noreturn)) This will prevent some compiler warnings in the test I'm about to commit.	2009-05-27 01:11:03 +03:00
Kalle Olavi Niemitalo	6d7b904fe3	Don't overcount in utf8_step_forward Reported by witekfl.	2009-05-27 01:11:02 +03:00
Witold Filipczyk	68ccb4513d	bug 765: If download->callback is set, also set download->data. In task.c, line 517, there is: if (is_in_progress_state((download_p)->state)) { if (have_location(ses)) download_p = &cur_loc(ses)->download; ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Here the download was changed. download->data and download->callback were NULL after the assignment, but later in loading_callback only download->callback got a new value. download->data was still NULL.	2009-05-27 01:11:02 +03:00
Kalle Olavi Niemitalo	0c756fc3e8	TRE: Check for 32-bit wchar_t at configure time This check used to be in src/elinks.h. Move it to configure.in so that (1) the result can be logged and (2) ELinks won't even link with TRE if wchar_t prevents its use. Also, rename HAVE_TRE_REGEX_H to CONFIG_TRE, to reflect that it is not always defined if the header exists.	2009-05-21 17:22:12 +03:00
Witold Filipczyk	387aeac953	No segfault At the end of the destroy_vs there are two assignments: vs->doc_view->vs = NULL and vs->doc_view = NULL. In the setup_session the copy_vs left the vs "unbound" with any variable. At least one of these two is wrong.	2009-05-05 20:44:07 +03:00
Kalle Olavi Niemitalo	a91a08f82b	Named constants in terminal option defaults When setting up default values for terminal options, use named constants like TERM_VT100 or COLOR_MODE_16, rather than plain integers like 1. This is just to make the source code easier to read and perhaps more resistant to future bugs. The binary should not change.	2009-04-19 20:32:37 +03:00
Kalle Olavi Niemitalo	5e0032551b	Fix out-of-memory crash in globhist If globhist_simple_search ran out of memory in stracpy(search_url), it could leave gh_last_searched_title pointing to freed memory and cause a crash in the next call. Fix by not freeing gh_last_searched_title. It is then possible to have gh_last_searched_title and gh_last_searched_url pointing to strings from different searches; but that was already possible if stracpy(search_title) failed. Because this bug occurs only in out-of-memory situations and I don't think ELinks in general has been properly tested in those, the fix is perhaps not worth mentioning in NEWS and backporting to elinks-0.11.	2009-04-19 20:25:37 +03:00
Kalle Olavi Niemitalo	b4567b402b	Bug 1071: Add precautionary assertions and recovery	2009-04-05 20:59:41 +03:00
Kalle Olavi Niemitalo	b7f45ca80b	Bug 1071: Add NULL check in get_dom_node_list_index If the parent parameter of get_dom_node_list_index referred to a node that did not have children, then get_dom_node_list called by it could return the address of a null pointer, and get_dom_node_list_index would then pass that null pointer to get_dom_node_list_pos, which would crash. That would be the same kind of crash as the one in get_dom_node_child. It never happened in practice though: because all calls are in the form get_dom_node_list_index(node->parent, node), the list must contain at least the given node, and the pointer cannot be null. The documentation of get_dom_node_list_index allows arbitrary nodes as arguments however, so it's best to add a check.	2009-04-04 22:41:43 +03:00
Kalle Olavi Niemitalo	0c1a27ee99	Bug 1071: Document get_dom_node_value return values	2009-04-04 22:38:56 +03:00
Kalle Olavi Niemitalo	8465b19d0c	Bug 1071: Fix null-ptr crash in get_dom_node_child struct dom_node contains a union that contains various structs that have members of type struct dom_node * in them. get_dom_node_list_by_type returns the address (struct dom_node **) of one of those members, or NULL. However the member itself can also be NULL if no nodes have been added to the list and the list has thus not yet been allocated. (add_to_dom_node_list lazily allocates the lists.) get_dom_node_child did not expect a null pointer there and crashed, as shown in bug 1071. Fix by adding a check so that it treats a NULL list as an empty list.	2009-04-04 21:56:53 +03:00
Kalle Olavi Niemitalo	d7d18e4e43	bug 1047: inline functions C99 conformance C99 6.7.4p3 and 6.7.4p6 set some constraints on what can be done in inline functions and how they can be declared. In particular, any function declared inline must also be defined in the same translation unit. To comply with that, remove inline specifiers from function declarations in header files when the functions are not also defined in those header files. Sun Studio 11 on Solaris 9 is stricter than C99 and does not allow references to static identifiers in extern inline functions. Make the configure script detect this and define NONSTATIC_INLINE accordingly in config.h. Then use that in the definitions of all non-static inline functions. Document the restrictions and this scheme in doc/hacking.txt.	2009-03-28 20:15:08 +02:00
Kalle Olavi Niemitalo	5a43c55c9e	Rewrap lines in option documentation. Documentation strings of most options used to contain a "\n" at the end of each source line. When the option manager displayed these strings, it treated each "\n" as a hard newline. On 80x24 terminals however, the option description window has only 60 columns available for the text (with the default setup.h), and the hard newlines were further apart, so the option manager wrapped the text a second time, resulting in rather ugly output where long lines are interleaved with short ones. This could also cause the text to take up too much vertical space and not fit in the window. Replace most of those hard newlines with spaces so that the option manager (or perhaps BFU) will take care of the wrapping. At the same time, rewrap the strings in source code so that the source lines are at most 79 columns wide. In some options though, there is a list of possible values and their meanings. In those lists, if the description of one value does not fit in one line, then continuation lines should be indented. The option manager and BFU are not currently able to do that. So, keep the hard newlines in those lists, but rewrap them to 60 columns so that they are less likely to require further wrapping at runtime.	2009-03-08 15:18:10 +02:00
Kalle Olavi Niemitalo	a277c0ad3b	Wrap option descriptions in --config-help, --long-help	2009-03-08 13:29:51 +02:00
Kalle Olavi Niemitalo	bd5d3a173f	Add wrap_option_desc() in conf.c, still static Move the description wrapping code from smart_config_output_fn() to a separate function wrap_option_desc() so that --config-help can soon use it too.	2009-03-08 13:29:51 +02:00
Kalle Olavi Niemitalo	8ad32809e1	bug 153, 1066: Fix search in bookmark manager. test_search() was supposed to compare bookmark titles with strcasestr(), but in commit `311d95358d` "bug 153, 1066: Convert bookmarks to/from UTF-8 when searching." on 2009-02-08, I inadvertently changed that to strcasecmp(), even while adding a comment about why strcasestr() is needed. strcasestr() returns non-NULL if the strings match, and strcasecmp() returns nonzero if they differ, so the search didn't work at all.	2009-03-01 09:21:29 +02:00
Kalle Olavi Niemitalo	1487d206db	Bug 1069: Revert "1031: JS_SetErrorReporter only once per JSRuntime." This reverts commit `b94657869b`. I don't know where I got the idea that JS_SetErrorReporter affects the entire JSRuntime, rather than only the provided JSContext. The people on #jsapi say it has never worked that way.	2009-02-26 22:56:33 +02:00
Miciah Dashiel Butler Masters	e9370fe5b9	Comment the last change	2009-02-25 01:54:36 +00:00
Miciah Dashiel Butler Masters	f11b2a8f97	utf8_to_jsstring: Don't free mem handed over to JS In utf8_to_jsstring, do not free the string that is passed to JS_NewUCString if the latter is successful; if it is, SpiderMonkey handles the memory from then on. Use libc routines instead of ELinks's routines to allocate and free the string so that ELinks's memory debugging code does not try to keep track of it after it has been handed to SpiderMonkey. This commit fixes a bug introduced in `97d72d15a0`.	2009-02-25 01:45:56 +00:00
Kalle Olavi Niemitalo	39707c4102	Rewrap protocol.http.compression doc to 59 columns This makes the option manager display it much better on an 80x24 terminal. Alternatively, the "\n" newline characters within paragraphs could have been removed entirely. ELinks would then have line-wrapped the text to the appropriate width in the info window of the option manager, but unfortunately not in --config-help.	2009-02-22 21:28:14 +02:00
Kalle Olavi Niemitalo	2a7346d371	Tone down the protocol.http.compression warning AFAIK, all bugs in it have been fixed. Some bugs may still be lurking but they are more likely to get caught if compression is enabled. I also replaced COMP_NOTE with static text because xgettext does not support macros in the argument of N_. (cherry picked from commit `3a9b5d091d`)	2009-02-22 20:55:05 +02:00
Petr Baudis	a7f94dbbd1	Introduce protocol.http.compression knob When disabled, no Accept-Encoding header is sent. (cherry picked from commit `d4cec950ec`) Conflicts: src/protocol/http/http.c	2009-02-22 20:17:53 +02:00
Miciah Dashiel Butler Masters	84259ff26a	Fix crash on search-toggle-regex when RE disabled Check the return value of get_opt_rec on "document.browse.search.regex" before dereferencing it. The option is not there if regular expression support is disabled at build time. This commit fixes a bug introduced in commit b2d51c75ff0d6c52a4f6a2761801beb641cba3a2.	2009-02-22 04:06:51 +00:00
Witold Filipczyk	4a2fd2d964	Fall back to raw deflate only when nothing has been decompressed so far. This lets ELinks display the site from bug 1017. (cherry picked from commit `3131de4767`) Conflicts: src/protocol/http/http.c	2009-02-21 14:12:03 +01:00
Witold Filipczyk	53ab6d493e	bug 1068: Decompress data when the socket is closed. The reasons why the decompression failed: - the server gave wrong Content-Length - the socket was closed	2009-02-21 10:36:44 +01:00
Kalle Olavi Niemitalo	d14f65a331	bug 1067: Comments about freeing the DOM document node.	2009-02-15 04:27:39 +02:00
Kalle Olavi Niemitalo	eb820a57a6	bug 1067: Assertions and comments about done_dom_node(). In bug 1067, dom_rss_pop_document() freed a node with done_dom_node() even though call_dom_node_callbacks() was still using that node. This made call_dom_node_callbacks() read a function pointer from beyond the end of an array and call that. Add assertions to detect out-of-range node types, and comments to warn about the bug.	2009-02-15 03:39:00 +02:00
Witold Filipczyk	a7c2f14e6d	bug 1067: the node was freed, but still used.	2009-02-12 09:48:04 +01:00
Kalle Olavi Niemitalo	7067fc7af9	Check for JS_ReportAllocationOverflow before using it. Debian libmozjs-dev 1.9.0.4-2 has JS_ReportAllocationOverflow but js-1.7.0 reportedly hasn't. Check at configure time whether that function is available. If not, use JS_ReportOutOfMemory instead. Reported by Witold Filipczyk.	2009-02-08 23:07:22 +02:00
Witold Filipczyk	664048098a	Bug 1060: #undef HAVE_TRE_REGEX_H only in elinks.h I didn't read the code of the tre library, but I suppose that when sizes of wchar_t and unicode_val_T are equal it will work fine. [ From bug 1060 attachment 508. --KON ]	2009-02-08 18:26:22 +02:00
Witold Filipczyk	c5a7f87c43	Bug 1060: Use libtre for regexp searches. When the user tells ELinks to search for a regexp, ELinks 0.11.0 passes the regexp to regcomp() and the formatted document to regexec(), both in the terminal charset. This works OK for unibyte ASCII-compatible charsets because the regexp metacharacters are all in the ASCII range. And ELinks 0.11.0 doesn't support multibyte or ASCII-incompatible (e.g. EBCDIC) charsets in terminals, so it is no big deal if regexp searches fail in such locales. ELinks 0.12pre1 attempts to support UTF-8 as the terminal charset if CONFIG_UTF8 is defined. Then, struct search contains unicode_val_T c rather than unsigned char c, and get_srch() and add_srch_chr() together save UTF-32 values there if the terminal charset is UTF-8. In plain-text searches, is_in_range_plain() compares those values directly if the search is case sensitive, or folds them to lower case if the search is case insensitive: with towlower() if the terminal charset is UTF-8, or with tolower() otherwise. In regexp searches however, get_search_region_from_search_nodes() still truncates all values to 8 bits in order to generate the string that search_for_pattern() then passes to regexec(). In UTF-8 locales, regexec() expects this string to be in UTF-8 and can't make sense of the truncated characters. There is also a possible conflict in regcomp() if the locale is UTF-8 but the terminal charset is not, or vice versa. Rejected ways of fixing the charset mismatches: * When the terminal charset is UTF-8, recode the formatted document from UTF-32 to UTF-8 for regexp searching. This would work if the terminal and the locale both use UTF-8, or if both use unibyte ASCII-compatible charsets, but not if only one of them uses UTF-8. * Convert both the regexp and the formatted document to the charset of the locale, as that is what regcomp() and regexec() expect. 
ELinks would have to somehow keep track of which bytes in the converted string correspond to which characters in the document; not entirely trivial because convert_string() can replace a single unconvertible character with a string of ASCII characters. If ELinks were eventually changed to use iconv() for unrecognized charsets, such tracking would become even harder. * Temporarily switch to a locale that uses the charset of the terminal. Unfortunately, it seems there is no portable way to construct a name for such a locale. It is also possible that no suitable locale is available; especially on Windows, whose C library defines MB_LEN_MAX as 2 and thus cannot support UTF-8 locales. Instead, this commit makes ELinks do the regexp matching with regwcomp and regwexec from the TRE library. This way, ELinks can losslessly recode both the pattern and the document to Unicode and rely on the regexp code in TRE decoding them properly, regardless of locale. There are some possible problems though: 1. ELinks stores strings as UTF-32 in arrays of unicode_val_T, but TRE uses wchar_t instead. If wchar_t is UTF-16, as it is on Microsoft Windows, then TRE will misdecode the strings. It wouldn't be too hard to make ELinks convert to UTF-16 in this case, but (a) TRE doesn't currently support UTF-16 either, and it seems possible that wchar_t-independent UTF-32 interfaces will be added to TRE; and (b) there seems to be little interest on using ELinks on Windows anyway. 2. The Citrus Project apparently wanted BSD to use a locale-dependent wchar_t: e.g. UTF-32 in some locales and an ISO 2022 derivative in others. Regexp searches in ELinks now do not support the latter. [ Adapted to elinks-0.12 from bug 1060 attachment 506. Commit message by me. --KON ]	2009-02-08 18:26:22 +02:00
Kalle Olavi Niemitalo	264a66fe4d	bug 153: UTF-8 bookmark.title has been fully implemented. Mention it in NEWS too.	2009-02-08 18:26:21 +02:00
Kalle Olavi Niemitalo	311d95358d	bug 153, 1066: Convert bookmarks to/from UTF-8 when searching.	2009-02-08 18:26:21 +02:00
Kalle Olavi Niemitalo	8c0fa7f09c	bug 153, 1066: Convert strings to edit-bookmark dialog from UTF-8.	2009-02-08 18:26:21 +02:00
Kalle Olavi Niemitalo	5a29dbc4a1	bug 153, 1066: Convert strings to bookmark info dialog from UTF-8.	2009-02-08 18:26:20 +02:00
Kalle Olavi Niemitalo	b3acd2a5bc	bug 153: Convert titles in bookmark manager from UTF-8.	2009-02-08 18:26:20 +02:00
Kalle Olavi Niemitalo	b3f9d48bba	bug 153, 1066: Convert strings from add-bookmark dialogs to UTF-8. In src/bookmarks/dialogs.c, do_add_bookmark() gets the title and URL in the terminal charset and needs to know which one that is. When a bookmark is being added, save the struct terminal * to dialog.udata2 and read the charset from there. When a bookmark is being edited, dialog.udata2 is needed for the struct bookmark *, but there we always have the parent struct dialog_data in dialog.udata and can get the terminal from that.	2009-02-08 18:26:19 +02:00
Kalle Olavi Niemitalo	b432b735e4	bug 1066: Attempt to convert -remote addBookmark(URL) to UTF-8. Currently, it is not clear which codepage is used in struri(). Assume it is the system codepage.	2009-02-08 18:26:19 +02:00
Kalle Olavi Niemitalo	99d1269bc5	bug 153, 1066: Convert session-snapshot bookmarks to/from UTF-8. These functions now expect or return strings in UTF-8: delete_folder_by_name (sneak in a const, too), bookmark_terminal_tabs, open_bookmark_folder, and get_auto_save_bookmark_foldername_utf8 (new function).	2009-02-08 18:26:19 +02:00
Kalle Olavi Niemitalo	11acd03eb2	Use update_bookmark() in SMJS bookmark object. When setting the title or URL of a bookmark from SMJS user scripting, use update_bookmark() instead of writing directly to struct bookmark. It triggers the bookmark-update event and sets the bookmarks_dirty flag.	2009-02-08 18:26:18 +02:00
Kalle Olavi Niemitalo	97d72d15a0	bug 153, 1066: Convert properties of SMJS bookmark to/from UTF-8. SpiderMonkey uses UTF-16 and the strings in struct bookmark are in UTF-8. Previously, the conversions behaved as if the strings had been in ISO-8859-1. SpiderMonkey also supports JS_SetCStringsAreUTF8(), which would make the existing functions convert between UTF-16 and UTF-8, but that effect is global so I dare not enable it yet. Besides, I don't know if that function works in all the SpiderMonkey versions that ELinks claims to work with.	2009-02-08 18:26:18 +02:00
Kalle Olavi Niemitalo	03b112796d	bug 153, 1066: Add codepage parameter to update_bookmark(). This also makes the bookmark-update event carry strings in UTF-8. The only current consumer of that event is bookmark_change_hook(), which ignores the strings, so no changes are needed there.	2009-02-08 18:26:18 +02:00
Kalle Olavi Niemitalo	73f925ce21	bug 153, 1066: Convert XBEL bookmarks to/from UTF-8. When the file is being read, Expat provides the strings to ELinks in UTF-8, so ELinks can put them in struct bookmark without conversions. Make sure gettext returns any placeholder strings in UTF-8, too. Replace '\r' with ' ' in bookmark titles and URLs. When the file is being written, put encoding="UTF-8" in the XML declaration, and then write out the strings from struct bookmark without character set conversions. Do replace some characters with entity references though, by calling add_html_to_string().	2009-02-08 18:26:04 +02:00
Kalle Olavi Niemitalo	8c0ae2a215	bug 153, 1066: Convert ~/.elinks/bookmarks to/from UTF-8. The ~/.elinks/bookmarks file is in the system charset, for compatibility with earlier ELinks releases, but internally the strings are in UTF-8.	2009-01-24 14:38:59 +02:00
Kalle Olavi Niemitalo	1cb81679f4	bug 153, 1066: Add add_bookmark_cp().	2009-01-24 12:18:28 +02:00
Kalle Olavi Niemitalo	d1f2f8df80	bug 153, 1066: init_bookmark() and add_bookmark() expect UTF-8. Comment changes only.	2009-01-24 12:17:48 +02:00
Kalle Olavi Niemitalo	37de386051	bug 153, 1066: Document that bookmarks should be UTF-8. Comment changes only.	2009-01-24 12:12:45 +02:00
Kalle Olavi Niemitalo	9088f11c64	Make encode_utf8() extern even without CONFIG_UTF8. It will soon be needed for conversions from UTF-16 to UTF-8.	2009-01-04 16:55:24 +02:00
Kalle Olavi Niemitalo	a82a5cc6d5	XBEL bug 761: Distinguish between names and values of attributes. When ELinks is parsing an XML element from an XBEL bookmark file, it collects the attributes of the element to the current_node->attrs list. Previously, struct attributes had room for one string only: the last element of current_node->attrs was the name of the first attribute, and it was preceded by the value of the first attribute, the name of the second attribute, the value of the second attribute, and so on. However, when get_attribute_value() was looking for a given name, it compared the values as well. So, if you had for example <bookmark id="href" href="http://elinks.cz/">, then get_attribute_value("href") would incorrectly return "href". To fix this confusion, store values in the new member attributes.value, rather than in attributes.name.	2009-01-04 15:15:21 +02:00
Kalle Olavi Niemitalo	30dbe6a2f8	Use get_terminal_codepage in handle_interlink_event. This should have been in an earlier commit but I somehow missed it. Related to bug 1064 but does not change visible behaviour yet.	2009-01-01 22:59:11 +00:00
Kalle Olavi Niemitalo	e5722ad0d9	Bug 1061: Correctly truncate UTF-8 titles in the tab bar.	2009-01-01 20:01:50 +00:00
Kalle Olavi Niemitalo	8d19b87cb1	Bug 885: Truncate title at 600 bytes, not 1024. Although xterm allows 1024 bytes, GNU Screen apparently has a lower limit.	2009-01-01 19:54:35 +00:00
Kalle Olavi Niemitalo	29c34df62e	Fix assertion failure if IMG/@usemap refers to a different file. Change test/imgmap2.html so it can be used for testing this too. Debian Iceweasel 3.0.4 does not appear to support such external client-side image maps. Well, that's one place where ELinks is superior, I guess. There might be a security problem though if ELinks were to let scripts of the referring page examine the links in the image map.	2009-01-01 19:12:41 +00:00
Kalle Olavi Niemitalo	b6dfdf86a6	Bug 885: Proper charset support in xterm window title When ELinks runs in an X11 terminal emulator (e.g. xterm), or in GNU Screen, it tries to update the title of the window to match the title of the current document. To do this, ELinks sends an "OSC 1 ; Pt BEL" sequence to the terminal. Unfortunately, xterm expects the Pt string to be in the ISO-8859-1 charset, making it impossible to display e.g. Cyrillic characters. In xterm patch #210 (2006-03-12) however, there is a menu item and a resource that can make xterm take the Pt string in UTF-8 instead, allowing characters from all around the world. The downside is that ELinks apparently cannot ask xterm whether the setting is on or off; so add a terminal._template_.latin1_title option to ELinks and let the user edit that instead. Complete list of changes: - Add the terminal._template_.latin1_title option. But do not add that to the terminal options window because it's already rather crowded there. - In set_window_title(), take a new codepage argument. Use it to decode the title into Unicode characters, and remove only actual control characters. For example, CP437 has graphical characters in the 0x80...0x9F range, so don't remove those, even though ISO-8859-1 has control characters in the same range. Likewise, don't misinterpret single bytes of UTF-8 characters as control characters. - In set_window_title(), do not truncate the title to the width of the window. The font is likely to be different and proportional anyway. But do truncate before 1024 bytes, an xterm limit. - In struct itrm, add a title_codepage member to remember which charset the master said it was going to use in the terminal window title. Initialize title_codepage in handle_trm(), update it in dispatch_special() if the master sends the new request TERM_FN_TITLE_CODEPAGE, and use it in most set_window_title() calls; but not in the one that sets $TERM as the title, because that string was not received from the master and should consist of ASCII characters only. - In set_terminal_title(), convert the caller-provided title to ISO-8859-1 or UTF-8 if appropriate, and report the codepage to the slave with the new TERM_FN_TITLE_CODEPAGE request. The conversion can run out of memory, so return a success/error flag, rather than void. In display_window_title(), check this result and don't update caches on error. - Add a NEWS entry for all of this.	2009-01-01 16:17:03 +00:00
Kalle Olavi Niemitalo	8f4d7f9903	Define cp_to_unicode() even without CONFIG_UTF8. And make its last parameter point to const. add_cp_html_to_string() no longer needs to pretend UTF-8 is ISO-8859-1.	2009-01-01 16:17:03 +00:00
Kalle Olavi Niemitalo	ad45176dde	Add get_terminal_codepage(). This simplifies the callers a little and may help implement simultaneous support for different charsets on different terminals of the same type (bug 1064).	2009-01-01 16:16:17 +00:00
Kalle Olavi Niemitalo	25da8085b3	Fix double-free crash if EOF immediately follows </MAP>. look_for_link() used to return 0 both when it found the closing </MAP> tag, and when it hit the end of the file. In the first case, it also added menu to the memory_list; in the second case, it did not. The caller get_image_map() supposedly distinguished between these cases by checking whether pos >= eof, and freed menu separately if so. However, if the </MAP> was at the very end of the HTML file, so that not even a newline followed it, then look_for_link() left pos == eof even though it had found the </MAP> and added menu to the memory_list. This made get_image_map() misinterpret the result and mem_free(menu) even though *menu had already been freed as part of the memory_list; thus the crash. To fix this, make look_for_link() return -1 instead of 0 if it hits EOF without finding the </MAP>. Then make get_image_map() check the return value instead of comparing pos to eof. And add a test case, although not an automated one. Alternatively, look_for_link() could have been changed to decrement pos between finding the </MAP> and returning 0. Then, the pos >= eof comparison in get_image_map() would have been false. That scheme would however have been a bit more difficult to understand and maintain, I think. Reported by Paul B. Mahol. (cherry picked from commit `a2404407ce`)	2008-12-31 20:15:44 +00:00
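The return-value convention chosen in 25da8085b3 can be sketched roughly as follows. This is not the ELinks code: `find_map_end()` is a hypothetical helper that only scans for a literal, case-sensitive `</MAP>` and does none of the real parsing or memory_list handling. It shows why the caller should test the return value instead of comparing pos to eof.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical sketch: scan [*pos, eof) for the literal "</MAP>".
 * Returns 0 and moves *pos just past the tag when it is found;
 * returns -1 and leaves *pos == eof when the input ends first.
 * After a successful call *pos may also equal eof (tag at the very
 * end of the input), so only the return value tells the cases apart. */
static int
find_map_end(const char **pos, const char *eof)
{
    static const char tag[] = "</MAP>";
    const size_t len = sizeof(tag) - 1;
    const char *p;

    for (p = *pos; (size_t) (eof - p) >= len; p++) {
        if (!memcmp(p, tag, len)) {
            *pos = p + len;
            return 0;
        }
    }

    *pos = eof;
    return -1;
}
```

For the input `<MAP></MAP>` with nothing after the closing tag, this sketch returns 0 with `*pos == eof`, which is exactly the situation the old pos >= eof check misread as end of file.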
Kalle Olavi Niemitalo	d668b3b6aa	mouse: Exit cursor-routing mode when a link is clicked Before this patch, if you first moved the cursor to link X with move-cursor-up and similar actions, and then clicked link Y with the mouse, ELinks would activate link X, i.e. not the one you clicked. This happened because the NAVIGATE_CURSOR_ROUTING mode was left enabled and made ELinks ignore the doc_view->vs->current_link member that ELinks had updated according to the click. Make ELinks return the session to NAVIGATE_LINKWISE mode, so that the update takes effect. Reported by Paul B. Mahol. (cherry picked from commit `4086418069`)	2008-12-28 13:24:07 +02:00
Peter Collingbourne	658b9cc70f	Fixed bug relating to newlines in hidden input fields This patch fixes an issue whereby a newline character appearing within a hidden input field is incorrectly reinterpreted as a space character. The patch handles almost all cases, and includes a test case. 15/18 tests pass, but the remainder currently fail due to the fact that ELinks does not currently support textarea scripting.	2008-11-09 23:28:46 +02:00
Kalle Olavi Niemitalo	c56f3928ec	Bug 1004: Rewrite FSF code to avoid GPLv2 2. c) c_strcasecmp and c_strncasecmp were taken from GNU coreutils 6.9, which is copyrighted by the Free Software Foundation and licensed under GNU GPL version 2 or later. It seems the programs in coreutils do not normally read commands interactively. So, including coreutils code in an interactive program such as ELinks could trigger GPLv2 section 2. c), which would require ELinks to display a copyright notice and a warranty disclaimer each time it is started. Rewrite those functions to remove the FSF-copyrighted code and make ELinks not a work based on GNU coreutils. Avoiding FSF code has the additional benefit that we won't have to ask FSF for permission if we want to add a licence exception that allows linking ELinks with OpenSSL. So it seems a good idea even if my interpretation of GPLv2 2. c) is overly strict. I haven't checked though whether there are other FSF-copyrighted portions in ELinks.	2008-11-02 22:15:38 +02:00
M. Vefa Bicakci	20a7a6c460	Patch 3: Further fixes including strcasestr and convert_to_lowercase	2008-11-01 22:32:43 +02:00
Kalle Olavi Niemitalo	1ba7d5a260	Bug 1004: Use c_toupper in a few more places. src/config/kbdbind.c (parse_keystroke): If the user types "Ctrl-i", it should mean "Ctrl-I" rather than "Ctrl-İ", because the Ctrl- combinations are only well known for ASCII characters. This does not matter in practice though, because src/terminal/kbd.c converts 0x09 to (KBD_MOD_NONE, KBD_TAB) and not to (KBD_MOD_CTRL, 'I'). src/osdep/beos/beos.c (get_system_env): Changing the locale does not affect the TERM environment variable, I think, so it should not affect the interpretation either.	2008-11-01 22:32:43 +02:00
Kalle Olavi Niemitalo	aaf6be8a36	Bug 1004: Fix implicit declarations of c_* functions Add #include directives to fix these errors: [CC] src/intl/gettext/l10nflist.o cc1: warnings being treated as errors .../src/intl/gettext/l10nflist.c: In function ‘_nl_normalize_codeset’: .../src/intl/gettext/l10nflist.c:352: error: implicit declaration of function ‘c_tolower’ [CC] src/dom/css/scanner.o cc1: warnings being treated as errors In file included from .../src/dom/scanner.h:4, from .../src/dom/css/scanner.h:4, from .../src/dom/css/scanner.c:12: .../src/dom/string.h: In function ‘dom_string_casecmp’: .../src/dom/string.h:34: error: implicit declaration of function ‘c_strncasecmp’	2008-11-01 22:27:08 +02:00
M. Vefa Bicakci	96b3093519	Patch 2: Modifications to the remaining parts of ELinks [Forward ported to 0.12 from bug 1004 attachment 499. --KON]	2008-11-01 22:20:25 +02:00
M. Vefa Bicakci	86085de07e	Patch 1: Finalize modifications to the HTML parser [Forward ported to 0.12 from bug 1004 attachment 498. --KON]	2008-10-26 18:00:19 +02:00
M. Vefa Bicakci	85c26ddc45	Patch 0: Partial modification of the HTML parser and modification of the FastFind subsystem [Forward ported to 0.12 from bug 1004 attachment 500. --KON]	2008-10-26 16:13:38 +02:00
Kalle Olavi Niemitalo	12d66ff043	Bug 932: Redisable 0x80...0x9F mappings in some charsets. Bug 932 is about ELinks letting control characters 0x80...0x9F through to the terminal. It did not occur with ISO 8859-1, 8859-2, 8859-15, or 8859-16, because the ELinks mappings for those charsets did not include those bytes. However, the www.unicode.org versions imported in the previous commit do include the problematic bytes. To avoid a possible regression before the ELinks 0.12.0 release, comment those control-character mappings out again. This workaround should be reverted after bug 932 has been fixed properly.	2008-10-11 15:35:34 +03:00
Kalle Olavi Niemitalo	c9ca6fd448	Refresh charsets from www.unicode.org. Add copyright and licence notices, and a NEWS entry. The data in the new versions is not entirely the same as what ELinks used to have: - Unicode/8859_1.cp: Adds control characters. - Unicode/8859_2.cp: Adds control characters. - Unicode/8859_4.cp: Adds some control characters that ELinks assumed there already. - Unicode/8859_7.cp: Adds three characters. - Unicode/8859_15.cp: Adds control characters. - Unicode/8859_16.cp: Adds control characters and swaps 0xA5 with 0xAB. - Unicode/koi8_r.cp: Changes 0x95 and adds some control characters that ELinks assumed there already. - Unicode/macroman.cp: Changes 0xC6 and removes some control characters that ELinks assumes there anyway.	2008-10-11 15:35:09 +03:00
Kalle Olavi Niemitalo	00f5831812	Bug 1053: Fix crash when download ends prematurely. Call stacks reported by valgrind: ==14702== at 0x80DD791: read_from_socket (socket.c:945) ==14702== by 0x8104D0C: read_more_http_data (http.c:1180) ==14702== by 0x81052FE: read_http_data (http.c:1388) ==14702== by 0x80DD69B: read_select (socket.c:910) ==14702== by 0x80D27AA: select_loop (select.c:307) ==14702== by 0x80D1ADE: main (main.c:358) ==14702== Address 0x4F4E598 is 56 bytes inside a block of size 81 free'd ==14702== at 0x402210F: free (vg_replace_malloc.c:233) ==14702== by 0x812BED8: debug_mem_free (memdebug.c:484) ==14702== by 0x80D7C82: done_connection (connection.c:479) ==14702== by 0x80D8A44: abort_connection (connection.c:769) ==14702== by 0x80D99CE: cancel_download (connection.c:1053) ==14702== by 0x8110EB6: abort_download (download.c:143) ==14702== by 0x81115BC: download_data_store (download.c:337) ==14702== by 0x8111AFB: download_data (download.c:446) ==14702== by 0x80D7B33: notify_connection_callbacks (connection.c:458) ==14702== by 0x80D781E: set_connection_state (connection.c:388) ==14702== by 0x80D7132: set_connection_socket_state (connection.c:234) ==14702== by 0x80DD78D: read_from_socket (socket.c:943) read_from_socket() attempted to read socket->fd in order to set handlers on it, but the socket had already been freed. Incidentally, socket->fd was -1, which would have resulted in an assertion failure if valgrind hadn't caught the bug first. To fix this, add a list of weak references to sockets. read_from_socket() registers a weak reference on entry and unregisters it before exit. done_socket() breaks any weak references to the specified socket. read_from_socket() then checks whether the weak reference was broken, and doesn't access the socket any more if so.	2008-10-04 14:19:00 +03:00
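The weak-reference scheme described in 00f5831812 might look roughly like the sketch below. All names here (`struct sock`, `struct weak_ref`, `weak_ref_register()`, `weak_ref_break()`, `run_callback_safely()`, `done_sock()`) are hypothetical and chosen for illustration; the real ELinks structures and functions differ. The idea is that a function registers a weak reference on entry, lets callbacks run, and afterwards checks whether the reference was broken before touching the socket again.

```c
#include <stddef.h>

/* Hypothetical minimal socket; stands in for the real structure. */
struct sock {
    int fd;
};

/* A weak reference lives on the stack of the function that needs it.
 * `socket` becomes NULL if the socket is destroyed in the meantime. */
struct weak_ref {
    struct sock *socket;
    struct weak_ref *next;
};

static struct weak_ref *weak_refs; /* list of active weak references */

static void
weak_ref_register(struct weak_ref *ref, struct sock *s)
{
    ref->socket = s;
    ref->next = weak_refs;
    weak_refs = ref;
}

static void
weak_ref_unregister(struct weak_ref *ref)
{
    struct weak_ref **p;

    for (p = &weak_refs; *p; p = &(*p)->next) {
        if (*p == ref) {
            *p = ref->next;
            return;
        }
    }
}

/* Called when a socket is being destroyed: break every weak
 * reference that still points at it. */
static void
weak_ref_break(struct sock *s)
{
    struct weak_ref *r;

    for (r = weak_refs; r; r = r->next) {
        if (r->socket == s)
            r->socket = NULL;
    }
}

/* Example destructor: breaks weak references, then invalidates fd.
 * A real one would also free the socket. */
static void
done_sock(struct sock *s)
{
    weak_ref_break(s);
    s->fd = -1;
}

/* Run a callback that may destroy the socket. Returns 1 if the socket
 * is still alive afterwards and 0 if it must not be accessed again. */
static int
run_callback_safely(struct sock *s, void (*callback)(struct sock *))
{
    struct weak_ref ref;
    int alive;

    weak_ref_register(&ref, s);
    callback(s);
    alive = (ref.socket != NULL);
    weak_ref_unregister(&ref);
    return alive;
}
```

The caller inspects the return value of `run_callback_safely()` and skips any further access to the socket when it is 0, which corresponds to read_from_socket() not touching socket->fd after the weak reference was broken.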
Kalle Olavi Niemitalo	bda58a124a	Revert "Use given connections id in connection_disappeared()." This reverts src/{network,sched}/connection.c CVS revision 1.43, which was made on 2003-07-03 and converted to Git commit cae65f7941628109b51ffb2e2d05882fbbdc73ef in elinks-history. It is pointless to check whether (c == d && c->id == d->id). If c == d, then surely c->id == d->id, and I wouldn't be surprised to see a compiler optimize that out. Whereas, by taking the id as a parameter, connection_disappeared() can check whether the pointer now points to a new struct connection with a different id.	2008-10-04 13:00:57 +03:00
Kalle Olavi Niemitalo	4c2ddac289	Bug 1053: Fix crash when download ends. ELinks attempted to display a message box on file_download.term, but it had already closed that terminal and freed the struct terminal. To fix this, reset file_download.term pointers to NULL when the terminal is about to be destroyed. Also, assert in download_data_store() that file_download.term is either NULL or in the global "terminals" list. Reported by أحمد المحمودي. (cherry picked from commit `6e2476ea4d`)	2008-10-03 00:18:41 +03:00
Kalle Olavi Niemitalo	b0ce4adcbe	Let Perl scripts dynamically load libraries. XML::LibXML::SAX appears to require this.	2008-09-27 21:58:08 +03:00

2368 Commits