guile

mirror of https://git.savannah.gnu.org/git/guile.git synced 2025-07-12 12:10:30 +02:00

Author	SHA1	Message	Date
Michael Gran	3d03f9395e	write-char should handle UCS-4 characters * libguile/print.c (scm_write_char): call UCS-4 printing routine, instead of 8-bit primitive	2009-09-04 07:27:14 -07:00
Ken Raeburn	5f5e7a2cd6	Make test-case compilation with -DSCM_DEBUG=1 work. * gc.h (scm_i_expensive_validation_check): Declare SCM_API.	2009-09-03 16:59:11 -04:00
Michael Gran	bb15a36c25	Update docs and docstrings for Unicode characters * doc/ref/api-data.texi: more info about characters and codepoints * libguile/chars.c: replace 'code point' with 'Unicode code point' in docstrings	2009-09-03 08:48:23 -07:00
Michael Gran	ba8477ecce	Add char-set debugging function * libguile/srfi-14.c (scm_sys_char_set_dump): new function * libguile/srfi-14.h: declaration of scm_sys_char_set_dump	2009-09-03 08:29:45 -07:00
Michael Gran	719bb8cd5d	Distinguish between all codepoints and designated codepoints in char-sets * libguile/unidata_to_charset.pl (designated): renamed from full * libguile/srfi-14.c (scm_char_set_designated): new char-set * libguile/srfi-14.i.c (cs_designated): renamed from cs_full	2009-09-03 08:23:24 -07:00
Michael Gran	0dcd7e6153	Modify read and print of combining characters Since combining characters, such as accents, modify the appearance of the previous letter, it looks awkward in its character literal form (#\name) since it modified the backslash. This instead prints the combining character on a small circle. * libguile/chars.h (SCM_CODEPOINT_DOTTED_CIRCLE): new #define * libguile/print.c (iprint1): print combining characters on dotted circles * libguile/read.c (scm_read_character): parse the combination of combining characters and dotted circles	2009-09-03 07:47:26 -07:00
Michael Gran	aa2cba9c88	Remove always-true range checks in scm_i_ucs_range_to_char_set * libguile/srfi-14.c (scm_i_ucs_range_to_char_set): limits are always non-negative due to the type of the variable	2009-09-02 06:45:05 -07:00
Michael Gran	08ed805879	Unreachable code in charset set operator * libguile/srfi-14.c (scm_i_charset_set): remove unreachable code in scm_i_charset_set	2009-09-02 06:28:55 -07:00
Michael Gran	aff31b0f99	Optimize charset union operator * libguile/srfi-14.c (charsets_union): call scm_i_charset_set_range instead of setting characters one-by-one.	2009-09-02 06:28:47 -07:00
Michael Gran	f4cdfe6140	The charset complement operator should not include surrogates * libguile/srfi-14.c (charsets_complement): skip over surrogates when making a charset complement	2009-09-02 06:28:42 -07:00
Michael Gran	bde543e88b	char-set-filter! does not properly iterate over the charset * libguile/srfi-14.c (scm_char_set_filter_x): iterate over codepoints	2009-09-02 06:28:35 -07:00
Michael Gran	91772d8f8a	ucs-range->char-set should not store surrogates and has off-by-one error * libguile/srfi-14.c (scm_i_ucs_range_to_char_set): new function that contains the functionality of ucs_range_to_char_set, fixes off-by-one, and doesn't store surroges (scm_ucs_range_to_char_set, scm_ucs_range_to_char_set_x): call scm_i_ucs_range_to_char_set (scm_i_charset_set_range): new helper function	2009-09-02 06:28:29 -07:00
Michael Gran	693e72891f	char-set-any improperly unpacks charset data * libguile/srfi-14.c (scm_char_set_any): unpack the charset correctly	2009-09-02 06:28:20 -07:00
Michael Gran	7165abeba8	char-set-xor! should modify the input parameter char-set-xor! was not modifying its input parameter. It isn't technically required to do so by the spec, but, the other similar functions do it. * libguile/srfi-14.c (scm_char_set_xor_x): modify the input parameter	2009-09-02 06:28:11 -07:00
Michael Gran	3f12aedb50	Update docs for Unicode characters * NEWS: add note about Unicode characters * doc/ref/api-data.texi: update Characters subsection * libguile/chars.c: update docstrings to match manual	2009-08-30 16:55:52 -07:00
Michael Gran	5f5920e012	Fix escape sequence normalization for wide strings * libguile/strings.c (scm_to_stringn): convert unistring escapes to guile escapes for both wide and narrow strings	2009-08-30 16:55:17 -07:00
Michael Gran	fac32b518e	Fix encoding errors with strings returned by string ports String ports, being 8-bit, store strings using the character encoding of the port. This fixes a bug where the default character encoding, and not the port's encoding, was being used to convert the string port data back to a string. * libguile/strports.c: extra comments (scm_strport_to_string): use port's encoding when converting port data to a string * libguile/strings.c (scm_i_from_stringn): renamed from scm_from_stringn and made internal. All callers changed. (scm_from_stringn): renamed to scm_i_from_stringn. * libguile/strings.h: declaration for scm_i_from_stringn	2009-08-30 16:54:49 -07:00
Michael Gran	0ffc78e384	Range check octal-escaped characters * libguile/read.c (scm_read_character): range check octal escapes	2009-08-29 07:14:49 -07:00
Michael Gran	24d23822ee	Surrogate characters shouldn't be in charsets * libguile/srfi-14.c (charsets_complement): use surrogate #defines instead of hardcoded numbers * libguile/srfi-14.i.c (cs_full_ranges): remove surrogates from full charset * libguile/unidata_to_charset.pl (full): test for surrogates	2009-08-29 00:01:06 -07:00
Michael Gran	526ee76ac3	Better range check for codepoints * libguile/chars.h (SCM_IS_UNICODE_CHAR): check for negative codepoints	2009-08-29 00:00:58 -07:00
Michael Gran	6d736fdba2	Cast the input to isalpha et al to integer * libguile/gc_os_dep.c (GC_linux_stack_base) [LINUX_STACKBOTTOM]: cast input of ctype functions to int * libguile/inet_aton.c (inet_aton): cast input of ctype functions to int * libguile/read.c (scm_scan_for_encoding): cast input of isalnum to int * libguile/win32-socket.c (scm_i_socket_uncomment): cast input of isspace to int	2009-08-28 21:19:05 -07:00
Andy Wingo	5950cc3fcc	fix case in which compiled path had stale .go, but fallback had fresh .go * libguile/load.c (scm_primitive_load_path): If the compiled path was out of date, but the fallback path was current, we correctly detected that case, but loaded the wrong file. So here fix the typo.	2009-08-28 17:13:09 +02:00
Michael Gran	8736ef70ac	scm_getc improperly handles Latin-1 characters Upper-plane Latin-1 characters should be converted to codepoints. * libguile/ports.c (scm_getc): improper conversion of char to scm_t_wchar	2009-08-27 20:42:36 -07:00
Michael Gran	d5ecf5797d	Fix FUNC_NAME definitions and #endif in srfi-14.[ch] * libguile/srfi-14.c: whitespace and FUNC_NAME fixes * libguile/srfi-14.h: #endif comment	2009-08-27 18:52:53 -07:00
Michael Gran	d0434ddf25	Script to generate srfi-14 charsets from UnicodeData.txt This script was used to generate srfi-14.i.c from the UnicodeData.txt file supplied by ftp://www.unicode.org/Public/UNIDATA/ * libguile/unidata_to_charset.pl	2009-08-27 18:23:46 -07:00
Ludovic Courtès	1505848425	Add missing `FUNC_NAME' definition. * libguile/load.c (scm_sys_warn_autocompilation_enabled): Define `FUNC_NAME'.	2009-08-28 01:16:49 +02:00
Michael Gran	fa316af70f	Default srfi-14 character set information * libguile/srfi-14.i.c: structures containing the default srfi-14 sets	2009-08-27 09:13:22 -07:00
Michael Gran	026ed23911	Always cast input to toupper as int * libguile/read.c (scm_scan_for_encoding): add cast to int	2009-08-27 07:44:18 -07:00
Michael Gran	930ddd34c3	Segfault when writing non-Latin-1 characters under Latin-1 locale * libguile/print.c (iprin1): handle write of non-Latin-1 characters under the Latin-1 locale	2009-08-27 07:44:01 -07:00
Michael Gran	f49dbcadf3	Unicode-capable srfi-14 charsets * libguile/Makefile.am: distribute new files srfi-14.i.c and unidata_to_charset.pl * chars.c (scm_c_upcase, scm_c_downcase): use unicode-enable toupper and tolower * libguile/srfi-14.h (scm_t_char_range, scm_t_char_set): new structures to describe char-sets (scm_t_char_set_cursor): new structure to describe char-set-cursors (SCM_BITS_PER_LONG): removed (SCM_CHARSET_GET): calls function New declarations for scm_i_charset_get, scm_i_charset_set, scm_i_charset_unset, and scm_debug_char_set. * test-suite/tests/srfi-14.test: new tests * libguile/srfi-14.c (SCM_CHARSET_DATA): new macro (SCM_CHARSET_SET, SCM_CHARSET_UNSET): call function (BYTES_PER_CHARSET, LONGS_PER_CHARSET): removed (scm_i_charset_get, scm_i_charset_set, scm_i_charset_unset) (charsets_equal, charsets_leq, charsets_union) (charsets_intersection, charsets_complement, charsets_xor): new functions that are low-level charset operators (charset_print, charset_free): modified for new charset struct (charset_cursor_print, charset_cursor_free): new function (make_char_set, scm_char_set_p, scm_char_set_eq, scm_car_set_leq) (scm_char_set_hash, scm_char_set_cursor, scm_char_set_ref) (scm_char_set_cursor_next, scm_end_of_char_set_p, scm_char_set_fold) (scm_char_set_unfold, scm_char_set_unfold_x, scm_char_set_for_each) (scm_char_set_map, scm_char_set_copy, scm_char_set, scm_list_to_char_set) (scm_list_to_char_set_x, scm_string_to_char_set, scm_string_to_char_set_x) (scm_char_set_filter, scm_char_set_filter_x, scm_ucs_range_to_char_set) (scm_ucs_range_to_char_set_x, scm_to_char_set, scm_char_set_size) (scm_char_set_count, scm_char_set_to_list, scm_char_set_to_string) (scm_char_set_contains_p, scm_char_set_every, scm_char_set_any) (scm_char_set_adjoin, scm_char_set_delete, scm_char_set_adjoin_x) (scm_char_set_delete_x, scm_char_set_complement, scm_char_set_union) (scm_char_set_intersection, scm_char_set_difference, scm_char_set_xor) (scm_char_set_diff_plus_intersection, scm_char_set_complement_x) (scm_char_set_union_x, scm_char_set_intersection_x, scm_char_set_difference_x) (scm_char_set_xor_x, scm_char_set_diff_plus_intersection_x): modified to use new charset and charset-cursor data structures (CSET_BLANK_PRED, CSET_SYMBOL_PRED, CSET_PUNCT_PRED, CSET_LOWER_PRED) (CSET_UPPER_PRED, CSET_LETTER_PRED, CSET_DIGIT_PRED, CSET_WHITESPACE_PRED) (CSET_CONTROL_PRED, CSET_HEX_DIGIT_PRED, CSET_ASCII_PRED, CSET_LETTER_PRED) (CSET_LETTER_AND_DIGIT_PRED, CSET_PRINTING_PRED, CSET_TRUE_PRED) (CSET_FALSE_PRED): removed (scm_srfi_14_compute_char_sets): removed - too slow to iterate over all of unicode at startup (scm_debug_char_set) [SCM_CHARSET_DEBUG]: new function	2009-08-27 07:43:33 -07:00
Ken Raeburn	71a5964c11	Don't leave and reenter guile mode if mutex is available On Aug 5, 2009, at 10:06, Ken Raeburn wrote: > (1) In scm_pthread_mutex_lock, we leave and re-enter guile mode so > that we don't block the thread while in guile mode. But we could > use pthread_mutex_trylock first, and avoid the costs scm_leave_guile > seems to incur on the Mac. If we can't acquire the lock, it should > return immediately, and then we can do the expensive, blocking > version. A quick, hack version of this changed my run time for > A(3,8) from 17.5s to 14.5s, saving about 17%; sigaltstack and > sigprocmask are still in the picture, because they're called from > scm_catch_with_pre_unwind_handler. I'll work up a nicer patch > later. Ah, we already had scm_i_pthread_mutex_trylock lying around; that made things easy. A second timing test with A(3,9) and this version of the patch (based on 1.9.1) shows the same improvement. * libguile/threads.c (scm_pthread_mutex_lock): Try the mutex before leaving and reentering guile mode.	2009-08-26 23:36:19 +01:00
Andy Wingo	4769c9db2c	fix uninitialized variable in scm_read_character * libguile/read.c (scm_read_character): Fix uninitialized variable.	2009-08-26 13:15:07 +02:00
Andy Wingo	c6a1380bde	Merge commit 'origin/master' Conflicts: libguile/unif.c	2009-08-25 21:43:00 +02:00
Andy Wingo	108e18b18a	Merge wip-array refactor, up to `cd43fdc5b7` Conflicts: NEWS libguile/print.c	2009-08-25 18:04:02 +02:00
Michael Gran	889975e51a	Add full Unicode capability to ports and the default reader Ports are given two additional properties: a character encoding and a conversion failure strategy. These properties have getters and setters. The new properties are used to convert any locale text to/from the internal representation of strings. If unspecified, ports use a default value. The default value of these properties is held in a fluid. The default character encoding can be modified by calling setlocale. ISO-8859-1 is treated specially. Since it is a native encoding of strings, it can be processed more quickly. Source code is assumed to be ISO-8859-1 unless otherwise specified. The encoding of a source code file can be given as 'coding: XXXXX' in a magic comment at the top of a file. The C functions that deal with encoding often use a null pointer as shorthand for the native Latin-1 encoding, for efficiency's sake. * test-suite/tests/encoding-iso88591.test: new tests * test-suite/tests/encoding-iso88597.test: new tests * test-suite/tests/encoding-utf8.test: new tests * test-suite/tests/encoding-escapes.test: new tests * test-suite/tests/numbers.test: declare 'binary' encoding * test-suite/tests/ports.test: declare 'binary' encoding * test-suite/tests/r6rs-ports.test: declare 'binary' encoding * module/system/base/compile.scm (compile-file): use source-code file's self-declared encoding when compiling files * libguile/strports.c: store string ports in locale encoding (scm_strport_to_locale_u8vector, scm_call_with_output_locale_u8vector) (scm_open_input_locale_u8vector, scm_get_output_locale_u8vector): new functions * libguile/strings.h: new declaration for scm_i_string_contains_char * libguile/strings.c (scm_i_string_contains_char): new function (scm_from_stringn, scm_to_stringn): use NULL for Latin-1 (scm_from_locale_stringn, scm_to_locale_stringn): respect character encoding of input and output ports * libguile/read.h: declaration for scm_scan_for_encoding * libguile/read.c: (read_token): now takes scheme string instead of C string/length (read_complete_token): new function (scm_read_sexp, scm_read_number, scm_read_mixed_case_symbol) (scm_read_number_and_radix, scm_read_quote, scm_read_semicolon_comment) (scm_read_srfi4_vector, scm_read_bytevector, scm_read_guile_bit_vector) (scm_read_scsh_block_comment, scm_read_commented_expression) (scm_read_extended_symbol, scm_read_sharp_extension, scm_read_shart) (scm_read_expression): use scm_t_wchar for char type, use read_complete_token (scm_scan_for_encoding): new function to find a file's character encoding (scm_file_encoding): new function to find a port's character encoding * libguile/rdelim.c: don't unpack strings * libguile/print.h: declaration for modified function scm_i_charprint * libguile/print.c: use locale when printing characters and strings (scm_i_charprint): input parameter is now scm_t_wchar (scm_simple_format): don't unpack strings * libguile/posix.h: new declaration for scm_setbinary. * libguile/posix.c (scm_setlocale): set default and stdio port encodings based on the locale's character encoding (scm_setbinary): new function * libguile/ports.h (scm_t_port): add encoding and failed conversion handler to port type. Declarations for new or modified functions scm_getc, scm_unget_byte, scm_ungetc, scm_i_get_port_encoding, scm_i_set_port_encoding_x, scm_port_encoding, scm_set_port_encoding_x, scm_i_get_conversion_strategy, scm_i_set_conversion_strategy_x, scm_port_conversion_strategy, scm_set_port_conversion_strategy_x. * libguile/ports.c: assign the current ports to zero on startup so we can see if they've been set. (scm_current_input_port, scm_current_output_port, scm_current_error_port): return #f if the port is not yet initialized (scm_new_port_table_entry): set up a new port's encoding and illegal sequence handler based on the thread's current defaults (scm_i_remove_port): free port encoding name when port is removed (scm_i_mode_bits_n): now takes a scheme string instead of a c string and length. All callers changed. (SCM_MBCHAR_BUF_SIZE): new const (scm_getc): new function, since the scm_getc in inline.h is now scm_get_byte_or_eof. This pulls one codepoint from a port. (scm_lfwrite_substr, scm_lfwrite_str): now uses port's encoding (scm_unget_byte): new function, incorportaing the low-level functionality of scm_ungetc (scm_ungetc): uses scm_unget_byte * libguile/numbers.h (scm_t_wchar): compilation order problem with scm_t_wchar being use in functions in multiple headers. Forward declare scm_t_wchar. * libguile/load.c (scm_primitive_load): scan for file encoding at top of file and use it to set the load port's encoding * libguile/inline.h (scm_get_byte_or_eof): new function incorporating most of the functionality of scm_getc. * libguile/fports.c (fport_fill_input): now returns scm_t_wchar * libguile/chars.h (scm_t_wchar): avoid compilation order problem with declaration of scm_t_wchar	2009-08-25 07:54:37 -07:00
Michael Gran	9db8cf1634	Avoid unpacking symbols in GOOPS * libguile/goops.c (scm_make_extended_class_from_symbol): new function (scm_class_of): don't unpack symbol chars (wrap_init): don't unpack symbol chars (make_class_from_symbol): new function (make_struct_class): don't unpack symbol chars	2009-08-23 10:40:44 -07:00
Michael Gran	587a33556f	Modify socket and time functions for wide strings * libguile/socket.c (scm_recv): receive the message without holding the stringbuf writing lock (scm_send): try to narrow a string before using it * libguile/stime.c (strftime): convert string to UTF-8 so that it can be safely passed to strftime (strptime): convert input string to UTF-8 so that it can be safely passed through strptime * libguile/strings.c (narrow_stringbuf): new function (scm_i_try_narrow_string): new function * libguile/strings.h: new declaration for scm_i_try_narrow_string	2009-08-23 09:29:45 -07:00
Michael Gran	27646f414e	Use string and symbol accessors in struct, throw, and array funcs * libguile/struct.c (scm_make_struct_layout, scm_struct_init) (scm_struct_vtable_p, scm_struct_ref, scm_struct_set_x): use string and symbol accessors and avoid unpacking strings and symbols * libguile/throw.c (scm_ithrow): allow wide symbols in the error message * libguile/unif.c (scm_enclose_array, scm_istr2bve): use string accessors and avoid unpacking strings	2009-08-23 09:29:16 -07:00
Michael Gran	806f1ded95	Avoid type-punning warning in scm_gentemp Int and size_t may not have the same storage size * libguile/deprecated.c (scm_gentemp): use size_t for string length	2009-08-23 09:28:58 -07:00
Neil Jerram	a4dbe1ac3d	Avoid clash with system setjmp/longjmp on IA64 Problem was that if an application includes both libguile.h and the system's setjmp.h, and is compiled on IA64, it gets compile errors because of jmp_buf, setjmp and longjmp being multiply defined. * libguile/__scm.h (__ia64__): Define scm_i_jmp_buf, SCM_I_SETJMP and SCM_I_LONGJMP instead of jmp_buf, setjmp and longjmp. (all other platforms): Map scm_i_jmp_buf, SCM_I_SETJMP and SCM_I_LONGJMP to jmp_buf, setjmp and longjmp. * libguile/continuations.c (scm_make_continuation): Use `SCM_I_SETJMP' instead of `setjmp'. (copy_stack_and_call): Use `SCM_I_LONJMP' instead of `longjmp'. (scm_ia64_longjmp): Use type `scm_i_jmp_buf' instead of `jmp_buf'. * libguile/continuations.h (scm_t_contregs): Use type `scm_i_jmp_buf' instead of `jmp_buf'. * libguile/threads.c (suspend): Use `SCM_I_SETJMP' instead of `setjmp'. * libguile/threads.h (scm_i_thread): Use type `scm_i_jmp_buf' instead of `jmp_buf'. * libguile/throw.c (JBJMPBUF, make_jmpbuf, jmp_buf_and_retval): Use type `scm_i_jmp_buf' instead of `jmp_buf'. (scm_c_catch): Use `SCM_I_SETJMP' instead of `setjmp'. (scm_ithrow): Use `SCM_I_LONGJMP' instead of `longjmp'.	2009-08-21 23:29:08 +01:00
Neil Jerram	d5ed380ec8	Remove trailing whitespace	2009-08-21 23:25:28 +01:00
Neil Jerram	1b872adf2e	Fix set-source-properties so that the special source properties work * libguile/srcprop.c (scm_set_source_properties_x): Look for the special source properties, save them off, and then construct a srcprops object using them.	2009-08-21 23:25:27 +01:00
Neil Jerram	67a967348a	In srcprop.c change all occurrences of "plist" to "alist" As with the previous commit, this is to avoid any suggestion that the source properties API uses the property list format, i.e. (key1 value1 key2 value2 ...). Also remove scm_srcprops_to_plist () from the API. It doesn't have any external usefulness and has never documented.	2009-08-21 23:25:26 +01:00
Michael Gran	cdf8f9e632	Use uc_tolower in number conversion * libguile/numbers.c (XDIGIT2UINT): use uc_tolower	2009-08-21 09:30:53 -07:00
Michael Gran	3f47e52621	Use string accessors for string->number conversion * libguile/numbers.c (scm_i_print_fraction): use string accessors (XDIGIT2UINT): use libunistring function (mem2uinteger, mem2integer, mem2decimal_from_point, mem2ureal) (mem2complex): take scheme string instead of c string; use accessors (scm_i_string_to_number): new function (scm_c_locale_string_to_number): use scm_i_string_to_number * libguile/numbers.h: declaration for scm_i_string_to_number * libguile/strings.c (scm_i_string_strcmp): new function * libguile/strings.h: declaration for scm_i_string_strcmp	2009-08-21 09:18:30 -07:00
Michael Gran	e23106d53e	Add initial support for wide symbols * libguile/hash.c (scm_i_string_hash): new function (scm_hasher): don't unpack string: use scm_i_string_hash * libguile/hash.h: new declaration for scm_i_string_hash * libguile/print.c (quote_keywordish_symbol): use symbol accessors (scm_i_print_symbol_name): new function (scm_print_symbol_name): call scm_i_print_symbol_name (iprin1): use scm_i_print_symbol_name to print symbols * libguile/print.h: new declaration for scm_i_print_symbol_name * libguile/symbols.c (lookup_interned_symbol): now takes scheme string instead of c string; callers changed (lookup_interned_symbol): add wide symbol support (scm_i_c_mem2symbol): removed (scm_i_mem2symbol): removed and replaced with scm_i_str2symbol (scm_i_str2symbol): new function (scm_i_mem2uninterned_symbol): removed and replaced with scm_i_str2uninterned_symbol (scm_i_str2uninterned_symbol): new function (scm_make_symbol, scm_string_to_symbol, scm_from_locale_symbol) (scm_from_locale_symboln): use scm_i_str2symbol * test-suite/tests/symbols.test: new tests	2009-08-21 08:57:35 -07:00
Michael Gran	90305ce9e4	Use symbol accessors in scm_gc_mark_dependencies * libguile/gc-mark.c (scm_gc_mark_dependencies): don't unpack symbols. Use symbol accessors.	2009-08-20 22:41:12 -07:00
Michael Gran	832bbc95a2	Use string accessors in scm_basename and scm_dirname * libguile/filesys.c (basename, dirname): don't unpack strings. Use string accessor functions.	2009-08-20 22:40:15 -07:00
Michael Gran	43d5626ce7	Avoid type-limits warning in SCM_TO_TYPE_PROTO * libguile/conv-uinteger.i.c (SCM_TO_TYPE_PROTO): avoid a comparison that is always true due to the limited range of the data type	2009-08-20 21:40:03 -07:00
Michael Gran	68a30f5730	Type-limits error in GC environment initialization * libguile/gc-malloc.c (scm_gc_init_malloc): GUILE_INIT_MALLOC_LIMIT is cast to unsigned then tested as if it were still signed	2009-08-20 21:40:00 -07:00

1 2 3 4 5 ...

5817 commits