guile

mirror of https://git.savannah.gnu.org/git/guile.git synced 2025-07-13 20:50:25 +02:00

Author	SHA1	Message	Date
Ludovic Courtès	190d4b0d93	Make VM string literals immutable. * libguile/strings.c (scm_i_make_string, scm_i_make_wide_string): Add `read_only_p' parameter. All callers updated. * libguile/vm-i-loader.c (load_string, load_wide_string): Push read-only strings. * test-suite/tests/strings.test ("literals"): New test prefix.	2011-03-20 23:34:42 +01:00
Andy Wingo	d900843c72	fix encoding scanning for non-seekable ports * libguile/read.c (scm_i_scan_for_encoding): If possible, just use the read buffer for the encoding scan, and avoid seeking. Fixes `(open-input-file "/dev/urandom")', because /dev/urandom can't be seeked backwards.	2011-03-03 12:57:46 +01:00
Ludovic Courtès	58b1db5f24	Have `read' update line/column numbers when reading SCSH block comments. * libguile/read.c (scm_read_scsh_block_comment): Use `scm_getc' instead of `scm_get_byte_or_eof'. * test-suite/tests/reader.test ("read-options")["position of SCSH block comment"]: New test.	2011-02-28 23:33:47 +01:00
Andy Wingo	6c51a40ace	read-enable 'positions by default * libguile/read.c (scm_read_opts): Default "positions" to #t. The compiler was already turning it on anyway, and this allows primitive-load without --auto-compile to also propagate source information through the expander, for better errors and to let macros know their source. * module/language/scheme/spec.scm: No need to enable positions here now.	2011-02-13 15:06:11 +01:00
Ralf Wildenhues	ffb62a43dc	fix typos in the manual bits generated from source comments. * libguile/bitvectors.c, libguile/chars.c, libguile/deprecated.c, libguile/numbers.c, libguile/random.c, libguile/read.c, libguile/root.c, libguile/srfi-1.c, libguile/srfi-13.c, libguile/srfi-14.c, libguile/uniform.c: Fix typos, add missing newlines.	2011-02-09 22:28:49 +00:00
Andy Wingo	684d664e39	implement r6rs hungry escaped EOL * libguile/private-options.h (SCM_HUNGRY_EOL_ESCAPES_P): New private option. * libguile/read.c: Define SCM_HUNGRY_EOL_ESCAPES_P, defaulting to #f. (skip_intraline_whitespace): New helper. (scm_read_string): If SCM_HUNGRY_EOL_ESCAPES_P, skip_intraline_whitespace after an escaped EOL. * test-suite/tests/reader.test ("read-options"): Add test.	2011-01-21 09:24:32 +01:00
Andy Wingo	4a655e50a3	use scm_from_latin1_symboln for string literals and load-symbol * libguile/bytevectors.c: * libguile/eval.c: * libguile/goops.c: * libguile/i18n.c: * libguile/load.c: * libguile/memoize.c: * libguile/modules.c: * libguile/ports.c: * libguile/print.c: * libguile/procs.c: * libguile/programs.c: * libguile/read.c: * libguile/script.c: * libguile/srfi-14.c: * libguile/stacks.c: * libguile/strings.c: * libguile/throw.c: * libguile/vm.c: Use scm_from_latin1_symboln to make symbols from string literals, because they aren't in the user's locale -- they are in ASCII, and we can optimize this case. * libguile/vm-i-loader.c: Also use scm_from_latin1_symboln when loading narrow symbols.	2011-01-07 09:18:41 -08:00
Andy Wingo	e25f37271a	fix a number of assuptions that a long could hold an inum * libguile/bytevectors.c: * libguile/goops.c: * libguile/instructions.c: * libguile/numbers.c: * libguile/random.c: * libguile/read.c: * libguile/vm-i-scheme.c: Fix a number of assumptions that a long could hold an inum. This is not the case on platforms whose void* is larger than their long. * libguile/numbers.c (scm_i_inum2big): New helper, only implemented for sizeof(void*) == sizeof(long); produces a compile error on other platforms. Basically gmp doesn't have a nice interface for converting between mpz values and intmax_t.	2010-11-19 15:22:43 +01:00
Andy Wingo	3d27ef4bd3	fix a number of assumptions that a pointer could fit into a long * libguile/debug.c: * libguile/eval.c: * libguile/frames.c: * libguile/objcodes.c: * libguile/print.c: * libguile/programs.c: * libguile/read.c: * libguile/struct.c: * libguile/vm.c: Fix a number of instances in which we assumed we could fit a pointer into a long.	2010-11-19 15:22:43 +01:00
Andy Wingo	190fa72a8f	NUL vs NULL fix * libguile/read.c (scm_i_scan_for_encoding): Fix NUL rather than NULL.	2010-11-12 17:18:08 +01:00
Julian Graham	f25e1b6713	Fix buffer over-read in port encoding scan. * libguile/read.c (scm_i_scan_for_encoding): Add a NULL terminator to the end of header to prevent over-read by subsequent call to strstr.	2010-11-12 08:45:09 -05:00
Michael Gran	a4e4722944	need read error for extra closing square brackets * libguile/read.c (scm_read_expression): add test	2010-11-04 22:07:50 -07:00
Andreas Rottmann	d458073bc0	Use a fluid for the list of the reader's "hash procedures" This allows customizing the reader behavior for a dynamic extent more easily. * libguile/read.c (scm_read_hash_procedures): Renamed to `scm_i_read_hash_procedures'. (scm_i_read_hash_procedures_ref, scm_i_read_hash_procedures_set_x): New (internal) accessor functions for the fluid. (scm_read_hash_extend, scm_get_hash_procedure): Use these accessor functions. (scm_init_read): Create the fluid, named `%read-hash-procedures' instead of the previous plain list `read-hash-procedures'. * test-suite/tests/reader.test: Adapt the "R6RS/SRFI-30 block comment syntax overridden" test to make use of the fluid. * module/ice-9/deprecated.scm (read-hash-procedures): New identifier macro -- backward-compatibility shim. Signed-off-by: Ludovic Courtès <ludo@gnu.org>	2010-11-03 00:09:57 +01:00
Andy Wingo	c1a0ba1cef	uninitialized var in scm_read_character * libguile/read.c (scm_read_character): Fix error condition where charname could be uninitialized.	2010-10-18 13:29:58 +02:00
Andy Wingo	24259edb8b	remove elisp-strings and elisp-vectors read options * libguile/private-options.h (SCM_ELISP_VECTORS_P, SCM_ESCAPED_PARENS_P): * libguile/read.c (scm_read_opts): Remove unused elisp-vectors option, and the elisp-strings option (which allowed \( and \) escapes in strings). (scm_read_string): Remove the elisp-strings case. * doc/ref/api-options.texi (Reader options): Update, and update wording of the case-insensitive bit.	2010-10-01 10:20:54 +02:00
Michael Gran	0f3a70cfa8	Enable character hex escapes by default R6RS character hex escapes do not conflict with legacy Guile octal character escapes, so they can be enabled by default. * libguile/read.c (scm_read_character): modified * test-suite/tests/reader.test: modify character escape tests * doc/ref/api-data.texi: modified * doc/ref/api-options.texi: modified	2010-07-17 04:16:57 -07:00
Michael Gran	daedbca7de	More explicit variable names in scm_i_scan_for_encoding Note especially that the variable 'i' has two different uses in this function, and they get confused. * libguile/read.c (scm_i_scan_for_encoding): cleanup	2010-07-16 05:39:52 -07:00
Andy Wingo	5b69315ed3	fix '(] infinite loop * libguile/read.c (scm_read_sexp): Fix reader infinite loop. Thanks to Bill Schottstaedt for the report. * test-suite/tests/reader.test: Add test.	2010-07-13 21:53:41 +02:00
Julian Graham	911b03b20c	Support for the #!r6rs lexeme. * libguile/read.c (scm_read_shebang): New function; (scm_read_sharp): Call scm_read_shebang on '!', which delegates to scm_read_scsh_block_comment as necessary. * test-suite/tests/reader.test ("R6RS lexeme comment", "partial R6RS lexeme comment"): New tests.	2010-05-27 09:21:18 -04:00
Andy Wingo	7c4aad9cc7	add read syntax for #nil * libguile/evalext.c (scm_self_evaluating_p): #nil is self-evaluating. * libguile/read.c (scm_read_nil, scm_read_sharp): Add read syntax for #nil.	2010-04-09 14:15:16 +02:00
Andy Wingo	c1b7c940ec	lisp nil always enabled * configure.ac: Remove --disable-elisp option. Lisp nil is always enabled. * libguile/boolean.h: * libguile/gen-scmconfig.c: * libguile/gen-scmconfig.h.in: * libguile/init.c: * libguile/lang.c: * libguile/lang.h: * libguile/pairs.h: * libguile/private-options.h: * libguile/read.c: Remove conditionals for disabling elisp.	2010-04-09 14:03:02 +02:00
Michael Gran	8a8da78d97	Faster read of semicolon comments There is no need to do character encoding processing within semicolon comments. * libguile/read.c (scm_read_semicolon_comment): changed	2010-02-15 20:45:58 -08:00
Michael Gran	69f90b0b05	Optimize reader by preferring stack-allocated buffers * libguile/read.c (read_token): now takes a C buffer instead of a SCM. string. All callers changed. (read_complete_token): now takes C buffers, not SCM strings. No longer does port position updates or encoding processing. All callers changed. (scm_read_number, scm_read_mixed_case_symbol, scm_read_number_and_radix) (scm_read_character): Do port updates and string processing no longer done by read_complete_token. Some reordering for optimization.	2010-02-02 20:33:41 -08:00
Andy Wingo	5afa815c9c	add reader option for parsing [] as (). * libguile/private-options.h: * libguile/read.c (scm_read_opts, SCM_SQUARE_BRACKETS_P): Add an option for treating [ and ] as parentheses, on by default. Note that this makes them delimiters also, so [ and ] cannot appear in a symbol name, with this read option on. (scm_read_sexp): If we start with [, we end with ]. (scm_read_expression): Add case for [.	2010-01-15 22:31:23 +01:00
Andy Wingo	a8fc38526a	remove unused var in read.c * libguile/read.c (scm_read_character): Remove unused var.	2010-01-13 21:08:54 +01:00
Michael Gran	898a0b5a2e	Disable \u and \U escapes when r6rs-hex-escapes enabled When the reader option 'r6rs-hex-escapes is enabled, the \uNNNN and \UNNNNNN string escape sequences should be disabled. * libguile/read.c (scm_read_string): added checks for SCM_R6RS_ESCAPES_P	2010-01-13 07:02:07 -08:00
Michael Gran	dea901d66e	Reader option for R6RS hex escapes This adds a reader option 'r6rs-hex-escapes that modifies the behavior of numeric escapes in characters and strings. When enabled, variable-length character hex escapes (#\xNNN) are allowed and become the default output format for numerically-escaped characters. Also, string hex escapes switch to a semicolon terminated hex escape (\xNNNN;). * libguile/print.c (PRINT_CHAR_ESCAPE): new macro (iprin1): use new macro PRINT_CHAR_ESCAPE * libguile/private-options.h (SCM_R6RS_ESCAPES_P): new #define * libguile/read.c (scm_read_opts): add new option r6rs-hex-escapes (SCM_READ_HEX_ESCAPE): modify to take a terminator parameter (scm_read_string): parse R6RS hex string escapes (scm_read_character): parse R6RS hex character escapes * test-suite/tests/chars.test (with-read-options): new procedure (R6RS hex escapes): new tests * test-suite/tests/strings.test (with-read-options): new procedure (R6RS hex escapes): new tests	2010-01-12 21:02:41 -08:00
Michael Gran	c5661d2860	Refactor repeated code in scm_read_string * libguile/read.c (SCM_READ_HEX_ESCAPE): new macro (scm_read_string): use new macro SCM_READ_HEX_ESCAPE	2010-01-10 18:24:23 -08:00
Michael Gran	67a4a16d8e	Add R6RS backspace string escape R6RS suggests that '\b' should be a string escape for the backspace character. * libguile/read.c (scm_read_string): parse backspace escape * test-suite/tests/strings.test (R6RS backslash escapes): new test (Guile extensions backslash escapes): remove R6RS escapes from test. * doc/ref/api-data.texi (Strings): document new string escape	2010-01-10 15:41:37 -08:00
Andy Wingo	b521d265b1	Fix bugs reading long tokens The commit "don't take string-write mutex in read.c:read_token", from `8b0d7b9d94`, had a number of bugs. Not sure how I missed these before. * libguile/read.c (read_token): Remove a couple of bogus scm_i_string_stop_writing () calls, now that we no longer take the string-write mutex. (read_complete_token): read_token really needs a fresh buffer, which was not the case when we are reading long tokens and thus hit the overflow case. Fixes fractions.test.	2009-12-28 17:35:48 +01:00
Andy Wingo	8b0d7b9d94	don't take string-write mutex in read.c:read_token * libguile/read.c (read_token): Don't take the string-write mutex when reading a token into a buffer, because it's assumed that the buffer is fresh (not seen by other threads), and a soft port can call a procedure that needs the string-write mutex.	2009-12-21 22:12:31 +01:00
Ken Raeburn	6751d6db6b	Allow more characters in coding system names in Emacs-style declarations. * libguile/read.c (scm_i_scan_for_encoding): Allow more punctuation symbols in coding system names.	2009-12-17 20:02:35 -05:00
Ludovic Courtès	62316c7f81	Avoid `SCM_UNPACK ()' in constant expressions. They made Sun C 5.8 emit a warning such as: line 71: warning: dead part of constant expression is nonconstant * libguile/print.c (scm_print_opts): Don't use `SCM_UNPACK ()' here. * libguile/read.c (scm_read_opts): Likewise.	2009-12-15 01:01:17 +01:00
Ludovic Courtès	cd169c5a22	Remove uses of the non-standard `__FUNCTION__'. * libguile/gc.c (scm_gc_sweep): Replace `__FUNCTION__' by `FUNC_NAME'. * libguile/read.c (scm_read_r6rs_block_comment): Likewise.	2009-12-15 01:01:16 +01:00
Ludovic Courtès	49bb5bd307	Disable encoding scanning on non-seekable file ports. * libguile/read.c (scm_i_scan_for_encoding): Don't attempt to scan non-seekable file ports.	2009-11-27 17:00:51 +01:00
Ludovic Courtès	a270e133f3	Correct manual wrt. encoding names. * doc/ref/api-evaluation.texi (Character Encoding of Source Files): Don't suggest `latin1' as a good encoding name since Emacs cannot deal with it. * libguile/read.c (scm_file_encoding): Fix "Emacs" spelling.	2009-11-23 23:51:02 +01:00
Ludovic Courtès	7f991c7d32	Fix C99-style declarations after statements. * libguile/eval.i.c (ceval): Move declarations before statements. * libguile/read.c (scm_read_extended_symbol): Likewise. * libguile/struct.c (scm_make_struct_layout): Likewise. * libguile/threads.c (fat_mutex_unlock): Likewise. * libguile/vm-i-system.c (br_if_nargs_ne, br_if_nargs_lt): Likewise. * libguile/vm.c (make_vm): Likewise.	2009-11-17 23:13:58 +01:00
Ken Raeburn	b9fd657add	Fix off-by-one error in processing Emacs-style coding declaration. * libguile/read.c (scm_i_scan_for_encoding): Don't copy the first character after the coding declaration into the new string.	2009-11-16 01:10:15 -05:00
Ludovic Courtès	f8a1c9a859	Have `scm_scan_for_encoding ()' use GC-managed memory. * libguile/read.c (scm_scan_for_encoding): Rename to ... (scm_i_scan_for_encoding): ... this; update callers. Use `scm_gc_strndup ()' instead of `scm_malloc ()'. * libguile/read.h: Update accordingly. * libguile/load.c (scm_primitive_load): Don't call free(3) on the value returned by `scm_i_scan_for_encoding ()'.	2009-11-14 16:59:25 +01:00
Ludovic Courtès	620c89651a	Add support for R6RS/SRFI-30 nested block comments. Suggested by Andreas Rottmann <a.rottmann@gmx.at>. * libguile/read.c (flush_ws, scm_read_sharp): Add support for R6RS/SRFI-30 block comments. (scm_read_r6rs_block_comment): New function. * test-suite/tests/reader.test (exception:unterminated-block-comment): Adjust to match both block comment styles. ("reading")["R6RS/SRFI-30 block comment", "R6RS/SRFI-30 nested block comment", "R6RS/SRFI-30 block comment syntax overridden"]: New tests. ("exceptions")["R6RS/SRFI-30 unterminated nested block comment"]: New test. * doc/ref/api-evaluation.texi (Block Comments): Mention SRFI-30/R6RS block comments. * doc/ref/srfi-modules.texi (SRFI-30): New node.	2009-10-19 22:40:01 +02:00
Michael Gran	060e305adc	Avoid string buffer overrun in scm_scan_for_encoding * libguile/read.c (scm_scan_for_encoding): possible overrun if coding declaration is at end of file	2009-09-05 11:10:07 -07:00
Michael Gran	0dcd7e6153	Modify read and print of combining characters Since combining characters, such as accents, modify the appearance of the previous letter, it looks awkward in its character literal form (#\name) since it modified the backslash. This instead prints the combining character on a small circle. * libguile/chars.h (SCM_CODEPOINT_DOTTED_CIRCLE): new #define * libguile/print.c (iprint1): print combining characters on dotted circles * libguile/read.c (scm_read_character): parse the combination of combining characters and dotted circles	2009-09-03 07:47:26 -07:00
Michael Gran	0ffc78e384	Range check octal-escaped characters * libguile/read.c (scm_read_character): range check octal escapes	2009-08-29 07:14:49 -07:00
Michael Gran	6d736fdba2	Cast the input to isalpha et al to integer * libguile/gc_os_dep.c (GC_linux_stack_base) [LINUX_STACKBOTTOM]: cast input of ctype functions to int * libguile/inet_aton.c (inet_aton): cast input of ctype functions to int * libguile/read.c (scm_scan_for_encoding): cast input of isalnum to int * libguile/win32-socket.c (scm_i_socket_uncomment): cast input of isspace to int	2009-08-28 21:19:05 -07:00
Michael Gran	026ed23911	Always cast input to toupper as int * libguile/read.c (scm_scan_for_encoding): add cast to int	2009-08-27 07:44:18 -07:00
Andy Wingo	4769c9db2c	fix uninitialized variable in scm_read_character * libguile/read.c (scm_read_character): Fix uninitialized variable.	2009-08-26 13:15:07 +02:00
Andy Wingo	c6a1380bde	Merge commit 'origin/master' Conflicts: libguile/unif.c	2009-08-25 21:43:00 +02:00
Andy Wingo	108e18b18a	Merge wip-array refactor, up to `cd43fdc5b7` Conflicts: NEWS libguile/print.c	2009-08-25 18:04:02 +02:00
Michael Gran	889975e51a	Add full Unicode capability to ports and the default reader Ports are given two additional properties: a character encoding and a conversion failure strategy. These properties have getters and setters. The new properties are used to convert any locale text to/from the internal representation of strings. If unspecified, ports use a default value. The default value of these properties is held in a fluid. The default character encoding can be modified by calling setlocale. ISO-8859-1 is treated specially. Since it is a native encoding of strings, it can be processed more quickly. Source code is assumed to be ISO-8859-1 unless otherwise specified. The encoding of a source code file can be given as 'coding: XXXXX' in a magic comment at the top of a file. The C functions that deal with encoding often use a null pointer as shorthand for the native Latin-1 encoding, for efficiency's sake. * test-suite/tests/encoding-iso88591.test: new tests * test-suite/tests/encoding-iso88597.test: new tests * test-suite/tests/encoding-utf8.test: new tests * test-suite/tests/encoding-escapes.test: new tests * test-suite/tests/numbers.test: declare 'binary' encoding * test-suite/tests/ports.test: declare 'binary' encoding * test-suite/tests/r6rs-ports.test: declare 'binary' encoding * module/system/base/compile.scm (compile-file): use source-code file's self-declared encoding when compiling files * libguile/strports.c: store string ports in locale encoding (scm_strport_to_locale_u8vector, scm_call_with_output_locale_u8vector) (scm_open_input_locale_u8vector, scm_get_output_locale_u8vector): new functions * libguile/strings.h: new declaration for scm_i_string_contains_char * libguile/strings.c (scm_i_string_contains_char): new function (scm_from_stringn, scm_to_stringn): use NULL for Latin-1 (scm_from_locale_stringn, scm_to_locale_stringn): respect character encoding of input and output ports * libguile/read.h: declaration for scm_scan_for_encoding * libguile/read.c: (read_token): now takes scheme string instead of C string/length (read_complete_token): new function (scm_read_sexp, scm_read_number, scm_read_mixed_case_symbol) (scm_read_number_and_radix, scm_read_quote, scm_read_semicolon_comment) (scm_read_srfi4_vector, scm_read_bytevector, scm_read_guile_bit_vector) (scm_read_scsh_block_comment, scm_read_commented_expression) (scm_read_extended_symbol, scm_read_sharp_extension, scm_read_shart) (scm_read_expression): use scm_t_wchar for char type, use read_complete_token (scm_scan_for_encoding): new function to find a file's character encoding (scm_file_encoding): new function to find a port's character encoding * libguile/rdelim.c: don't unpack strings * libguile/print.h: declaration for modified function scm_i_charprint * libguile/print.c: use locale when printing characters and strings (scm_i_charprint): input parameter is now scm_t_wchar (scm_simple_format): don't unpack strings * libguile/posix.h: new declaration for scm_setbinary. * libguile/posix.c (scm_setlocale): set default and stdio port encodings based on the locale's character encoding (scm_setbinary): new function * libguile/ports.h (scm_t_port): add encoding and failed conversion handler to port type. Declarations for new or modified functions scm_getc, scm_unget_byte, scm_ungetc, scm_i_get_port_encoding, scm_i_set_port_encoding_x, scm_port_encoding, scm_set_port_encoding_x, scm_i_get_conversion_strategy, scm_i_set_conversion_strategy_x, scm_port_conversion_strategy, scm_set_port_conversion_strategy_x. * libguile/ports.c: assign the current ports to zero on startup so we can see if they've been set. (scm_current_input_port, scm_current_output_port, scm_current_error_port): return #f if the port is not yet initialized (scm_new_port_table_entry): set up a new port's encoding and illegal sequence handler based on the thread's current defaults (scm_i_remove_port): free port encoding name when port is removed (scm_i_mode_bits_n): now takes a scheme string instead of a c string and length. All callers changed. (SCM_MBCHAR_BUF_SIZE): new const (scm_getc): new function, since the scm_getc in inline.h is now scm_get_byte_or_eof. This pulls one codepoint from a port. (scm_lfwrite_substr, scm_lfwrite_str): now uses port's encoding (scm_unget_byte): new function, incorportaing the low-level functionality of scm_ungetc (scm_ungetc): uses scm_unget_byte * libguile/numbers.h (scm_t_wchar): compilation order problem with scm_t_wchar being use in functions in multiple headers. Forward declare scm_t_wchar. * libguile/load.c (scm_primitive_load): scan for file encoding at top of file and use it to set the load port's encoding * libguile/inline.h (scm_get_byte_or_eof): new function incorporating most of the functionality of scm_getc. * libguile/fports.c (fport_fill_input): now returns scm_t_wchar * libguile/chars.h (scm_t_wchar): avoid compilation order problem with declaration of scm_t_wchar	2009-08-25 07:54:37 -07:00
Michael Gran	30a6b9caa9	Only pass ints to tolower and toupper * libguile/strings.c (unistring_escapes_to_guile_escapes): cast tolower's parameter to int * libguile/read.c (CHAR_DOWNCASE): cast tolower's parameter to int	2009-08-11 21:12:52 -07:00

1 2 3 4 5 ...

294 commits