@page @node Pretty Printing @chapter Pretty Printing @c FIXME::martin: Review me! @cindex pretty printing The module @code{(ice-9 pretty-print)} provides the procedure @code{pretty-print}, which provides nicely formatted output of Scheme objects. This is especially useful for deeply nested or complex data structures, such as lists and vectors. The module is loaded by simply saying. @lisp (use-modules (ice-9 pretty-print)) @end lisp This makes the procedure @code{pretty-print} available. As an example how @code{pretty-print} will format the output, see the following: @lisp (pretty-print '(define (foo) (lambda (x) (cond ((zero? x) #t) ((negative? x) -x) (else (if (= x 1) 2 (* x x x))))))) @print{} (define (foo) (lambda (x) (cond ((zero? x) #t) ((negative? x) -x) (else (if (= x 1) 2 (* x x x)))))) @end lisp @deffn {Scheme Procedure} pretty-print obj [port] Print the textual representation of the Scheme object @var{obj} to @var{port}. @var{port} defaults to the current output port, if not given. @end deffn Beware: Since @code{pretty-print} uses it's own write procedure, it's output will not be the same as for example the output of @code{write}. Consider the following example. @lisp (write (lambda (x) x)) @print{} # (pretty-print (lambda (x) x)) @print{} #[procedure] @end lisp The reason is that @code{pretty-print} does not know as much about Guile's object types as the builtin procedures. This is particularly important for smobs, for which a write procedure can be defined and be used by @code{write}, but not by @code{pretty-print}. @page @node Formatted Output @chapter Formatted Output @c FIXME::martin: Review me! @cindex format @cindex formatted output Outputting messages or other texts which are composed of literal strings, variable contents, newlines and other formatting can be cumbersome, when only the standard procedures like @code{display}, @code{write} and @code{newline} are available. Additionally, one often wants to collect the output in strings. With the standard routines, the user is required to set up a string port, add this port as a parameter to the output procedure calls and then retrieve the resulting string from the string port. The @code{format} procedure, to be found in module @code{(ice-9 format)}, can do all this, and even more. If you are a C programmer, you can think of this procedure as Guile's @code{fprintf}. @deffn {Scheme Procedure} format destination format-string args @dots{} The first parameter is the @var{destination}, it determines where the output of @code{format} will go. @table @asis @item @code{#t} Send the formatted output to the current output port and return @code{#t}. @item @code{#f} Return the formatted output as a string. @item Any number value Send the formatted output to the current error port and return @code{#t}. @item A valid output port Send the formatted output to the port @var{destination} and return @code{#t}. @end table The second parameter is the format string. It has a similar function to the format string in calls to @code{printf} or @code{fprintf} in C. It is output to the specified destination, but all escape sequences are replaced by the results of formatting the corresponding sequence. Note that escape sequences are marked with the character @code{~} (tilde), and not with a @code{%} (percent sign), as in C. The escape sequences in the following table are supported. When there appears ``corresponding @var{arg}', that means any of the additional arguments, after dropping all arguments which have been used up by escape sequences which have been processed earlier. Some of the format characters (the characters following the tilde) can be prefixed by @code{:}, @code{@@}, or @code{:@@}, to modify the behaviour of the format character. How the modified behaviour differs from the default behaviour is described for every character in the table where appropriate. @table @code @item ~~ Output a single @code{~} (tilde) character. @item ~% Output a newline character, thus advancing to the next output line. @item ~& Start a new line, that is, output a newline character if not already at the start of a line. @item ~_ Output a single space character. @item ~/ Output a single tabulator character. @item ~| Output a page separator (formfeed) character. @item ~t Advance to the next tabulator position. @item ~y Pretty-print the corresponding @var{arg}. @item ~a Output the corresponding @var{arg} like @code{display}. @item ~s Output the corresponding @var{arg} like @code{write}. @item ~d Output the corresponding @var{arg} as a decimal number. @item ~x Output the corresponding @var{arg} as a hexadecimal number. @item ~o Output the corresponding @var{arg} as an octal number. @item ~b Output the corresponding @var{arg} as a binary number. @item ~r Output the corresponding @var{arg} as a number word, e.g. 10 prints as @code{ten}. If prefixed with @code{:}, @code{tenth} is printed, if prefixed with @code{:@@}, Roman numbers are printed. @item ~f Output the corresponding @var{arg} as a fixed format floating point number, such as @code{1.34}. @item ~e Output the corresponding @var{arg} in exponential notation, such as @code{1.34E+0}. @item ~g This works either like @code{~f} or like @code{~e}, whichever produces less characters to be written. @item ~$ Like @code{~f}, but only with two digits after the decimal point. @item ~i Output the corresponding @var{arg} as a complex number. @item ~c Output the corresponding @var{arg} as a character. If prefixed with @code{@@}, it is printed like with @code{write}. If prefixed with @code{:}, control characters are treated specially, for example @code{#\newline} will be printed as @code{^J}. @item ~p ``Plural''. If the corresponding @var{arg} is 1, nothing is printed (or @code{y} if prefixed with @code{@@} or @code{:@@}), otherwise @code{s} is printed (or @code{ies} if prefixed with @code{@@} or @code{:@@}). @item ~?, ~k Take the corresponding argument as a format string, and the following argument as a list of values. Then format the values with respect to the format string. @item ~! Flush the output to the output port. @item ~#\newline (tilde-newline) @c FIXME::martin: I don't understand this from the source. Continuation lines. @item ~* Argument jumping. Navigate in the argument list as specified by the corresponding argument. If prefixed with @code{:}, jump backwards in the argument list, if prefixed by @code{:@@}, jump to the parameter with the absolute index, otherwise jump forward in the argument list. @item ~( Case conversion begin. If prefixed by @code{:}, the following output string will be capitalized, if prefixed by @code{@@}, the first character will be capitalized, if prefixed by @code{:@@} it will be upcased and otherwise it will be downcased. Conversion stops when the ``Case conversion end'' @code{~)}sequence is encountered. @item ~) Case conversion end. Stop any case conversion currently in effect. @item ~[ @c FIXME::martin: I don't understand this from the source. Conditional begin. @item ~; @c FIXME::martin: I don't understand this from the source. Conditional separator. @item ~] @c FIXME::martin: I don't understand this from the source. Conditional end. @item ~@{ @c FIXME::martin: I don't understand this from the source. Iteration begin. @item ~@} @c FIXME::martin: I don't understand this from the source. Iteration end. @item ~^ @c FIXME::martin: I don't understand this from the source. Up and out. @item ~' @c FIXME::martin: I don't understand this from the source. Character parameter. @item ~0 @dots{} ~9, ~-, ~+ @c FIXME::martin: I don't understand this from the source. Numeric parameter. @item ~v @c FIXME::martin: I don't understand this from the source. Variable parameter from next argument. @item ~# Parameter is number of remaining args. The number of the remaining arguments is prepended to the list of unprocessed arguments. @item ~, @c FIXME::martin: I don't understand this from the source. Parameter separators. @item ~q Inquiry message. Insert a copyright message into the output. @end table If any type conversions should fail (for example when using an escape sequence for number output, but the argument is a string), an error will be signalled. @end deffn You may have noticed that Guile contains a @code{format} procedure even when the module @code{(ice-9 format)} is not loaded. The default @code{format} procedure does not support all escape sequences documented in this chapter, and will signal an error if you try to use one of them. The reason for providing two versions of @code{format} is that the full-featured module is fairly large and requires some time to get loaded. So the Guile maintainers decided not to load the large version of @code{format} by default, so that the start-up time of the interpreter is not unnecessarily increased. @page @node Rx Regexps @chapter The Rx Regular Expression Library [FIXME: this is taken from Gary and Mark's quick summaries and should be reviewed and expanded. Rx is pretty stable, so could already be done!] @cindex rx @cindex finite automaton The @file{guile-lang-allover} package provides an interface to Tom Lord's Rx library (currently only to POSIX regular expressions). Use of the library requires a two step process: compile a regular expression into an efficient structure, then use the structure in any number of string comparisons. For example, given the regular expression @samp{abc.} (which matches any string containing @samp{abc} followed by any single character): @smalllisp guile> @kbd{(define r (regcomp "abc."))} guile> @kbd{r} # guile> @kbd{(regexec r "abc")} #f guile> @kbd{(regexec r "abcd")} #((0 . 4)) guile> @end smalllisp The definitions of @code{regcomp} and @code{regexec} are as follows: @deffn {Scheme Procedure} regcomp pattern [flags] Compile the regular expression pattern using POSIX rules. Flags is optional and should be specified using symbolic names: @defvar REG_EXTENDED use extended POSIX syntax @end defvar @defvar REG_ICASE use case-insensitive matching @end defvar @defvar REG_NEWLINE allow anchors to match after newline characters in the string and prevents @code{.} or @code{[^...]} from matching newlines. @end defvar The @code{logior} procedure can be used to combine multiple flags. The default is to use POSIX basic syntax, which makes @code{+} and @code{?} literals and @code{\+} and @code{\?} operators. Backslashes in @var{pattern} must be escaped if specified in a literal string e.g., @code{"\\(a\\)\\?"}. @end deffn @deffn {Scheme Procedure} regexec regex string [match-pick] [flags] Match @var{string} against the compiled POSIX regular expression @var{regex}. @var{match-pick} and @var{flags} are optional. Possible flags (which can be combined using the logior procedure) are: @defvar REG_NOTBOL The beginning of line operator won't match the beginning of @var{string} (presumably because it's not the beginning of a line) @end defvar @defvar REG_NOTEOL Similar to REG_NOTBOL, but prevents the end of line operator from matching the end of @var{string}. @end defvar If no match is possible, regexec returns #f. Otherwise @var{match-pick} determines the return value: @code{#t} or unspecified: a newly-allocated vector is returned, containing pairs with the indices of the matched part of @var{string} and any substrings. @code{""}: a list is returned: the first element contains a nested list with the matched part of @var{string} surrounded by the the unmatched parts. Remaining elements are matched substrings (if any). All returned substrings share memory with @var{string}. @code{#f}: regexec returns #t if a match is made, otherwise #f. vector: the supplied vector is returned, with the first element replaced by a pair containing the indices of the matched portion of @var{string} and further elements replaced by pairs containing the indices of matched substrings (if any). list: a list will be returned, with each member of the list specified by a code in the corresponding position of the supplied list: a number: the numbered matching substring (0 for the entire match). @code{#\<}: the beginning of @var{string} to the beginning of the part matched by regex. @code{#\>}: the end of the matched part of @var{string} to the end of @var{string}. @code{#\c}: the "final tag", which seems to be associated with the "cut operator", which doesn't seem to be available through the posix interface. e.g., @code{(list #\< 0 1 #\>)}. The returned substrings share memory with @var{string}. @end deffn Here are some other procedures that might be used when using regular expressions: @deffn {Scheme Procedure} compiled-regexp? obj Test whether obj is a compiled regular expression. @end deffn @deffn {Scheme Procedure} regexp->dfa regex [flags] @end deffn @deffn {Scheme Procedure} dfa-fork dfa @end deffn @deffn {Scheme Procedure} reset-dfa! dfa @end deffn @deffn {Scheme Procedure} dfa-final-tag dfa @end deffn @deffn {Scheme Procedure} dfa-continuable? dfa @end deffn @deffn {Scheme Procedure} advance-dfa! dfa string @end deffn @c Local Variables: @c TeX-master: "guile.texi" @c End: