1
Fork 0
mirror of https://git.savannah.gnu.org/git/guile.git synced 2025-04-30 03:40:34 +02:00

Allow decoding of UTF-8 containing U+FFFD, the replacement character.

* libguile/strings.c (scm_from_utf8_stringn): Use 'u8_mbtoucr' and check
  for a decoding error by its 'nbytes' return value.  Previously we used
  'u8_mbtouc' and improperly assumed that a U+FFFD character indicated a
  decoding error.
* libguile/symbols.c (utf8_string_equals_wide_string): Likewise.
* test-suite/tests/bytevectors.test (exception:decoding-error): New
  variable.
  ("2.9 Operations on Strings"): Add tests.
This commit is contained in:
Mark H Weaver 2015-08-13 01:31:36 -04:00
parent 48412395c6
commit 00884bb79f
3 changed files with 21 additions and 8 deletions

View file

@ -1,4 +1,5 @@
/* Copyright (C) 1995,1996,1998,2000,2001, 2004, 2006, 2008, 2009, 2010, 2011, 2012, 2013, 2014 Free Software Foundation, Inc.
/* Copyright (C) 1995, 1996, 1998, 2000, 2001, 2004, 2006,
* 2008-2015 Free Software Foundation, Inc.
*
* This library is free software; you can redistribute it and/or
* modify it under the terms of the GNU Lesser General Public License
@ -1673,9 +1674,9 @@ scm_from_utf8_stringn (const char *str, size_t len)
ascii = 0;
nbytes = u8_mbtouc (&c, ustr + i, len - i);
nbytes = u8_mbtoucr (&c, ustr + i, len - i);
if (c == 0xfffd)
if (nbytes < 0)
/* Bad UTF-8. */
decoding_error (__func__, errno, str, len);