mirror of
https://git.savannah.gnu.org/git/guile.git
synced 2025-04-30 03:40:34 +02:00
scm_i_utf8_string_hash: compute u8 chars not bytes
Noticed while investigating a migration to utf-8 strings. After making changes that routed non-ascii symbol hashing through this function, encoding-iso88597.test began intermittently failing because it would traverse trailing garbage when u8_strnlen reported 8 chars instead of 4. Change the scm_i_str2symbol and scm_i_str2uninterned_symbol internal hash type to unsigned long to explicitly match the scm_i_string_hash result type. * libguile/hash.c (scm_i_utf8_string_hash): Call u8_mbsnlen not u8_strnlen. * libguile/symbols.c (scm_i_str2symbol, scm_i_str2uninterned_symbol): Use unsigned long for scm_i_string_hash result. * test-suite/standalone/.gitignore: Add test-hashing. * test-suite/standalone/Makefile.am: Add test-hashing. * test-suite/standalone/test-hashing.c: Add.
This commit is contained in:
parent
f0df1ed0fd
commit
ffb95239aa
6 changed files with 86 additions and 3 deletions
|
@ -185,7 +185,7 @@ scm_i_utf8_string_hash (const char *str, size_t len)
|
|||
/* Invalid UTF-8; punt. */
|
||||
return scm_i_string_hash (scm_from_utf8_stringn (str, len));
|
||||
|
||||
length = u8_strnlen (ustr, len);
|
||||
length = u8_mbsnlen (ustr, len);
|
||||
|
||||
/* Set up the internal state. */
|
||||
a = b = c = 0xdeadbeef + ((uint32_t)(length<<2)) + 47;
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue