mailing list of musl libc
 help / color / mirror / code / Atom feed
* [musl] swprintf cannot handle the character 0xff
@ 2023-06-12 12:30 Bruno Haible
  2023-06-12 20:22 ` Rich Felker
  0 siblings, 1 reply; 7+ messages in thread
From: Bruno Haible @ 2023-06-12 12:30 UTC (permalink / raw)
  To: musl

When swprintf is meant to convert a character to a wide character, through
the %c directive, it fails if that character is '\xff'.

Seen with musl libc 1.2.4, in Alpine Linux 3.18.0.

How to reproduce:
============================ foo.c ============================
#include <stdio.h>
#include <wchar.h>
int main ()
{
  wchar_t buf[12];
  for (int c = 1; c < 256; c++)
    {
      fprintf (stderr, "c = %d: ", c);
      int ret = swprintf (buf, 12, L"%c", c);
      if (ret >= 0)
        fprintf (stderr, "OK, %d bytes\n", ret);
      else
        perror ("swprintf failed");
    }
}
===============================================================
$ gcc -Wall foo.c
$ ./a.out

Expected output:
c = 1: OK, 1 bytes
c = 2: OK, 1 bytes
c = 3: OK, 1 bytes
c = 4: OK, 1 bytes
c = 5: OK, 1 bytes
c = 6: OK, 1 bytes
c = 7: OK, 1 bytes
c = 8: OK, 1 bytes
c = 9: OK, 1 bytes
c = 10: OK, 1 bytes
c = 11: OK, 1 bytes
c = 12: OK, 1 bytes
c = 13: OK, 1 bytes
c = 14: OK, 1 bytes
c = 15: OK, 1 bytes
c = 16: OK, 1 bytes
c = 17: OK, 1 bytes
c = 18: OK, 1 bytes
c = 19: OK, 1 bytes
c = 20: OK, 1 bytes
c = 21: OK, 1 bytes
c = 22: OK, 1 bytes
c = 23: OK, 1 bytes
c = 24: OK, 1 bytes
c = 25: OK, 1 bytes
c = 26: OK, 1 bytes
c = 27: OK, 1 bytes
c = 28: OK, 1 bytes
c = 29: OK, 1 bytes
c = 30: OK, 1 bytes
c = 31: OK, 1 bytes
c = 32: OK, 1 bytes
c = 33: OK, 1 bytes
c = 34: OK, 1 bytes
c = 35: OK, 1 bytes
c = 36: OK, 1 bytes
c = 37: OK, 1 bytes
c = 38: OK, 1 bytes
c = 39: OK, 1 bytes
c = 40: OK, 1 bytes
c = 41: OK, 1 bytes
c = 42: OK, 1 bytes
c = 43: OK, 1 bytes
c = 44: OK, 1 bytes
c = 45: OK, 1 bytes
c = 46: OK, 1 bytes
c = 47: OK, 1 bytes
c = 48: OK, 1 bytes
c = 49: OK, 1 bytes
c = 50: OK, 1 bytes
c = 51: OK, 1 bytes
c = 52: OK, 1 bytes
c = 53: OK, 1 bytes
c = 54: OK, 1 bytes
c = 55: OK, 1 bytes
c = 56: OK, 1 bytes
c = 57: OK, 1 bytes
c = 58: OK, 1 bytes
c = 59: OK, 1 bytes
c = 60: OK, 1 bytes
c = 61: OK, 1 bytes
c = 62: OK, 1 bytes
c = 63: OK, 1 bytes
c = 64: OK, 1 bytes
c = 65: OK, 1 bytes
c = 66: OK, 1 bytes
c = 67: OK, 1 bytes
c = 68: OK, 1 bytes
c = 69: OK, 1 bytes
c = 70: OK, 1 bytes
c = 71: OK, 1 bytes
c = 72: OK, 1 bytes
c = 73: OK, 1 bytes
c = 74: OK, 1 bytes
c = 75: OK, 1 bytes
c = 76: OK, 1 bytes
c = 77: OK, 1 bytes
c = 78: OK, 1 bytes
c = 79: OK, 1 bytes
c = 80: OK, 1 bytes
c = 81: OK, 1 bytes
c = 82: OK, 1 bytes
c = 83: OK, 1 bytes
c = 84: OK, 1 bytes
c = 85: OK, 1 bytes
c = 86: OK, 1 bytes
c = 87: OK, 1 bytes
c = 88: OK, 1 bytes
c = 89: OK, 1 bytes
c = 90: OK, 1 bytes
c = 91: OK, 1 bytes
c = 92: OK, 1 bytes
c = 93: OK, 1 bytes
c = 94: OK, 1 bytes
c = 95: OK, 1 bytes
c = 96: OK, 1 bytes
c = 97: OK, 1 bytes
c = 98: OK, 1 bytes
c = 99: OK, 1 bytes
c = 100: OK, 1 bytes
c = 101: OK, 1 bytes
c = 102: OK, 1 bytes
c = 103: OK, 1 bytes
c = 104: OK, 1 bytes
c = 105: OK, 1 bytes
c = 106: OK, 1 bytes
c = 107: OK, 1 bytes
c = 108: OK, 1 bytes
c = 109: OK, 1 bytes
c = 110: OK, 1 bytes
c = 111: OK, 1 bytes
c = 112: OK, 1 bytes
c = 113: OK, 1 bytes
c = 114: OK, 1 bytes
c = 115: OK, 1 bytes
c = 116: OK, 1 bytes
c = 117: OK, 1 bytes
c = 118: OK, 1 bytes
c = 119: OK, 1 bytes
c = 120: OK, 1 bytes
c = 121: OK, 1 bytes
c = 122: OK, 1 bytes
c = 123: OK, 1 bytes
c = 124: OK, 1 bytes
c = 125: OK, 1 bytes
c = 126: OK, 1 bytes
c = 127: OK, 1 bytes
c = 128: OK, 1 bytes
c = 129: OK, 1 bytes
c = 130: OK, 1 bytes
c = 131: OK, 1 bytes
c = 132: OK, 1 bytes
c = 133: OK, 1 bytes
c = 134: OK, 1 bytes
c = 135: OK, 1 bytes
c = 136: OK, 1 bytes
c = 137: OK, 1 bytes
c = 138: OK, 1 bytes
c = 139: OK, 1 bytes
c = 140: OK, 1 bytes
c = 141: OK, 1 bytes
c = 142: OK, 1 bytes
c = 143: OK, 1 bytes
c = 144: OK, 1 bytes
c = 145: OK, 1 bytes
c = 146: OK, 1 bytes
c = 147: OK, 1 bytes
c = 148: OK, 1 bytes
c = 149: OK, 1 bytes
c = 150: OK, 1 bytes
c = 151: OK, 1 bytes
c = 152: OK, 1 bytes
c = 153: OK, 1 bytes
c = 154: OK, 1 bytes
c = 155: OK, 1 bytes
c = 156: OK, 1 bytes
c = 157: OK, 1 bytes
c = 158: OK, 1 bytes
c = 159: OK, 1 bytes
c = 160: OK, 1 bytes
c = 161: OK, 1 bytes
c = 162: OK, 1 bytes
c = 163: OK, 1 bytes
c = 164: OK, 1 bytes
c = 165: OK, 1 bytes
c = 166: OK, 1 bytes
c = 167: OK, 1 bytes
c = 168: OK, 1 bytes
c = 169: OK, 1 bytes
c = 170: OK, 1 bytes
c = 171: OK, 1 bytes
c = 172: OK, 1 bytes
c = 173: OK, 1 bytes
c = 174: OK, 1 bytes
c = 175: OK, 1 bytes
c = 176: OK, 1 bytes
c = 177: OK, 1 bytes
c = 178: OK, 1 bytes
c = 179: OK, 1 bytes
c = 180: OK, 1 bytes
c = 181: OK, 1 bytes
c = 182: OK, 1 bytes
c = 183: OK, 1 bytes
c = 184: OK, 1 bytes
c = 185: OK, 1 bytes
c = 186: OK, 1 bytes
c = 187: OK, 1 bytes
c = 188: OK, 1 bytes
c = 189: OK, 1 bytes
c = 190: OK, 1 bytes
c = 191: OK, 1 bytes
c = 192: OK, 1 bytes
c = 193: OK, 1 bytes
c = 194: OK, 1 bytes
c = 195: OK, 1 bytes
c = 196: OK, 1 bytes
c = 197: OK, 1 bytes
c = 198: OK, 1 bytes
c = 199: OK, 1 bytes
c = 200: OK, 1 bytes
c = 201: OK, 1 bytes
c = 202: OK, 1 bytes
c = 203: OK, 1 bytes
c = 204: OK, 1 bytes
c = 205: OK, 1 bytes
c = 206: OK, 1 bytes
c = 207: OK, 1 bytes
c = 208: OK, 1 bytes
c = 209: OK, 1 bytes
c = 210: OK, 1 bytes
c = 211: OK, 1 bytes
c = 212: OK, 1 bytes
c = 213: OK, 1 bytes
c = 214: OK, 1 bytes
c = 215: OK, 1 bytes
c = 216: OK, 1 bytes
c = 217: OK, 1 bytes
c = 218: OK, 1 bytes
c = 219: OK, 1 bytes
c = 220: OK, 1 bytes
c = 221: OK, 1 bytes
c = 222: OK, 1 bytes
c = 223: OK, 1 bytes
c = 224: OK, 1 bytes
c = 225: OK, 1 bytes
c = 226: OK, 1 bytes
c = 227: OK, 1 bytes
c = 228: OK, 1 bytes
c = 229: OK, 1 bytes
c = 230: OK, 1 bytes
c = 231: OK, 1 bytes
c = 232: OK, 1 bytes
c = 233: OK, 1 bytes
c = 234: OK, 1 bytes
c = 235: OK, 1 bytes
c = 236: OK, 1 bytes
c = 237: OK, 1 bytes
c = 238: OK, 1 bytes
c = 239: OK, 1 bytes
c = 240: OK, 1 bytes
c = 241: OK, 1 bytes
c = 242: OK, 1 bytes
c = 243: OK, 1 bytes
c = 244: OK, 1 bytes
c = 245: OK, 1 bytes
c = 246: OK, 1 bytes
c = 247: OK, 1 bytes
c = 248: OK, 1 bytes
c = 249: OK, 1 bytes
c = 250: OK, 1 bytes
c = 251: OK, 1 bytes
c = 252: OK, 1 bytes
c = 253: OK, 1 bytes
c = 254: OK, 1 bytes
c = 255: OK, 1 bytes

Actual output:
c = 1: OK, 1 bytes
c = 2: OK, 1 bytes
c = 3: OK, 1 bytes
c = 4: OK, 1 bytes
c = 5: OK, 1 bytes
c = 6: OK, 1 bytes
c = 7: OK, 1 bytes
c = 8: OK, 1 bytes
c = 9: OK, 1 bytes
c = 10: OK, 1 bytes
c = 11: OK, 1 bytes
c = 12: OK, 1 bytes
c = 13: OK, 1 bytes
c = 14: OK, 1 bytes
c = 15: OK, 1 bytes
c = 16: OK, 1 bytes
c = 17: OK, 1 bytes
c = 18: OK, 1 bytes
c = 19: OK, 1 bytes
c = 20: OK, 1 bytes
c = 21: OK, 1 bytes
c = 22: OK, 1 bytes
c = 23: OK, 1 bytes
c = 24: OK, 1 bytes
c = 25: OK, 1 bytes
c = 26: OK, 1 bytes
c = 27: OK, 1 bytes
c = 28: OK, 1 bytes
c = 29: OK, 1 bytes
c = 30: OK, 1 bytes
c = 31: OK, 1 bytes
c = 32: OK, 1 bytes
c = 33: OK, 1 bytes
c = 34: OK, 1 bytes
c = 35: OK, 1 bytes
c = 36: OK, 1 bytes
c = 37: OK, 1 bytes
c = 38: OK, 1 bytes
c = 39: OK, 1 bytes
c = 40: OK, 1 bytes
c = 41: OK, 1 bytes
c = 42: OK, 1 bytes
c = 43: OK, 1 bytes
c = 44: OK, 1 bytes
c = 45: OK, 1 bytes
c = 46: OK, 1 bytes
c = 47: OK, 1 bytes
c = 48: OK, 1 bytes
c = 49: OK, 1 bytes
c = 50: OK, 1 bytes
c = 51: OK, 1 bytes
c = 52: OK, 1 bytes
c = 53: OK, 1 bytes
c = 54: OK, 1 bytes
c = 55: OK, 1 bytes
c = 56: OK, 1 bytes
c = 57: OK, 1 bytes
c = 58: OK, 1 bytes
c = 59: OK, 1 bytes
c = 60: OK, 1 bytes
c = 61: OK, 1 bytes
c = 62: OK, 1 bytes
c = 63: OK, 1 bytes
c = 64: OK, 1 bytes
c = 65: OK, 1 bytes
c = 66: OK, 1 bytes
c = 67: OK, 1 bytes
c = 68: OK, 1 bytes
c = 69: OK, 1 bytes
c = 70: OK, 1 bytes
c = 71: OK, 1 bytes
c = 72: OK, 1 bytes
c = 73: OK, 1 bytes
c = 74: OK, 1 bytes
c = 75: OK, 1 bytes
c = 76: OK, 1 bytes
c = 77: OK, 1 bytes
c = 78: OK, 1 bytes
c = 79: OK, 1 bytes
c = 80: OK, 1 bytes
c = 81: OK, 1 bytes
c = 82: OK, 1 bytes
c = 83: OK, 1 bytes
c = 84: OK, 1 bytes
c = 85: OK, 1 bytes
c = 86: OK, 1 bytes
c = 87: OK, 1 bytes
c = 88: OK, 1 bytes
c = 89: OK, 1 bytes
c = 90: OK, 1 bytes
c = 91: OK, 1 bytes
c = 92: OK, 1 bytes
c = 93: OK, 1 bytes
c = 94: OK, 1 bytes
c = 95: OK, 1 bytes
c = 96: OK, 1 bytes
c = 97: OK, 1 bytes
c = 98: OK, 1 bytes
c = 99: OK, 1 bytes
c = 100: OK, 1 bytes
c = 101: OK, 1 bytes
c = 102: OK, 1 bytes
c = 103: OK, 1 bytes
c = 104: OK, 1 bytes
c = 105: OK, 1 bytes
c = 106: OK, 1 bytes
c = 107: OK, 1 bytes
c = 108: OK, 1 bytes
c = 109: OK, 1 bytes
c = 110: OK, 1 bytes
c = 111: OK, 1 bytes
c = 112: OK, 1 bytes
c = 113: OK, 1 bytes
c = 114: OK, 1 bytes
c = 115: OK, 1 bytes
c = 116: OK, 1 bytes
c = 117: OK, 1 bytes
c = 118: OK, 1 bytes
c = 119: OK, 1 bytes
c = 120: OK, 1 bytes
c = 121: OK, 1 bytes
c = 122: OK, 1 bytes
c = 123: OK, 1 bytes
c = 124: OK, 1 bytes
c = 125: OK, 1 bytes
c = 126: OK, 1 bytes
c = 127: OK, 1 bytes
c = 128: OK, 1 bytes
c = 129: OK, 1 bytes
c = 130: OK, 1 bytes
c = 131: OK, 1 bytes
c = 132: OK, 1 bytes
c = 133: OK, 1 bytes
c = 134: OK, 1 bytes
c = 135: OK, 1 bytes
c = 136: OK, 1 bytes
c = 137: OK, 1 bytes
c = 138: OK, 1 bytes
c = 139: OK, 1 bytes
c = 140: OK, 1 bytes
c = 141: OK, 1 bytes
c = 142: OK, 1 bytes
c = 143: OK, 1 bytes
c = 144: OK, 1 bytes
c = 145: OK, 1 bytes
c = 146: OK, 1 bytes
c = 147: OK, 1 bytes
c = 148: OK, 1 bytes
c = 149: OK, 1 bytes
c = 150: OK, 1 bytes
c = 151: OK, 1 bytes
c = 152: OK, 1 bytes
c = 153: OK, 1 bytes
c = 154: OK, 1 bytes
c = 155: OK, 1 bytes
c = 156: OK, 1 bytes
c = 157: OK, 1 bytes
c = 158: OK, 1 bytes
c = 159: OK, 1 bytes
c = 160: OK, 1 bytes
c = 161: OK, 1 bytes
c = 162: OK, 1 bytes
c = 163: OK, 1 bytes
c = 164: OK, 1 bytes
c = 165: OK, 1 bytes
c = 166: OK, 1 bytes
c = 167: OK, 1 bytes
c = 168: OK, 1 bytes
c = 169: OK, 1 bytes
c = 170: OK, 1 bytes
c = 171: OK, 1 bytes
c = 172: OK, 1 bytes
c = 173: OK, 1 bytes
c = 174: OK, 1 bytes
c = 175: OK, 1 bytes
c = 176: OK, 1 bytes
c = 177: OK, 1 bytes
c = 178: OK, 1 bytes
c = 179: OK, 1 bytes
c = 180: OK, 1 bytes
c = 181: OK, 1 bytes
c = 182: OK, 1 bytes
c = 183: OK, 1 bytes
c = 184: OK, 1 bytes
c = 185: OK, 1 bytes
c = 186: OK, 1 bytes
c = 187: OK, 1 bytes
c = 188: OK, 1 bytes
c = 189: OK, 1 bytes
c = 190: OK, 1 bytes
c = 191: OK, 1 bytes
c = 192: OK, 1 bytes
c = 193: OK, 1 bytes
c = 194: OK, 1 bytes
c = 195: OK, 1 bytes
c = 196: OK, 1 bytes
c = 197: OK, 1 bytes
c = 198: OK, 1 bytes
c = 199: OK, 1 bytes
c = 200: OK, 1 bytes
c = 201: OK, 1 bytes
c = 202: OK, 1 bytes
c = 203: OK, 1 bytes
c = 204: OK, 1 bytes
c = 205: OK, 1 bytes
c = 206: OK, 1 bytes
c = 207: OK, 1 bytes
c = 208: OK, 1 bytes
c = 209: OK, 1 bytes
c = 210: OK, 1 bytes
c = 211: OK, 1 bytes
c = 212: OK, 1 bytes
c = 213: OK, 1 bytes
c = 214: OK, 1 bytes
c = 215: OK, 1 bytes
c = 216: OK, 1 bytes
c = 217: OK, 1 bytes
c = 218: OK, 1 bytes
c = 219: OK, 1 bytes
c = 220: OK, 1 bytes
c = 221: OK, 1 bytes
c = 222: OK, 1 bytes
c = 223: OK, 1 bytes
c = 224: OK, 1 bytes
c = 225: OK, 1 bytes
c = 226: OK, 1 bytes
c = 227: OK, 1 bytes
c = 228: OK, 1 bytes
c = 229: OK, 1 bytes
c = 230: OK, 1 bytes
c = 231: OK, 1 bytes
c = 232: OK, 1 bytes
c = 233: OK, 1 bytes
c = 234: OK, 1 bytes
c = 235: OK, 1 bytes
c = 236: OK, 1 bytes
c = 237: OK, 1 bytes
c = 238: OK, 1 bytes
c = 239: OK, 1 bytes
c = 240: OK, 1 bytes
c = 241: OK, 1 bytes
c = 242: OK, 1 bytes
c = 243: OK, 1 bytes
c = 244: OK, 1 bytes
c = 245: OK, 1 bytes
c = 246: OK, 1 bytes
c = 247: OK, 1 bytes
c = 248: OK, 1 bytes
c = 249: OK, 1 bytes
c = 250: OK, 1 bytes
c = 251: OK, 1 bytes
c = 252: OK, 1 bytes
c = 253: OK, 1 bytes
c = 254: OK, 1 bytes
c = 255: swprintf failed: Illegal byte sequence

This is a bug, because POSIX says that in the C / POSIX locale, "all byte
values are valid characters" [1].

Bruno

[1] https://pubs.opengroup.org/onlinepubs/9699919799/functions/mbrtowc.html




^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [musl] swprintf cannot handle the character 0xff
  2023-06-12 12:30 [musl] swprintf cannot handle the character 0xff Bruno Haible
@ 2023-06-12 20:22 ` Rich Felker
  2023-06-12 21:01   ` Rich Felker
  0 siblings, 1 reply; 7+ messages in thread
From: Rich Felker @ 2023-06-12 20:22 UTC (permalink / raw)
  To: Bruno Haible; +Cc: musl

[-- Attachment #1: Type: text/plain, Size: 12606 bytes --]

On Mon, Jun 12, 2023 at 02:30:44PM +0200, Bruno Haible wrote:
> When swprintf is meant to convert a character to a wide character, through
> the %c directive, it fails if that character is '\xff'.
> 
> Seen with musl libc 1.2.4, in Alpine Linux 3.18.0.
> 
> How to reproduce:
> ============================ foo.c ============================
> #include <stdio.h>
> #include <wchar.h>
> int main ()
> {
>   wchar_t buf[12];
>   for (int c = 1; c < 256; c++)
>     {
>       fprintf (stderr, "c = %d: ", c);
>       int ret = swprintf (buf, 12, L"%c", c);
>       if (ret >= 0)
>         fprintf (stderr, "OK, %d bytes\n", ret);
>       else
>         perror ("swprintf failed");
>     }
> }
> ===============================================================
> $ gcc -Wall foo.c
> $ ./a.out
> 
> Expected output:
> c = 1: OK, 1 bytes
> c = 2: OK, 1 bytes
> c = 3: OK, 1 bytes
> c = 4: OK, 1 bytes
> c = 5: OK, 1 bytes
> c = 6: OK, 1 bytes
> c = 7: OK, 1 bytes
> c = 8: OK, 1 bytes
> c = 9: OK, 1 bytes
> c = 10: OK, 1 bytes
> c = 11: OK, 1 bytes
> c = 12: OK, 1 bytes
> c = 13: OK, 1 bytes
> c = 14: OK, 1 bytes
> c = 15: OK, 1 bytes
> c = 16: OK, 1 bytes
> c = 17: OK, 1 bytes
> c = 18: OK, 1 bytes
> c = 19: OK, 1 bytes
> c = 20: OK, 1 bytes
> c = 21: OK, 1 bytes
> c = 22: OK, 1 bytes
> c = 23: OK, 1 bytes
> c = 24: OK, 1 bytes
> c = 25: OK, 1 bytes
> c = 26: OK, 1 bytes
> c = 27: OK, 1 bytes
> c = 28: OK, 1 bytes
> c = 29: OK, 1 bytes
> c = 30: OK, 1 bytes
> c = 31: OK, 1 bytes
> c = 32: OK, 1 bytes
> c = 33: OK, 1 bytes
> c = 34: OK, 1 bytes
> c = 35: OK, 1 bytes
> c = 36: OK, 1 bytes
> c = 37: OK, 1 bytes
> c = 38: OK, 1 bytes
> c = 39: OK, 1 bytes
> c = 40: OK, 1 bytes
> c = 41: OK, 1 bytes
> c = 42: OK, 1 bytes
> c = 43: OK, 1 bytes
> c = 44: OK, 1 bytes
> c = 45: OK, 1 bytes
> c = 46: OK, 1 bytes
> c = 47: OK, 1 bytes
> c = 48: OK, 1 bytes
> c = 49: OK, 1 bytes
> c = 50: OK, 1 bytes
> c = 51: OK, 1 bytes
> c = 52: OK, 1 bytes
> c = 53: OK, 1 bytes
> c = 54: OK, 1 bytes
> c = 55: OK, 1 bytes
> c = 56: OK, 1 bytes
> c = 57: OK, 1 bytes
> c = 58: OK, 1 bytes
> c = 59: OK, 1 bytes
> c = 60: OK, 1 bytes
> c = 61: OK, 1 bytes
> c = 62: OK, 1 bytes
> c = 63: OK, 1 bytes
> c = 64: OK, 1 bytes
> c = 65: OK, 1 bytes
> c = 66: OK, 1 bytes
> c = 67: OK, 1 bytes
> c = 68: OK, 1 bytes
> c = 69: OK, 1 bytes
> c = 70: OK, 1 bytes
> c = 71: OK, 1 bytes
> c = 72: OK, 1 bytes
> c = 73: OK, 1 bytes
> c = 74: OK, 1 bytes
> c = 75: OK, 1 bytes
> c = 76: OK, 1 bytes
> c = 77: OK, 1 bytes
> c = 78: OK, 1 bytes
> c = 79: OK, 1 bytes
> c = 80: OK, 1 bytes
> c = 81: OK, 1 bytes
> c = 82: OK, 1 bytes
> c = 83: OK, 1 bytes
> c = 84: OK, 1 bytes
> c = 85: OK, 1 bytes
> c = 86: OK, 1 bytes
> c = 87: OK, 1 bytes
> c = 88: OK, 1 bytes
> c = 89: OK, 1 bytes
> c = 90: OK, 1 bytes
> c = 91: OK, 1 bytes
> c = 92: OK, 1 bytes
> c = 93: OK, 1 bytes
> c = 94: OK, 1 bytes
> c = 95: OK, 1 bytes
> c = 96: OK, 1 bytes
> c = 97: OK, 1 bytes
> c = 98: OK, 1 bytes
> c = 99: OK, 1 bytes
> c = 100: OK, 1 bytes
> c = 101: OK, 1 bytes
> c = 102: OK, 1 bytes
> c = 103: OK, 1 bytes
> c = 104: OK, 1 bytes
> c = 105: OK, 1 bytes
> c = 106: OK, 1 bytes
> c = 107: OK, 1 bytes
> c = 108: OK, 1 bytes
> c = 109: OK, 1 bytes
> c = 110: OK, 1 bytes
> c = 111: OK, 1 bytes
> c = 112: OK, 1 bytes
> c = 113: OK, 1 bytes
> c = 114: OK, 1 bytes
> c = 115: OK, 1 bytes
> c = 116: OK, 1 bytes
> c = 117: OK, 1 bytes
> c = 118: OK, 1 bytes
> c = 119: OK, 1 bytes
> c = 120: OK, 1 bytes
> c = 121: OK, 1 bytes
> c = 122: OK, 1 bytes
> c = 123: OK, 1 bytes
> c = 124: OK, 1 bytes
> c = 125: OK, 1 bytes
> c = 126: OK, 1 bytes
> c = 127: OK, 1 bytes
> c = 128: OK, 1 bytes
> c = 129: OK, 1 bytes
> c = 130: OK, 1 bytes
> c = 131: OK, 1 bytes
> c = 132: OK, 1 bytes
> c = 133: OK, 1 bytes
> c = 134: OK, 1 bytes
> c = 135: OK, 1 bytes
> c = 136: OK, 1 bytes
> c = 137: OK, 1 bytes
> c = 138: OK, 1 bytes
> c = 139: OK, 1 bytes
> c = 140: OK, 1 bytes
> c = 141: OK, 1 bytes
> c = 142: OK, 1 bytes
> c = 143: OK, 1 bytes
> c = 144: OK, 1 bytes
> c = 145: OK, 1 bytes
> c = 146: OK, 1 bytes
> c = 147: OK, 1 bytes
> c = 148: OK, 1 bytes
> c = 149: OK, 1 bytes
> c = 150: OK, 1 bytes
> c = 151: OK, 1 bytes
> c = 152: OK, 1 bytes
> c = 153: OK, 1 bytes
> c = 154: OK, 1 bytes
> c = 155: OK, 1 bytes
> c = 156: OK, 1 bytes
> c = 157: OK, 1 bytes
> c = 158: OK, 1 bytes
> c = 159: OK, 1 bytes
> c = 160: OK, 1 bytes
> c = 161: OK, 1 bytes
> c = 162: OK, 1 bytes
> c = 163: OK, 1 bytes
> c = 164: OK, 1 bytes
> c = 165: OK, 1 bytes
> c = 166: OK, 1 bytes
> c = 167: OK, 1 bytes
> c = 168: OK, 1 bytes
> c = 169: OK, 1 bytes
> c = 170: OK, 1 bytes
> c = 171: OK, 1 bytes
> c = 172: OK, 1 bytes
> c = 173: OK, 1 bytes
> c = 174: OK, 1 bytes
> c = 175: OK, 1 bytes
> c = 176: OK, 1 bytes
> c = 177: OK, 1 bytes
> c = 178: OK, 1 bytes
> c = 179: OK, 1 bytes
> c = 180: OK, 1 bytes
> c = 181: OK, 1 bytes
> c = 182: OK, 1 bytes
> c = 183: OK, 1 bytes
> c = 184: OK, 1 bytes
> c = 185: OK, 1 bytes
> c = 186: OK, 1 bytes
> c = 187: OK, 1 bytes
> c = 188: OK, 1 bytes
> c = 189: OK, 1 bytes
> c = 190: OK, 1 bytes
> c = 191: OK, 1 bytes
> c = 192: OK, 1 bytes
> c = 193: OK, 1 bytes
> c = 194: OK, 1 bytes
> c = 195: OK, 1 bytes
> c = 196: OK, 1 bytes
> c = 197: OK, 1 bytes
> c = 198: OK, 1 bytes
> c = 199: OK, 1 bytes
> c = 200: OK, 1 bytes
> c = 201: OK, 1 bytes
> c = 202: OK, 1 bytes
> c = 203: OK, 1 bytes
> c = 204: OK, 1 bytes
> c = 205: OK, 1 bytes
> c = 206: OK, 1 bytes
> c = 207: OK, 1 bytes
> c = 208: OK, 1 bytes
> c = 209: OK, 1 bytes
> c = 210: OK, 1 bytes
> c = 211: OK, 1 bytes
> c = 212: OK, 1 bytes
> c = 213: OK, 1 bytes
> c = 214: OK, 1 bytes
> c = 215: OK, 1 bytes
> c = 216: OK, 1 bytes
> c = 217: OK, 1 bytes
> c = 218: OK, 1 bytes
> c = 219: OK, 1 bytes
> c = 220: OK, 1 bytes
> c = 221: OK, 1 bytes
> c = 222: OK, 1 bytes
> c = 223: OK, 1 bytes
> c = 224: OK, 1 bytes
> c = 225: OK, 1 bytes
> c = 226: OK, 1 bytes
> c = 227: OK, 1 bytes
> c = 228: OK, 1 bytes
> c = 229: OK, 1 bytes
> c = 230: OK, 1 bytes
> c = 231: OK, 1 bytes
> c = 232: OK, 1 bytes
> c = 233: OK, 1 bytes
> c = 234: OK, 1 bytes
> c = 235: OK, 1 bytes
> c = 236: OK, 1 bytes
> c = 237: OK, 1 bytes
> c = 238: OK, 1 bytes
> c = 239: OK, 1 bytes
> c = 240: OK, 1 bytes
> c = 241: OK, 1 bytes
> c = 242: OK, 1 bytes
> c = 243: OK, 1 bytes
> c = 244: OK, 1 bytes
> c = 245: OK, 1 bytes
> c = 246: OK, 1 bytes
> c = 247: OK, 1 bytes
> c = 248: OK, 1 bytes
> c = 249: OK, 1 bytes
> c = 250: OK, 1 bytes
> c = 251: OK, 1 bytes
> c = 252: OK, 1 bytes
> c = 253: OK, 1 bytes
> c = 254: OK, 1 bytes
> c = 255: OK, 1 bytes
> 
> Actual output:
> c = 1: OK, 1 bytes
> c = 2: OK, 1 bytes
> c = 3: OK, 1 bytes
> c = 4: OK, 1 bytes
> c = 5: OK, 1 bytes
> c = 6: OK, 1 bytes
> c = 7: OK, 1 bytes
> c = 8: OK, 1 bytes
> c = 9: OK, 1 bytes
> c = 10: OK, 1 bytes
> c = 11: OK, 1 bytes
> c = 12: OK, 1 bytes
> c = 13: OK, 1 bytes
> c = 14: OK, 1 bytes
> c = 15: OK, 1 bytes
> c = 16: OK, 1 bytes
> c = 17: OK, 1 bytes
> c = 18: OK, 1 bytes
> c = 19: OK, 1 bytes
> c = 20: OK, 1 bytes
> c = 21: OK, 1 bytes
> c = 22: OK, 1 bytes
> c = 23: OK, 1 bytes
> c = 24: OK, 1 bytes
> c = 25: OK, 1 bytes
> c = 26: OK, 1 bytes
> c = 27: OK, 1 bytes
> c = 28: OK, 1 bytes
> c = 29: OK, 1 bytes
> c = 30: OK, 1 bytes
> c = 31: OK, 1 bytes
> c = 32: OK, 1 bytes
> c = 33: OK, 1 bytes
> c = 34: OK, 1 bytes
> c = 35: OK, 1 bytes
> c = 36: OK, 1 bytes
> c = 37: OK, 1 bytes
> c = 38: OK, 1 bytes
> c = 39: OK, 1 bytes
> c = 40: OK, 1 bytes
> c = 41: OK, 1 bytes
> c = 42: OK, 1 bytes
> c = 43: OK, 1 bytes
> c = 44: OK, 1 bytes
> c = 45: OK, 1 bytes
> c = 46: OK, 1 bytes
> c = 47: OK, 1 bytes
> c = 48: OK, 1 bytes
> c = 49: OK, 1 bytes
> c = 50: OK, 1 bytes
> c = 51: OK, 1 bytes
> c = 52: OK, 1 bytes
> c = 53: OK, 1 bytes
> c = 54: OK, 1 bytes
> c = 55: OK, 1 bytes
> c = 56: OK, 1 bytes
> c = 57: OK, 1 bytes
> c = 58: OK, 1 bytes
> c = 59: OK, 1 bytes
> c = 60: OK, 1 bytes
> c = 61: OK, 1 bytes
> c = 62: OK, 1 bytes
> c = 63: OK, 1 bytes
> c = 64: OK, 1 bytes
> c = 65: OK, 1 bytes
> c = 66: OK, 1 bytes
> c = 67: OK, 1 bytes
> c = 68: OK, 1 bytes
> c = 69: OK, 1 bytes
> c = 70: OK, 1 bytes
> c = 71: OK, 1 bytes
> c = 72: OK, 1 bytes
> c = 73: OK, 1 bytes
> c = 74: OK, 1 bytes
> c = 75: OK, 1 bytes
> c = 76: OK, 1 bytes
> c = 77: OK, 1 bytes
> c = 78: OK, 1 bytes
> c = 79: OK, 1 bytes
> c = 80: OK, 1 bytes
> c = 81: OK, 1 bytes
> c = 82: OK, 1 bytes
> c = 83: OK, 1 bytes
> c = 84: OK, 1 bytes
> c = 85: OK, 1 bytes
> c = 86: OK, 1 bytes
> c = 87: OK, 1 bytes
> c = 88: OK, 1 bytes
> c = 89: OK, 1 bytes
> c = 90: OK, 1 bytes
> c = 91: OK, 1 bytes
> c = 92: OK, 1 bytes
> c = 93: OK, 1 bytes
> c = 94: OK, 1 bytes
> c = 95: OK, 1 bytes
> c = 96: OK, 1 bytes
> c = 97: OK, 1 bytes
> c = 98: OK, 1 bytes
> c = 99: OK, 1 bytes
> c = 100: OK, 1 bytes
> c = 101: OK, 1 bytes
> c = 102: OK, 1 bytes
> c = 103: OK, 1 bytes
> c = 104: OK, 1 bytes
> c = 105: OK, 1 bytes
> c = 106: OK, 1 bytes
> c = 107: OK, 1 bytes
> c = 108: OK, 1 bytes
> c = 109: OK, 1 bytes
> c = 110: OK, 1 bytes
> c = 111: OK, 1 bytes
> c = 112: OK, 1 bytes
> c = 113: OK, 1 bytes
> c = 114: OK, 1 bytes
> c = 115: OK, 1 bytes
> c = 116: OK, 1 bytes
> c = 117: OK, 1 bytes
> c = 118: OK, 1 bytes
> c = 119: OK, 1 bytes
> c = 120: OK, 1 bytes
> c = 121: OK, 1 bytes
> c = 122: OK, 1 bytes
> c = 123: OK, 1 bytes
> c = 124: OK, 1 bytes
> c = 125: OK, 1 bytes
> c = 126: OK, 1 bytes
> c = 127: OK, 1 bytes
> c = 128: OK, 1 bytes
> c = 129: OK, 1 bytes
> c = 130: OK, 1 bytes
> c = 131: OK, 1 bytes
> c = 132: OK, 1 bytes
> c = 133: OK, 1 bytes
> c = 134: OK, 1 bytes
> c = 135: OK, 1 bytes
> c = 136: OK, 1 bytes
> c = 137: OK, 1 bytes
> c = 138: OK, 1 bytes
> c = 139: OK, 1 bytes
> c = 140: OK, 1 bytes
> c = 141: OK, 1 bytes
> c = 142: OK, 1 bytes
> c = 143: OK, 1 bytes
> c = 144: OK, 1 bytes
> c = 145: OK, 1 bytes
> c = 146: OK, 1 bytes
> c = 147: OK, 1 bytes
> c = 148: OK, 1 bytes
> c = 149: OK, 1 bytes
> c = 150: OK, 1 bytes
> c = 151: OK, 1 bytes
> c = 152: OK, 1 bytes
> c = 153: OK, 1 bytes
> c = 154: OK, 1 bytes
> c = 155: OK, 1 bytes
> c = 156: OK, 1 bytes
> c = 157: OK, 1 bytes
> c = 158: OK, 1 bytes
> c = 159: OK, 1 bytes
> c = 160: OK, 1 bytes
> c = 161: OK, 1 bytes
> c = 162: OK, 1 bytes
> c = 163: OK, 1 bytes
> c = 164: OK, 1 bytes
> c = 165: OK, 1 bytes
> c = 166: OK, 1 bytes
> c = 167: OK, 1 bytes
> c = 168: OK, 1 bytes
> c = 169: OK, 1 bytes
> c = 170: OK, 1 bytes
> c = 171: OK, 1 bytes
> c = 172: OK, 1 bytes
> c = 173: OK, 1 bytes
> c = 174: OK, 1 bytes
> c = 175: OK, 1 bytes
> c = 176: OK, 1 bytes
> c = 177: OK, 1 bytes
> c = 178: OK, 1 bytes
> c = 179: OK, 1 bytes
> c = 180: OK, 1 bytes
> c = 181: OK, 1 bytes
> c = 182: OK, 1 bytes
> c = 183: OK, 1 bytes
> c = 184: OK, 1 bytes
> c = 185: OK, 1 bytes
> c = 186: OK, 1 bytes
> c = 187: OK, 1 bytes
> c = 188: OK, 1 bytes
> c = 189: OK, 1 bytes
> c = 190: OK, 1 bytes
> c = 191: OK, 1 bytes
> c = 192: OK, 1 bytes
> c = 193: OK, 1 bytes
> c = 194: OK, 1 bytes
> c = 195: OK, 1 bytes
> c = 196: OK, 1 bytes
> c = 197: OK, 1 bytes
> c = 198: OK, 1 bytes
> c = 199: OK, 1 bytes
> c = 200: OK, 1 bytes
> c = 201: OK, 1 bytes
> c = 202: OK, 1 bytes
> c = 203: OK, 1 bytes
> c = 204: OK, 1 bytes
> c = 205: OK, 1 bytes
> c = 206: OK, 1 bytes
> c = 207: OK, 1 bytes
> c = 208: OK, 1 bytes
> c = 209: OK, 1 bytes
> c = 210: OK, 1 bytes
> c = 211: OK, 1 bytes
> c = 212: OK, 1 bytes
> c = 213: OK, 1 bytes
> c = 214: OK, 1 bytes
> c = 215: OK, 1 bytes
> c = 216: OK, 1 bytes
> c = 217: OK, 1 bytes
> c = 218: OK, 1 bytes
> c = 219: OK, 1 bytes
> c = 220: OK, 1 bytes
> c = 221: OK, 1 bytes
> c = 222: OK, 1 bytes
> c = 223: OK, 1 bytes
> c = 224: OK, 1 bytes
> c = 225: OK, 1 bytes
> c = 226: OK, 1 bytes
> c = 227: OK, 1 bytes
> c = 228: OK, 1 bytes
> c = 229: OK, 1 bytes
> c = 230: OK, 1 bytes
> c = 231: OK, 1 bytes
> c = 232: OK, 1 bytes
> c = 233: OK, 1 bytes
> c = 234: OK, 1 bytes
> c = 235: OK, 1 bytes
> c = 236: OK, 1 bytes
> c = 237: OK, 1 bytes
> c = 238: OK, 1 bytes
> c = 239: OK, 1 bytes
> c = 240: OK, 1 bytes
> c = 241: OK, 1 bytes
> c = 242: OK, 1 bytes
> c = 243: OK, 1 bytes
> c = 244: OK, 1 bytes
> c = 245: OK, 1 bytes
> c = 246: OK, 1 bytes
> c = 247: OK, 1 bytes
> c = 248: OK, 1 bytes
> c = 249: OK, 1 bytes
> c = 250: OK, 1 bytes
> c = 251: OK, 1 bytes
> c = 252: OK, 1 bytes
> c = 253: OK, 1 bytes
> c = 254: OK, 1 bytes
> c = 255: swprintf failed: Illegal byte sequence
> 
> This is a bug, because POSIX says that in the C / POSIX locale, "all byte
> values are valid characters" [1].

Yes, this is a bug and seems to be an instance of mishandling of
signed conversions. Attached should correct it.

Rich

[-- Attachment #2: wprintf-255.diff --]
[-- Type: text/plain, Size: 478 bytes --]

diff --git a/src/stdio/vfwprintf.c b/src/stdio/vfwprintf.c
index 53697701..a653e233 100644
--- a/src/stdio/vfwprintf.c
+++ b/src/stdio/vfwprintf.c
@@ -271,7 +271,7 @@ static int wprintf_core(FILE *f, const wchar_t *fmt, va_list *ap, union arg *nl_
 		case 'C':
 			if (w<1) w=1;
 			pad(f, w-1, fl);
-			out(f, &(wchar_t){t=='C' ? arg.i : btowc(arg.i)}, 1);
+			out(f, &(wchar_t){t=='C' ? arg.i : btowc(arg.i & 0xff)}, 1);
 			pad(f, w-1, fl^LEFT_ADJ);
 			l = w;
 			continue;

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [musl] swprintf cannot handle the character 0xff
  2023-06-12 20:22 ` Rich Felker
@ 2023-06-12 21:01   ` Rich Felker
  2023-06-12 21:13     ` Rich Felker
  0 siblings, 1 reply; 7+ messages in thread
From: Rich Felker @ 2023-06-12 21:01 UTC (permalink / raw)
  To: Bruno Haible; +Cc: musl

On Mon, Jun 12, 2023 at 04:22:42PM -0400, Rich Felker wrote:
> On Mon, Jun 12, 2023 at 02:30:44PM +0200, Bruno Haible wrote:
> > When swprintf is meant to convert a character to a wide character, through
> > the %c directive, it fails if that character is '\xff'.
> > 
> > Seen with musl libc 1.2.4, in Alpine Linux 3.18.0.
> > 
> > How to reproduce:
> > ============================ foo.c ============================
> > #include <stdio.h>
> > #include <wchar.h>
> > int main ()
> > {
> >   wchar_t buf[12];
> >   for (int c = 1; c < 256; c++)
> >     {
> >       fprintf (stderr, "c = %d: ", c);
> >       int ret = swprintf (buf, 12, L"%c", c);
> >       if (ret >= 0)
> >         fprintf (stderr, "OK, %d bytes\n", ret);
> >       else
> >         perror ("swprintf failed");
> >     }
> > }
> > ===============================================================
> > $ gcc -Wall foo.c
> > $ ./a.out
> > 
> > Expected output:
> > c = 1: OK, 1 bytes
> > c = 2: OK, 1 bytes
> > c = 3: OK, 1 bytes
> > c = 4: OK, 1 bytes
> > c = 5: OK, 1 bytes
> > c = 6: OK, 1 bytes
> > c = 7: OK, 1 bytes
> > c = 8: OK, 1 bytes
> > c = 9: OK, 1 bytes
> > c = 10: OK, 1 bytes
> > c = 11: OK, 1 bytes
> > c = 12: OK, 1 bytes
> > c = 13: OK, 1 bytes
> > c = 14: OK, 1 bytes
> > c = 15: OK, 1 bytes
> > c = 16: OK, 1 bytes
> > c = 17: OK, 1 bytes
> > c = 18: OK, 1 bytes
> > c = 19: OK, 1 bytes
> > c = 20: OK, 1 bytes
> > c = 21: OK, 1 bytes
> > c = 22: OK, 1 bytes
> > c = 23: OK, 1 bytes
> > c = 24: OK, 1 bytes
> > c = 25: OK, 1 bytes
> > c = 26: OK, 1 bytes
> > c = 27: OK, 1 bytes
> > c = 28: OK, 1 bytes
> > c = 29: OK, 1 bytes
> > c = 30: OK, 1 bytes
> > c = 31: OK, 1 bytes
> > c = 32: OK, 1 bytes
> > c = 33: OK, 1 bytes
> > c = 34: OK, 1 bytes
> > c = 35: OK, 1 bytes
> > c = 36: OK, 1 bytes
> > c = 37: OK, 1 bytes
> > c = 38: OK, 1 bytes
> > c = 39: OK, 1 bytes
> > c = 40: OK, 1 bytes
> > c = 41: OK, 1 bytes
> > c = 42: OK, 1 bytes
> > c = 43: OK, 1 bytes
> > c = 44: OK, 1 bytes
> > c = 45: OK, 1 bytes
> > c = 46: OK, 1 bytes
> > c = 47: OK, 1 bytes
> > c = 48: OK, 1 bytes
> > c = 49: OK, 1 bytes
> > c = 50: OK, 1 bytes
> > c = 51: OK, 1 bytes
> > c = 52: OK, 1 bytes
> > c = 53: OK, 1 bytes
> > c = 54: OK, 1 bytes
> > c = 55: OK, 1 bytes
> > c = 56: OK, 1 bytes
> > c = 57: OK, 1 bytes
> > c = 58: OK, 1 bytes
> > c = 59: OK, 1 bytes
> > c = 60: OK, 1 bytes
> > c = 61: OK, 1 bytes
> > c = 62: OK, 1 bytes
> > c = 63: OK, 1 bytes
> > c = 64: OK, 1 bytes
> > c = 65: OK, 1 bytes
> > c = 66: OK, 1 bytes
> > c = 67: OK, 1 bytes
> > c = 68: OK, 1 bytes
> > c = 69: OK, 1 bytes
> > c = 70: OK, 1 bytes
> > c = 71: OK, 1 bytes
> > c = 72: OK, 1 bytes
> > c = 73: OK, 1 bytes
> > c = 74: OK, 1 bytes
> > c = 75: OK, 1 bytes
> > c = 76: OK, 1 bytes
> > c = 77: OK, 1 bytes
> > c = 78: OK, 1 bytes
> > c = 79: OK, 1 bytes
> > c = 80: OK, 1 bytes
> > c = 81: OK, 1 bytes
> > c = 82: OK, 1 bytes
> > c = 83: OK, 1 bytes
> > c = 84: OK, 1 bytes
> > c = 85: OK, 1 bytes
> > c = 86: OK, 1 bytes
> > c = 87: OK, 1 bytes
> > c = 88: OK, 1 bytes
> > c = 89: OK, 1 bytes
> > c = 90: OK, 1 bytes
> > c = 91: OK, 1 bytes
> > c = 92: OK, 1 bytes
> > c = 93: OK, 1 bytes
> > c = 94: OK, 1 bytes
> > c = 95: OK, 1 bytes
> > c = 96: OK, 1 bytes
> > c = 97: OK, 1 bytes
> > c = 98: OK, 1 bytes
> > c = 99: OK, 1 bytes
> > c = 100: OK, 1 bytes
> > c = 101: OK, 1 bytes
> > c = 102: OK, 1 bytes
> > c = 103: OK, 1 bytes
> > c = 104: OK, 1 bytes
> > c = 105: OK, 1 bytes
> > c = 106: OK, 1 bytes
> > c = 107: OK, 1 bytes
> > c = 108: OK, 1 bytes
> > c = 109: OK, 1 bytes
> > c = 110: OK, 1 bytes
> > c = 111: OK, 1 bytes
> > c = 112: OK, 1 bytes
> > c = 113: OK, 1 bytes
> > c = 114: OK, 1 bytes
> > c = 115: OK, 1 bytes
> > c = 116: OK, 1 bytes
> > c = 117: OK, 1 bytes
> > c = 118: OK, 1 bytes
> > c = 119: OK, 1 bytes
> > c = 120: OK, 1 bytes
> > c = 121: OK, 1 bytes
> > c = 122: OK, 1 bytes
> > c = 123: OK, 1 bytes
> > c = 124: OK, 1 bytes
> > c = 125: OK, 1 bytes
> > c = 126: OK, 1 bytes
> > c = 127: OK, 1 bytes
> > c = 128: OK, 1 bytes
> > c = 129: OK, 1 bytes
> > c = 130: OK, 1 bytes
> > c = 131: OK, 1 bytes
> > c = 132: OK, 1 bytes
> > c = 133: OK, 1 bytes
> > c = 134: OK, 1 bytes
> > c = 135: OK, 1 bytes
> > c = 136: OK, 1 bytes
> > c = 137: OK, 1 bytes
> > c = 138: OK, 1 bytes
> > c = 139: OK, 1 bytes
> > c = 140: OK, 1 bytes
> > c = 141: OK, 1 bytes
> > c = 142: OK, 1 bytes
> > c = 143: OK, 1 bytes
> > c = 144: OK, 1 bytes
> > c = 145: OK, 1 bytes
> > c = 146: OK, 1 bytes
> > c = 147: OK, 1 bytes
> > c = 148: OK, 1 bytes
> > c = 149: OK, 1 bytes
> > c = 150: OK, 1 bytes
> > c = 151: OK, 1 bytes
> > c = 152: OK, 1 bytes
> > c = 153: OK, 1 bytes
> > c = 154: OK, 1 bytes
> > c = 155: OK, 1 bytes
> > c = 156: OK, 1 bytes
> > c = 157: OK, 1 bytes
> > c = 158: OK, 1 bytes
> > c = 159: OK, 1 bytes
> > c = 160: OK, 1 bytes
> > c = 161: OK, 1 bytes
> > c = 162: OK, 1 bytes
> > c = 163: OK, 1 bytes
> > c = 164: OK, 1 bytes
> > c = 165: OK, 1 bytes
> > c = 166: OK, 1 bytes
> > c = 167: OK, 1 bytes
> > c = 168: OK, 1 bytes
> > c = 169: OK, 1 bytes
> > c = 170: OK, 1 bytes
> > c = 171: OK, 1 bytes
> > c = 172: OK, 1 bytes
> > c = 173: OK, 1 bytes
> > c = 174: OK, 1 bytes
> > c = 175: OK, 1 bytes
> > c = 176: OK, 1 bytes
> > c = 177: OK, 1 bytes
> > c = 178: OK, 1 bytes
> > c = 179: OK, 1 bytes
> > c = 180: OK, 1 bytes
> > c = 181: OK, 1 bytes
> > c = 182: OK, 1 bytes
> > c = 183: OK, 1 bytes
> > c = 184: OK, 1 bytes
> > c = 185: OK, 1 bytes
> > c = 186: OK, 1 bytes
> > c = 187: OK, 1 bytes
> > c = 188: OK, 1 bytes
> > c = 189: OK, 1 bytes
> > c = 190: OK, 1 bytes
> > c = 191: OK, 1 bytes
> > c = 192: OK, 1 bytes
> > c = 193: OK, 1 bytes
> > c = 194: OK, 1 bytes
> > c = 195: OK, 1 bytes
> > c = 196: OK, 1 bytes
> > c = 197: OK, 1 bytes
> > c = 198: OK, 1 bytes
> > c = 199: OK, 1 bytes
> > c = 200: OK, 1 bytes
> > c = 201: OK, 1 bytes
> > c = 202: OK, 1 bytes
> > c = 203: OK, 1 bytes
> > c = 204: OK, 1 bytes
> > c = 205: OK, 1 bytes
> > c = 206: OK, 1 bytes
> > c = 207: OK, 1 bytes
> > c = 208: OK, 1 bytes
> > c = 209: OK, 1 bytes
> > c = 210: OK, 1 bytes
> > c = 211: OK, 1 bytes
> > c = 212: OK, 1 bytes
> > c = 213: OK, 1 bytes
> > c = 214: OK, 1 bytes
> > c = 215: OK, 1 bytes
> > c = 216: OK, 1 bytes
> > c = 217: OK, 1 bytes
> > c = 218: OK, 1 bytes
> > c = 219: OK, 1 bytes
> > c = 220: OK, 1 bytes
> > c = 221: OK, 1 bytes
> > c = 222: OK, 1 bytes
> > c = 223: OK, 1 bytes
> > c = 224: OK, 1 bytes
> > c = 225: OK, 1 bytes
> > c = 226: OK, 1 bytes
> > c = 227: OK, 1 bytes
> > c = 228: OK, 1 bytes
> > c = 229: OK, 1 bytes
> > c = 230: OK, 1 bytes
> > c = 231: OK, 1 bytes
> > c = 232: OK, 1 bytes
> > c = 233: OK, 1 bytes
> > c = 234: OK, 1 bytes
> > c = 235: OK, 1 bytes
> > c = 236: OK, 1 bytes
> > c = 237: OK, 1 bytes
> > c = 238: OK, 1 bytes
> > c = 239: OK, 1 bytes
> > c = 240: OK, 1 bytes
> > c = 241: OK, 1 bytes
> > c = 242: OK, 1 bytes
> > c = 243: OK, 1 bytes
> > c = 244: OK, 1 bytes
> > c = 245: OK, 1 bytes
> > c = 246: OK, 1 bytes
> > c = 247: OK, 1 bytes
> > c = 248: OK, 1 bytes
> > c = 249: OK, 1 bytes
> > c = 250: OK, 1 bytes
> > c = 251: OK, 1 bytes
> > c = 252: OK, 1 bytes
> > c = 253: OK, 1 bytes
> > c = 254: OK, 1 bytes
> > c = 255: OK, 1 bytes
> > 
> > Actual output:
> > c = 1: OK, 1 bytes
> > c = 2: OK, 1 bytes
> > c = 3: OK, 1 bytes
> > c = 4: OK, 1 bytes
> > c = 5: OK, 1 bytes
> > c = 6: OK, 1 bytes
> > c = 7: OK, 1 bytes
> > c = 8: OK, 1 bytes
> > c = 9: OK, 1 bytes
> > c = 10: OK, 1 bytes
> > c = 11: OK, 1 bytes
> > c = 12: OK, 1 bytes
> > c = 13: OK, 1 bytes
> > c = 14: OK, 1 bytes
> > c = 15: OK, 1 bytes
> > c = 16: OK, 1 bytes
> > c = 17: OK, 1 bytes
> > c = 18: OK, 1 bytes
> > c = 19: OK, 1 bytes
> > c = 20: OK, 1 bytes
> > c = 21: OK, 1 bytes
> > c = 22: OK, 1 bytes
> > c = 23: OK, 1 bytes
> > c = 24: OK, 1 bytes
> > c = 25: OK, 1 bytes
> > c = 26: OK, 1 bytes
> > c = 27: OK, 1 bytes
> > c = 28: OK, 1 bytes
> > c = 29: OK, 1 bytes
> > c = 30: OK, 1 bytes
> > c = 31: OK, 1 bytes
> > c = 32: OK, 1 bytes
> > c = 33: OK, 1 bytes
> > c = 34: OK, 1 bytes
> > c = 35: OK, 1 bytes
> > c = 36: OK, 1 bytes
> > c = 37: OK, 1 bytes
> > c = 38: OK, 1 bytes
> > c = 39: OK, 1 bytes
> > c = 40: OK, 1 bytes
> > c = 41: OK, 1 bytes
> > c = 42: OK, 1 bytes
> > c = 43: OK, 1 bytes
> > c = 44: OK, 1 bytes
> > c = 45: OK, 1 bytes
> > c = 46: OK, 1 bytes
> > c = 47: OK, 1 bytes
> > c = 48: OK, 1 bytes
> > c = 49: OK, 1 bytes
> > c = 50: OK, 1 bytes
> > c = 51: OK, 1 bytes
> > c = 52: OK, 1 bytes
> > c = 53: OK, 1 bytes
> > c = 54: OK, 1 bytes
> > c = 55: OK, 1 bytes
> > c = 56: OK, 1 bytes
> > c = 57: OK, 1 bytes
> > c = 58: OK, 1 bytes
> > c = 59: OK, 1 bytes
> > c = 60: OK, 1 bytes
> > c = 61: OK, 1 bytes
> > c = 62: OK, 1 bytes
> > c = 63: OK, 1 bytes
> > c = 64: OK, 1 bytes
> > c = 65: OK, 1 bytes
> > c = 66: OK, 1 bytes
> > c = 67: OK, 1 bytes
> > c = 68: OK, 1 bytes
> > c = 69: OK, 1 bytes
> > c = 70: OK, 1 bytes
> > c = 71: OK, 1 bytes
> > c = 72: OK, 1 bytes
> > c = 73: OK, 1 bytes
> > c = 74: OK, 1 bytes
> > c = 75: OK, 1 bytes
> > c = 76: OK, 1 bytes
> > c = 77: OK, 1 bytes
> > c = 78: OK, 1 bytes
> > c = 79: OK, 1 bytes
> > c = 80: OK, 1 bytes
> > c = 81: OK, 1 bytes
> > c = 82: OK, 1 bytes
> > c = 83: OK, 1 bytes
> > c = 84: OK, 1 bytes
> > c = 85: OK, 1 bytes
> > c = 86: OK, 1 bytes
> > c = 87: OK, 1 bytes
> > c = 88: OK, 1 bytes
> > c = 89: OK, 1 bytes
> > c = 90: OK, 1 bytes
> > c = 91: OK, 1 bytes
> > c = 92: OK, 1 bytes
> > c = 93: OK, 1 bytes
> > c = 94: OK, 1 bytes
> > c = 95: OK, 1 bytes
> > c = 96: OK, 1 bytes
> > c = 97: OK, 1 bytes
> > c = 98: OK, 1 bytes
> > c = 99: OK, 1 bytes
> > c = 100: OK, 1 bytes
> > c = 101: OK, 1 bytes
> > c = 102: OK, 1 bytes
> > c = 103: OK, 1 bytes
> > c = 104: OK, 1 bytes
> > c = 105: OK, 1 bytes
> > c = 106: OK, 1 bytes
> > c = 107: OK, 1 bytes
> > c = 108: OK, 1 bytes
> > c = 109: OK, 1 bytes
> > c = 110: OK, 1 bytes
> > c = 111: OK, 1 bytes
> > c = 112: OK, 1 bytes
> > c = 113: OK, 1 bytes
> > c = 114: OK, 1 bytes
> > c = 115: OK, 1 bytes
> > c = 116: OK, 1 bytes
> > c = 117: OK, 1 bytes
> > c = 118: OK, 1 bytes
> > c = 119: OK, 1 bytes
> > c = 120: OK, 1 bytes
> > c = 121: OK, 1 bytes
> > c = 122: OK, 1 bytes
> > c = 123: OK, 1 bytes
> > c = 124: OK, 1 bytes
> > c = 125: OK, 1 bytes
> > c = 126: OK, 1 bytes
> > c = 127: OK, 1 bytes
> > c = 128: OK, 1 bytes
> > c = 129: OK, 1 bytes
> > c = 130: OK, 1 bytes
> > c = 131: OK, 1 bytes
> > c = 132: OK, 1 bytes
> > c = 133: OK, 1 bytes
> > c = 134: OK, 1 bytes
> > c = 135: OK, 1 bytes
> > c = 136: OK, 1 bytes
> > c = 137: OK, 1 bytes
> > c = 138: OK, 1 bytes
> > c = 139: OK, 1 bytes
> > c = 140: OK, 1 bytes
> > c = 141: OK, 1 bytes
> > c = 142: OK, 1 bytes
> > c = 143: OK, 1 bytes
> > c = 144: OK, 1 bytes
> > c = 145: OK, 1 bytes
> > c = 146: OK, 1 bytes
> > c = 147: OK, 1 bytes
> > c = 148: OK, 1 bytes
> > c = 149: OK, 1 bytes
> > c = 150: OK, 1 bytes
> > c = 151: OK, 1 bytes
> > c = 152: OK, 1 bytes
> > c = 153: OK, 1 bytes
> > c = 154: OK, 1 bytes
> > c = 155: OK, 1 bytes
> > c = 156: OK, 1 bytes
> > c = 157: OK, 1 bytes
> > c = 158: OK, 1 bytes
> > c = 159: OK, 1 bytes
> > c = 160: OK, 1 bytes
> > c = 161: OK, 1 bytes
> > c = 162: OK, 1 bytes
> > c = 163: OK, 1 bytes
> > c = 164: OK, 1 bytes
> > c = 165: OK, 1 bytes
> > c = 166: OK, 1 bytes
> > c = 167: OK, 1 bytes
> > c = 168: OK, 1 bytes
> > c = 169: OK, 1 bytes
> > c = 170: OK, 1 bytes
> > c = 171: OK, 1 bytes
> > c = 172: OK, 1 bytes
> > c = 173: OK, 1 bytes
> > c = 174: OK, 1 bytes
> > c = 175: OK, 1 bytes
> > c = 176: OK, 1 bytes
> > c = 177: OK, 1 bytes
> > c = 178: OK, 1 bytes
> > c = 179: OK, 1 bytes
> > c = 180: OK, 1 bytes
> > c = 181: OK, 1 bytes
> > c = 182: OK, 1 bytes
> > c = 183: OK, 1 bytes
> > c = 184: OK, 1 bytes
> > c = 185: OK, 1 bytes
> > c = 186: OK, 1 bytes
> > c = 187: OK, 1 bytes
> > c = 188: OK, 1 bytes
> > c = 189: OK, 1 bytes
> > c = 190: OK, 1 bytes
> > c = 191: OK, 1 bytes
> > c = 192: OK, 1 bytes
> > c = 193: OK, 1 bytes
> > c = 194: OK, 1 bytes
> > c = 195: OK, 1 bytes
> > c = 196: OK, 1 bytes
> > c = 197: OK, 1 bytes
> > c = 198: OK, 1 bytes
> > c = 199: OK, 1 bytes
> > c = 200: OK, 1 bytes
> > c = 201: OK, 1 bytes
> > c = 202: OK, 1 bytes
> > c = 203: OK, 1 bytes
> > c = 204: OK, 1 bytes
> > c = 205: OK, 1 bytes
> > c = 206: OK, 1 bytes
> > c = 207: OK, 1 bytes
> > c = 208: OK, 1 bytes
> > c = 209: OK, 1 bytes
> > c = 210: OK, 1 bytes
> > c = 211: OK, 1 bytes
> > c = 212: OK, 1 bytes
> > c = 213: OK, 1 bytes
> > c = 214: OK, 1 bytes
> > c = 215: OK, 1 bytes
> > c = 216: OK, 1 bytes
> > c = 217: OK, 1 bytes
> > c = 218: OK, 1 bytes
> > c = 219: OK, 1 bytes
> > c = 220: OK, 1 bytes
> > c = 221: OK, 1 bytes
> > c = 222: OK, 1 bytes
> > c = 223: OK, 1 bytes
> > c = 224: OK, 1 bytes
> > c = 225: OK, 1 bytes
> > c = 226: OK, 1 bytes
> > c = 227: OK, 1 bytes
> > c = 228: OK, 1 bytes
> > c = 229: OK, 1 bytes
> > c = 230: OK, 1 bytes
> > c = 231: OK, 1 bytes
> > c = 232: OK, 1 bytes
> > c = 233: OK, 1 bytes
> > c = 234: OK, 1 bytes
> > c = 235: OK, 1 bytes
> > c = 236: OK, 1 bytes
> > c = 237: OK, 1 bytes
> > c = 238: OK, 1 bytes
> > c = 239: OK, 1 bytes
> > c = 240: OK, 1 bytes
> > c = 241: OK, 1 bytes
> > c = 242: OK, 1 bytes
> > c = 243: OK, 1 bytes
> > c = 244: OK, 1 bytes
> > c = 245: OK, 1 bytes
> > c = 246: OK, 1 bytes
> > c = 247: OK, 1 bytes
> > c = 248: OK, 1 bytes
> > c = 249: OK, 1 bytes
> > c = 250: OK, 1 bytes
> > c = 251: OK, 1 bytes
> > c = 252: OK, 1 bytes
> > c = 253: OK, 1 bytes
> > c = 254: OK, 1 bytes
> > c = 255: swprintf failed: Illegal byte sequence
> > 
> > This is a bug, because POSIX says that in the C / POSIX locale, "all byte
> > values are valid characters" [1].
> 
> Yes, this is a bug and seems to be an instance of mishandling of
> signed conversions. Attached should correct it.
> 
> Rich

> diff --git a/src/stdio/vfwprintf.c b/src/stdio/vfwprintf.c
> index 53697701..a653e233 100644
> --- a/src/stdio/vfwprintf.c
> +++ b/src/stdio/vfwprintf.c
> @@ -271,7 +271,7 @@ static int wprintf_core(FILE *f, const wchar_t *fmt, va_list *ap, union arg *nl_
>  		case 'C':
>  			if (w<1) w=1;
>  			pad(f, w-1, fl);
> -			out(f, &(wchar_t){t=='C' ? arg.i : btowc(arg.i)}, 1);
> +			out(f, &(wchar_t){t=='C' ? arg.i : btowc(arg.i & 0xff)}, 1);
>  			pad(f, w-1, fl^LEFT_ADJ);
>  			l = w;
>  			continue;

Hmm -- while this works, formally, %c takes an argument of type int
not char, so I think we should probably actually change the state
machine for both narrow and wide printf to terminate with state INT
rather than CHAR for 'c'. And likewise, %lc/%C takes wint_t, which has
type unsigned, so although it doesn't matter the state machine should
terminate with state UINT rather than INT (wint_t is unsigned).

Rich

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [musl] swprintf cannot handle the character 0xff
  2023-06-12 21:01   ` Rich Felker
@ 2023-06-12 21:13     ` Rich Felker
  2023-06-12 23:09       ` Bruno Haible
  0 siblings, 1 reply; 7+ messages in thread
From: Rich Felker @ 2023-06-12 21:13 UTC (permalink / raw)
  To: Bruno Haible; +Cc: musl

On Mon, Jun 12, 2023 at 05:01:59PM -0400, Rich Felker wrote:
> On Mon, Jun 12, 2023 at 04:22:42PM -0400, Rich Felker wrote:
> > On Mon, Jun 12, 2023 at 02:30:44PM +0200, Bruno Haible wrote:
> > > When swprintf is meant to convert a character to a wide character, through
> > > the %c directive, it fails if that character is '\xff'.
> > > 
> > > Seen with musl libc 1.2.4, in Alpine Linux 3.18.0.
> > > 
> > > How to reproduce:
> > > ============================ foo.c ============================
> > > #include <stdio.h>
> > > #include <wchar.h>
> > > int main ()
> > > {
> > >   wchar_t buf[12];
> > >   for (int c = 1; c < 256; c++)
> > >     {
> > >       fprintf (stderr, "c = %d: ", c);
> > >       int ret = swprintf (buf, 12, L"%c", c);
> > >       if (ret >= 0)
> > >         fprintf (stderr, "OK, %d bytes\n", ret);
> > >       else
> > >         perror ("swprintf failed");
> > >     }
> > > }
> > > ===============================================================
> > > $ gcc -Wall foo.c
> > > $ ./a.out
> > > 
> > > Expected output:
> > > c = 1: OK, 1 bytes
> > > c = 2: OK, 1 bytes
> > > c = 3: OK, 1 bytes
> > > c = 4: OK, 1 bytes
> > > c = 5: OK, 1 bytes
> > > c = 6: OK, 1 bytes
> > > c = 7: OK, 1 bytes
> > > c = 8: OK, 1 bytes
> > > c = 9: OK, 1 bytes
> > > c = 10: OK, 1 bytes
> > > c = 11: OK, 1 bytes
> > > c = 12: OK, 1 bytes
> > > c = 13: OK, 1 bytes
> > > c = 14: OK, 1 bytes
> > > c = 15: OK, 1 bytes
> > > c = 16: OK, 1 bytes
> > > c = 17: OK, 1 bytes
> > > c = 18: OK, 1 bytes
> > > c = 19: OK, 1 bytes
> > > c = 20: OK, 1 bytes
> > > c = 21: OK, 1 bytes
> > > c = 22: OK, 1 bytes
> > > c = 23: OK, 1 bytes
> > > c = 24: OK, 1 bytes
> > > c = 25: OK, 1 bytes
> > > c = 26: OK, 1 bytes
> > > c = 27: OK, 1 bytes
> > > c = 28: OK, 1 bytes
> > > c = 29: OK, 1 bytes
> > > c = 30: OK, 1 bytes
> > > c = 31: OK, 1 bytes
> > > c = 32: OK, 1 bytes
> > > c = 33: OK, 1 bytes
> > > c = 34: OK, 1 bytes
> > > c = 35: OK, 1 bytes
> > > c = 36: OK, 1 bytes
> > > c = 37: OK, 1 bytes
> > > c = 38: OK, 1 bytes
> > > c = 39: OK, 1 bytes
> > > c = 40: OK, 1 bytes
> > > c = 41: OK, 1 bytes
> > > c = 42: OK, 1 bytes
> > > c = 43: OK, 1 bytes
> > > c = 44: OK, 1 bytes
> > > c = 45: OK, 1 bytes
> > > c = 46: OK, 1 bytes
> > > c = 47: OK, 1 bytes
> > > c = 48: OK, 1 bytes
> > > c = 49: OK, 1 bytes
> > > c = 50: OK, 1 bytes
> > > c = 51: OK, 1 bytes
> > > c = 52: OK, 1 bytes
> > > c = 53: OK, 1 bytes
> > > c = 54: OK, 1 bytes
> > > c = 55: OK, 1 bytes
> > > c = 56: OK, 1 bytes
> > > c = 57: OK, 1 bytes
> > > c = 58: OK, 1 bytes
> > > c = 59: OK, 1 bytes
> > > c = 60: OK, 1 bytes
> > > c = 61: OK, 1 bytes
> > > c = 62: OK, 1 bytes
> > > c = 63: OK, 1 bytes
> > > c = 64: OK, 1 bytes
> > > c = 65: OK, 1 bytes
> > > c = 66: OK, 1 bytes
> > > c = 67: OK, 1 bytes
> > > c = 68: OK, 1 bytes
> > > c = 69: OK, 1 bytes
> > > c = 70: OK, 1 bytes
> > > c = 71: OK, 1 bytes
> > > c = 72: OK, 1 bytes
> > > c = 73: OK, 1 bytes
> > > c = 74: OK, 1 bytes
> > > c = 75: OK, 1 bytes
> > > c = 76: OK, 1 bytes
> > > c = 77: OK, 1 bytes
> > > c = 78: OK, 1 bytes
> > > c = 79: OK, 1 bytes
> > > c = 80: OK, 1 bytes
> > > c = 81: OK, 1 bytes
> > > c = 82: OK, 1 bytes
> > > c = 83: OK, 1 bytes
> > > c = 84: OK, 1 bytes
> > > c = 85: OK, 1 bytes
> > > c = 86: OK, 1 bytes
> > > c = 87: OK, 1 bytes
> > > c = 88: OK, 1 bytes
> > > c = 89: OK, 1 bytes
> > > c = 90: OK, 1 bytes
> > > c = 91: OK, 1 bytes
> > > c = 92: OK, 1 bytes
> > > c = 93: OK, 1 bytes
> > > c = 94: OK, 1 bytes
> > > c = 95: OK, 1 bytes
> > > c = 96: OK, 1 bytes
> > > c = 97: OK, 1 bytes
> > > c = 98: OK, 1 bytes
> > > c = 99: OK, 1 bytes
> > > c = 100: OK, 1 bytes
> > > c = 101: OK, 1 bytes
> > > c = 102: OK, 1 bytes
> > > c = 103: OK, 1 bytes
> > > c = 104: OK, 1 bytes
> > > c = 105: OK, 1 bytes
> > > c = 106: OK, 1 bytes
> > > c = 107: OK, 1 bytes
> > > c = 108: OK, 1 bytes
> > > c = 109: OK, 1 bytes
> > > c = 110: OK, 1 bytes
> > > c = 111: OK, 1 bytes
> > > c = 112: OK, 1 bytes
> > > c = 113: OK, 1 bytes
> > > c = 114: OK, 1 bytes
> > > c = 115: OK, 1 bytes
> > > c = 116: OK, 1 bytes
> > > c = 117: OK, 1 bytes
> > > c = 118: OK, 1 bytes
> > > c = 119: OK, 1 bytes
> > > c = 120: OK, 1 bytes
> > > c = 121: OK, 1 bytes
> > > c = 122: OK, 1 bytes
> > > c = 123: OK, 1 bytes
> > > c = 124: OK, 1 bytes
> > > c = 125: OK, 1 bytes
> > > c = 126: OK, 1 bytes
> > > c = 127: OK, 1 bytes
> > > c = 128: OK, 1 bytes
> > > c = 129: OK, 1 bytes
> > > c = 130: OK, 1 bytes
> > > c = 131: OK, 1 bytes
> > > c = 132: OK, 1 bytes
> > > c = 133: OK, 1 bytes
> > > c = 134: OK, 1 bytes
> > > c = 135: OK, 1 bytes
> > > c = 136: OK, 1 bytes
> > > c = 137: OK, 1 bytes
> > > c = 138: OK, 1 bytes
> > > c = 139: OK, 1 bytes
> > > c = 140: OK, 1 bytes
> > > c = 141: OK, 1 bytes
> > > c = 142: OK, 1 bytes
> > > c = 143: OK, 1 bytes
> > > c = 144: OK, 1 bytes
> > > c = 145: OK, 1 bytes
> > > c = 146: OK, 1 bytes
> > > c = 147: OK, 1 bytes
> > > c = 148: OK, 1 bytes
> > > c = 149: OK, 1 bytes
> > > c = 150: OK, 1 bytes
> > > c = 151: OK, 1 bytes
> > > c = 152: OK, 1 bytes
> > > c = 153: OK, 1 bytes
> > > c = 154: OK, 1 bytes
> > > c = 155: OK, 1 bytes
> > > c = 156: OK, 1 bytes
> > > c = 157: OK, 1 bytes
> > > c = 158: OK, 1 bytes
> > > c = 159: OK, 1 bytes
> > > c = 160: OK, 1 bytes
> > > c = 161: OK, 1 bytes
> > > c = 162: OK, 1 bytes
> > > c = 163: OK, 1 bytes
> > > c = 164: OK, 1 bytes
> > > c = 165: OK, 1 bytes
> > > c = 166: OK, 1 bytes
> > > c = 167: OK, 1 bytes
> > > c = 168: OK, 1 bytes
> > > c = 169: OK, 1 bytes
> > > c = 170: OK, 1 bytes
> > > c = 171: OK, 1 bytes
> > > c = 172: OK, 1 bytes
> > > c = 173: OK, 1 bytes
> > > c = 174: OK, 1 bytes
> > > c = 175: OK, 1 bytes
> > > c = 176: OK, 1 bytes
> > > c = 177: OK, 1 bytes
> > > c = 178: OK, 1 bytes
> > > c = 179: OK, 1 bytes
> > > c = 180: OK, 1 bytes
> > > c = 181: OK, 1 bytes
> > > c = 182: OK, 1 bytes
> > > c = 183: OK, 1 bytes
> > > c = 184: OK, 1 bytes
> > > c = 185: OK, 1 bytes
> > > c = 186: OK, 1 bytes
> > > c = 187: OK, 1 bytes
> > > c = 188: OK, 1 bytes
> > > c = 189: OK, 1 bytes
> > > c = 190: OK, 1 bytes
> > > c = 191: OK, 1 bytes
> > > c = 192: OK, 1 bytes
> > > c = 193: OK, 1 bytes
> > > c = 194: OK, 1 bytes
> > > c = 195: OK, 1 bytes
> > > c = 196: OK, 1 bytes
> > > c = 197: OK, 1 bytes
> > > c = 198: OK, 1 bytes
> > > c = 199: OK, 1 bytes
> > > c = 200: OK, 1 bytes
> > > c = 201: OK, 1 bytes
> > > c = 202: OK, 1 bytes
> > > c = 203: OK, 1 bytes
> > > c = 204: OK, 1 bytes
> > > c = 205: OK, 1 bytes
> > > c = 206: OK, 1 bytes
> > > c = 207: OK, 1 bytes
> > > c = 208: OK, 1 bytes
> > > c = 209: OK, 1 bytes
> > > c = 210: OK, 1 bytes
> > > c = 211: OK, 1 bytes
> > > c = 212: OK, 1 bytes
> > > c = 213: OK, 1 bytes
> > > c = 214: OK, 1 bytes
> > > c = 215: OK, 1 bytes
> > > c = 216: OK, 1 bytes
> > > c = 217: OK, 1 bytes
> > > c = 218: OK, 1 bytes
> > > c = 219: OK, 1 bytes
> > > c = 220: OK, 1 bytes
> > > c = 221: OK, 1 bytes
> > > c = 222: OK, 1 bytes
> > > c = 223: OK, 1 bytes
> > > c = 224: OK, 1 bytes
> > > c = 225: OK, 1 bytes
> > > c = 226: OK, 1 bytes
> > > c = 227: OK, 1 bytes
> > > c = 228: OK, 1 bytes
> > > c = 229: OK, 1 bytes
> > > c = 230: OK, 1 bytes
> > > c = 231: OK, 1 bytes
> > > c = 232: OK, 1 bytes
> > > c = 233: OK, 1 bytes
> > > c = 234: OK, 1 bytes
> > > c = 235: OK, 1 bytes
> > > c = 236: OK, 1 bytes
> > > c = 237: OK, 1 bytes
> > > c = 238: OK, 1 bytes
> > > c = 239: OK, 1 bytes
> > > c = 240: OK, 1 bytes
> > > c = 241: OK, 1 bytes
> > > c = 242: OK, 1 bytes
> > > c = 243: OK, 1 bytes
> > > c = 244: OK, 1 bytes
> > > c = 245: OK, 1 bytes
> > > c = 246: OK, 1 bytes
> > > c = 247: OK, 1 bytes
> > > c = 248: OK, 1 bytes
> > > c = 249: OK, 1 bytes
> > > c = 250: OK, 1 bytes
> > > c = 251: OK, 1 bytes
> > > c = 252: OK, 1 bytes
> > > c = 253: OK, 1 bytes
> > > c = 254: OK, 1 bytes
> > > c = 255: OK, 1 bytes
> > > 
> > > Actual output:
> > > c = 1: OK, 1 bytes
> > > c = 2: OK, 1 bytes
> > > c = 3: OK, 1 bytes
> > > c = 4: OK, 1 bytes
> > > c = 5: OK, 1 bytes
> > > c = 6: OK, 1 bytes
> > > c = 7: OK, 1 bytes
> > > c = 8: OK, 1 bytes
> > > c = 9: OK, 1 bytes
> > > c = 10: OK, 1 bytes
> > > c = 11: OK, 1 bytes
> > > c = 12: OK, 1 bytes
> > > c = 13: OK, 1 bytes
> > > c = 14: OK, 1 bytes
> > > c = 15: OK, 1 bytes
> > > c = 16: OK, 1 bytes
> > > c = 17: OK, 1 bytes
> > > c = 18: OK, 1 bytes
> > > c = 19: OK, 1 bytes
> > > c = 20: OK, 1 bytes
> > > c = 21: OK, 1 bytes
> > > c = 22: OK, 1 bytes
> > > c = 23: OK, 1 bytes
> > > c = 24: OK, 1 bytes
> > > c = 25: OK, 1 bytes
> > > c = 26: OK, 1 bytes
> > > c = 27: OK, 1 bytes
> > > c = 28: OK, 1 bytes
> > > c = 29: OK, 1 bytes
> > > c = 30: OK, 1 bytes
> > > c = 31: OK, 1 bytes
> > > c = 32: OK, 1 bytes
> > > c = 33: OK, 1 bytes
> > > c = 34: OK, 1 bytes
> > > c = 35: OK, 1 bytes
> > > c = 36: OK, 1 bytes
> > > c = 37: OK, 1 bytes
> > > c = 38: OK, 1 bytes
> > > c = 39: OK, 1 bytes
> > > c = 40: OK, 1 bytes
> > > c = 41: OK, 1 bytes
> > > c = 42: OK, 1 bytes
> > > c = 43: OK, 1 bytes
> > > c = 44: OK, 1 bytes
> > > c = 45: OK, 1 bytes
> > > c = 46: OK, 1 bytes
> > > c = 47: OK, 1 bytes
> > > c = 48: OK, 1 bytes
> > > c = 49: OK, 1 bytes
> > > c = 50: OK, 1 bytes
> > > c = 51: OK, 1 bytes
> > > c = 52: OK, 1 bytes
> > > c = 53: OK, 1 bytes
> > > c = 54: OK, 1 bytes
> > > c = 55: OK, 1 bytes
> > > c = 56: OK, 1 bytes
> > > c = 57: OK, 1 bytes
> > > c = 58: OK, 1 bytes
> > > c = 59: OK, 1 bytes
> > > c = 60: OK, 1 bytes
> > > c = 61: OK, 1 bytes
> > > c = 62: OK, 1 bytes
> > > c = 63: OK, 1 bytes
> > > c = 64: OK, 1 bytes
> > > c = 65: OK, 1 bytes
> > > c = 66: OK, 1 bytes
> > > c = 67: OK, 1 bytes
> > > c = 68: OK, 1 bytes
> > > c = 69: OK, 1 bytes
> > > c = 70: OK, 1 bytes
> > > c = 71: OK, 1 bytes
> > > c = 72: OK, 1 bytes
> > > c = 73: OK, 1 bytes
> > > c = 74: OK, 1 bytes
> > > c = 75: OK, 1 bytes
> > > c = 76: OK, 1 bytes
> > > c = 77: OK, 1 bytes
> > > c = 78: OK, 1 bytes
> > > c = 79: OK, 1 bytes
> > > c = 80: OK, 1 bytes
> > > c = 81: OK, 1 bytes
> > > c = 82: OK, 1 bytes
> > > c = 83: OK, 1 bytes
> > > c = 84: OK, 1 bytes
> > > c = 85: OK, 1 bytes
> > > c = 86: OK, 1 bytes
> > > c = 87: OK, 1 bytes
> > > c = 88: OK, 1 bytes
> > > c = 89: OK, 1 bytes
> > > c = 90: OK, 1 bytes
> > > c = 91: OK, 1 bytes
> > > c = 92: OK, 1 bytes
> > > c = 93: OK, 1 bytes
> > > c = 94: OK, 1 bytes
> > > c = 95: OK, 1 bytes
> > > c = 96: OK, 1 bytes
> > > c = 97: OK, 1 bytes
> > > c = 98: OK, 1 bytes
> > > c = 99: OK, 1 bytes
> > > c = 100: OK, 1 bytes
> > > c = 101: OK, 1 bytes
> > > c = 102: OK, 1 bytes
> > > c = 103: OK, 1 bytes
> > > c = 104: OK, 1 bytes
> > > c = 105: OK, 1 bytes
> > > c = 106: OK, 1 bytes
> > > c = 107: OK, 1 bytes
> > > c = 108: OK, 1 bytes
> > > c = 109: OK, 1 bytes
> > > c = 110: OK, 1 bytes
> > > c = 111: OK, 1 bytes
> > > c = 112: OK, 1 bytes
> > > c = 113: OK, 1 bytes
> > > c = 114: OK, 1 bytes
> > > c = 115: OK, 1 bytes
> > > c = 116: OK, 1 bytes
> > > c = 117: OK, 1 bytes
> > > c = 118: OK, 1 bytes
> > > c = 119: OK, 1 bytes
> > > c = 120: OK, 1 bytes
> > > c = 121: OK, 1 bytes
> > > c = 122: OK, 1 bytes
> > > c = 123: OK, 1 bytes
> > > c = 124: OK, 1 bytes
> > > c = 125: OK, 1 bytes
> > > c = 126: OK, 1 bytes
> > > c = 127: OK, 1 bytes
> > > c = 128: OK, 1 bytes
> > > c = 129: OK, 1 bytes
> > > c = 130: OK, 1 bytes
> > > c = 131: OK, 1 bytes
> > > c = 132: OK, 1 bytes
> > > c = 133: OK, 1 bytes
> > > c = 134: OK, 1 bytes
> > > c = 135: OK, 1 bytes
> > > c = 136: OK, 1 bytes
> > > c = 137: OK, 1 bytes
> > > c = 138: OK, 1 bytes
> > > c = 139: OK, 1 bytes
> > > c = 140: OK, 1 bytes
> > > c = 141: OK, 1 bytes
> > > c = 142: OK, 1 bytes
> > > c = 143: OK, 1 bytes
> > > c = 144: OK, 1 bytes
> > > c = 145: OK, 1 bytes
> > > c = 146: OK, 1 bytes
> > > c = 147: OK, 1 bytes
> > > c = 148: OK, 1 bytes
> > > c = 149: OK, 1 bytes
> > > c = 150: OK, 1 bytes
> > > c = 151: OK, 1 bytes
> > > c = 152: OK, 1 bytes
> > > c = 153: OK, 1 bytes
> > > c = 154: OK, 1 bytes
> > > c = 155: OK, 1 bytes
> > > c = 156: OK, 1 bytes
> > > c = 157: OK, 1 bytes
> > > c = 158: OK, 1 bytes
> > > c = 159: OK, 1 bytes
> > > c = 160: OK, 1 bytes
> > > c = 161: OK, 1 bytes
> > > c = 162: OK, 1 bytes
> > > c = 163: OK, 1 bytes
> > > c = 164: OK, 1 bytes
> > > c = 165: OK, 1 bytes
> > > c = 166: OK, 1 bytes
> > > c = 167: OK, 1 bytes
> > > c = 168: OK, 1 bytes
> > > c = 169: OK, 1 bytes
> > > c = 170: OK, 1 bytes
> > > c = 171: OK, 1 bytes
> > > c = 172: OK, 1 bytes
> > > c = 173: OK, 1 bytes
> > > c = 174: OK, 1 bytes
> > > c = 175: OK, 1 bytes
> > > c = 176: OK, 1 bytes
> > > c = 177: OK, 1 bytes
> > > c = 178: OK, 1 bytes
> > > c = 179: OK, 1 bytes
> > > c = 180: OK, 1 bytes
> > > c = 181: OK, 1 bytes
> > > c = 182: OK, 1 bytes
> > > c = 183: OK, 1 bytes
> > > c = 184: OK, 1 bytes
> > > c = 185: OK, 1 bytes
> > > c = 186: OK, 1 bytes
> > > c = 187: OK, 1 bytes
> > > c = 188: OK, 1 bytes
> > > c = 189: OK, 1 bytes
> > > c = 190: OK, 1 bytes
> > > c = 191: OK, 1 bytes
> > > c = 192: OK, 1 bytes
> > > c = 193: OK, 1 bytes
> > > c = 194: OK, 1 bytes
> > > c = 195: OK, 1 bytes
> > > c = 196: OK, 1 bytes
> > > c = 197: OK, 1 bytes
> > > c = 198: OK, 1 bytes
> > > c = 199: OK, 1 bytes
> > > c = 200: OK, 1 bytes
> > > c = 201: OK, 1 bytes
> > > c = 202: OK, 1 bytes
> > > c = 203: OK, 1 bytes
> > > c = 204: OK, 1 bytes
> > > c = 205: OK, 1 bytes
> > > c = 206: OK, 1 bytes
> > > c = 207: OK, 1 bytes
> > > c = 208: OK, 1 bytes
> > > c = 209: OK, 1 bytes
> > > c = 210: OK, 1 bytes
> > > c = 211: OK, 1 bytes
> > > c = 212: OK, 1 bytes
> > > c = 213: OK, 1 bytes
> > > c = 214: OK, 1 bytes
> > > c = 215: OK, 1 bytes
> > > c = 216: OK, 1 bytes
> > > c = 217: OK, 1 bytes
> > > c = 218: OK, 1 bytes
> > > c = 219: OK, 1 bytes
> > > c = 220: OK, 1 bytes
> > > c = 221: OK, 1 bytes
> > > c = 222: OK, 1 bytes
> > > c = 223: OK, 1 bytes
> > > c = 224: OK, 1 bytes
> > > c = 225: OK, 1 bytes
> > > c = 226: OK, 1 bytes
> > > c = 227: OK, 1 bytes
> > > c = 228: OK, 1 bytes
> > > c = 229: OK, 1 bytes
> > > c = 230: OK, 1 bytes
> > > c = 231: OK, 1 bytes
> > > c = 232: OK, 1 bytes
> > > c = 233: OK, 1 bytes
> > > c = 234: OK, 1 bytes
> > > c = 235: OK, 1 bytes
> > > c = 236: OK, 1 bytes
> > > c = 237: OK, 1 bytes
> > > c = 238: OK, 1 bytes
> > > c = 239: OK, 1 bytes
> > > c = 240: OK, 1 bytes
> > > c = 241: OK, 1 bytes
> > > c = 242: OK, 1 bytes
> > > c = 243: OK, 1 bytes
> > > c = 244: OK, 1 bytes
> > > c = 245: OK, 1 bytes
> > > c = 246: OK, 1 bytes
> > > c = 247: OK, 1 bytes
> > > c = 248: OK, 1 bytes
> > > c = 249: OK, 1 bytes
> > > c = 250: OK, 1 bytes
> > > c = 251: OK, 1 bytes
> > > c = 252: OK, 1 bytes
> > > c = 253: OK, 1 bytes
> > > c = 254: OK, 1 bytes
> > > c = 255: swprintf failed: Illegal byte sequence
> > > 
> > > This is a bug, because POSIX says that in the C / POSIX locale, "all byte
> > > values are valid characters" [1].
> > 
> > Yes, this is a bug and seems to be an instance of mishandling of
> > signed conversions. Attached should correct it.
> > 
> > Rich
> 
> > diff --git a/src/stdio/vfwprintf.c b/src/stdio/vfwprintf.c
> > index 53697701..a653e233 100644
> > --- a/src/stdio/vfwprintf.c
> > +++ b/src/stdio/vfwprintf.c
> > @@ -271,7 +271,7 @@ static int wprintf_core(FILE *f, const wchar_t *fmt, va_list *ap, union arg *nl_
> >  		case 'C':
> >  			if (w<1) w=1;
> >  			pad(f, w-1, fl);
> > -			out(f, &(wchar_t){t=='C' ? arg.i : btowc(arg.i)}, 1);
> > +			out(f, &(wchar_t){t=='C' ? arg.i : btowc(arg.i & 0xff)}, 1);
> >  			pad(f, w-1, fl^LEFT_ADJ);
> >  			l = w;
> >  			continue;
> 
> Hmm -- while this works, formally, %c takes an argument of type int
> not char, so I think we should probably actually change the state
> machine for both narrow and wide printf to terminate with state INT
> rather than CHAR for 'c'. And likewise, %lc/%C takes wint_t, which has
> type unsigned, so although it doesn't matter the state machine should
> terminate with state UINT rather than INT (wint_t is unsigned).

And the plot thickens...

For narrow printf, it's required to work even if you pass -1:

    c
    The int argument shall be converted to an unsigned char, and the
    resulting byte shall be written.

This already works regardless of what we do just by assignment into a
char array.

However, for wide printf:

    c
    If no l (ell) qualifier is present, the int argument shall be
    converted to a wide character as if by calling the btowc()
    function and the resulting wide character shall be written.

There's no specification of what happens if btowc fails here, but
passing EOF to btowc is required to fail and return WEOF. It can also
fail depending on locale.

Rich

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [musl] swprintf cannot handle the character 0xff
  2023-06-12 21:13     ` Rich Felker
@ 2023-06-12 23:09       ` Bruno Haible
  2023-06-12 23:54         ` Rich Felker
  0 siblings, 1 reply; 7+ messages in thread
From: Bruno Haible @ 2023-06-12 23:09 UTC (permalink / raw)
  To: Rich Felker; +Cc: musl

Rich Felker wrote:
> However, for wide printf:
> 
>     c
>     If no l (ell) qualifier is present, the int argument shall be
>     converted to a wide character as if by calling the btowc()
>     function and the resulting wide character shall be written.
> 
> There's no specification of what happens if btowc fails here, but
> passing EOF to btowc is required to fail and return WEOF.

Possibly. But in the test program that I provided, I pass 255, not
-1 (= EOF).

It's well-known that the preferred way to convert a 'char' to 'int'
is not by direct assigment/cast, but by casting to 'unsigned char'.
That's well-known from [f]getc(), the <ctype.h> functions, etc.
You don't need to particularly care about programmers who pass
'\xff' to a function that expects an 'int'.

Bruno




^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [musl] swprintf cannot handle the character 0xff
  2023-06-12 23:09       ` Bruno Haible
@ 2023-06-12 23:54         ` Rich Felker
  2023-06-13  0:23           ` Bruno Haible
  0 siblings, 1 reply; 7+ messages in thread
From: Rich Felker @ 2023-06-12 23:54 UTC (permalink / raw)
  To: Bruno Haible; +Cc: musl

On Tue, Jun 13, 2023 at 01:09:26AM +0200, Bruno Haible wrote:
> Rich Felker wrote:
> > However, for wide printf:
> > 
> >     c
> >     If no l (ell) qualifier is present, the int argument shall be
> >     converted to a wide character as if by calling the btowc()
> >     function and the resulting wide character shall be written.
> > 
> > There's no specification of what happens if btowc fails here, but
> > passing EOF to btowc is required to fail and return WEOF.
> 
> Possibly. But in the test program that I provided, I pass 255, not
> -1 (= EOF).

Right. I'm not questioning that your bug report is correct, just my
initial proposal for fixing it. Since wide printf is supposed to
perform the conversion as if by btowc, it should presumably handle -1
(as opposed to 255) as an error path.

> It's well-known that the preferred way to convert a 'char' to 'int'
> is not by direct assigment/cast, but by casting to 'unsigned char'.
> That's well-known from [f]getc(), the <ctype.h> functions, etc.
> You don't need to particularly care about programmers who pass
> '\xff' to a function that expects an 'int'.

On targets where plain char is signed, '\xff' has value -1, meaning it
would work for %c with narrow printf but not with wide printf. I
wonder if this should actually be a defect and if wide printf should
be specified as converting the value, converted to (unsigned char), as
if by btowc, rather than the raw value.

Rich

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [musl] swprintf cannot handle the character 0xff
  2023-06-12 23:54         ` Rich Felker
@ 2023-06-13  0:23           ` Bruno Haible
  0 siblings, 0 replies; 7+ messages in thread
From: Bruno Haible @ 2023-06-13  0:23 UTC (permalink / raw)
  To: Rich Felker; +Cc: musl

Rich Felker wrote:
> I wonder if this should actually be a defect and if wide printf should
> be specified as converting the value, converted to (unsigned char), as
> if by btowc, rather than the raw value.

I agree. If ISO C and POSIX had specified or would specify %c in wide printf
like this, it would eliminate a (small) pitfall for programmers.

Bruno




^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2023-06-13  0:23 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-06-12 12:30 [musl] swprintf cannot handle the character 0xff Bruno Haible
2023-06-12 20:22 ` Rich Felker
2023-06-12 21:01   ` Rich Felker
2023-06-12 21:13     ` Rich Felker
2023-06-12 23:09       ` Bruno Haible
2023-06-12 23:54         ` Rich Felker
2023-06-13  0:23           ` Bruno Haible

Code repositories for project(s) associated with this public inbox

	https://git.vuxu.org/mirror/musl/

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).