Our /lib/unicode is a bit out of date, this updates our stripped down version of UnicodeData.txt that we keep in /lib to cover characters and code ranges above the Basic Multilingual Plane. This does balloon the file a bit compared to the ~200k original. ; 800k /lib/unicode The full patch is attached. Of note the non-zero padding of the BMP range is replicated in the upstream UnicodeData.txt, I would be open to zero padding ours but this would change the results of existing scripts that use look(1) with /lib/unicode. Not sure how much use others get out of /lib/unicode, but wanted to ask if people thought it was worth the size to update. Thanks, moody