From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from pcdesk.net (mail.pcdesk.net [70.58.191.25]) by hurricane.the-brannons.com (Postfix) with ESMTPS id 6825E78F1F for ; Sat, 31 Jan 2015 10:24:43 -0800 (PST) Message-ID: <54CD1D36.8020806@pcdesk.net> Date: Sat, 31 Jan 2015 10:21:42 -0800 From: Tyler Spivey User-Agent: Mozilla/5.0 (Windows NT 6.3; WOW64; rv:31.0) Gecko/20100101 Thunderbird/31.4.0 MIME-Version: 1.0 To: edbrowse-dev@lists.the-brannons.com References: <20150031114312.eklhad@comcast.net> In-Reply-To: <20150031114312.eklhad@comcast.net> Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Subject: Re: [Edbrowse-dev] High Unicodes X-BeenThere: edbrowse-dev@lists.the-brannons.com X-Mailman-Version: 2.1.18-1 Precedence: list List-Id: Edbrowse Development List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 31 Jan 2015 18:24:43 -0000 I agree. Let the adapters do it. You'll still have to transform things like “ but into their unicode equivalents such as “. The transforms also are being done at the wrong time. If someone writes a UTF-8 html file, any UTF8 characters they type won't be transformed. For example: &#ldquo;““ turns into “““ in Firefox, but "“` in edbrowse. They're all the same character, just encoded differently. On 1/31/2015 8:43 AM, Karl Dahlke wrote: > This is where speech adapters and edbrowse intersect, > thus posted to both groups. > Twitter, and I'm sure other social media, > make it easy for people to embed emoticons in their messages. > These are rendered by edbrowse, then spoken by speech software. > An example is 😀 for a grinning face. > So, what should we do about this? > Right now both my worlds have a partial solution. > Edbrowse translates a few common high unicodes into words, > but of course this only happens when browsing html, and the resulting words are > hard coded English. > My jupiter speech has the right solution: > set pronunciations in your config file. > Other speech adapters do this as well. > The line in my config file looks like this. > > x1f600 grin > > That's all you need. > > I think edbrowse shouldn't be doing this at all. > Just turn the &#codes into unicode, then utf8, then > to the screen, and let speech software say the symbols as you wish, > in your language. > And this would work for all files everywhere, not just html that edbrowse browses. > But, some have said that some speech adapters aren't this flexible, > and wouldn't it be nice if edbrowse would continue to perform > some of these translations, so at least surfing the web would work properly. > Well maybe, I'm not sure, but if we retain this functionality > it should probably be configurable, not hard coded. > > Karl Dahlke > _______________________________________________ > Edbrowse-dev mailing list > Edbrowse-dev@lists.the-brannons.com > http://lists.the-brannons.com/mailman/listinfo/edbrowse-dev >