edbrowse-dev - development list for edbrowse
 help / color / mirror / Atom feed
* [Edbrowse-dev] High Unicodes
@ 2015-01-31 16:43 Karl Dahlke
  2015-01-31 18:21 ` Tyler Spivey
  0 siblings, 1 reply; 6+ messages in thread
From: Karl Dahlke @ 2015-01-31 16:43 UTC (permalink / raw)
  To: acsint, Edbrowse-dev

This is where speech adapters and edbrowse intersect,
thus posted to both groups.
Twitter, and I'm sure other social media,
make it easy for people to embed emoticons in their messages.
These are rendered by edbrowse, then spoken by speech software.
An example is 😀 for a grinning face.
So, what should we do about this?
Right now both my worlds have a partial solution.
Edbrowse translates a few common high unicodes into words,
but of course this only happens when browsing html, and the resulting words are
hard coded English.
My jupiter speech has the right solution:
set pronunciations in your config file.
Other speech adapters do this as well.
The line in my config file looks like this.

x1f600 grin

That's all you need.

I think edbrowse shouldn't be doing this at all.
Just turn the &#codes into unicode, then utf8, then
to the screen, and let speech software say the symbols as you wish,
in your language.
And this would work for all files everywhere, not just html that edbrowse browses.
But, some have said that some speech adapters aren't this flexible,
and wouldn't it be nice if edbrowse would continue to perform
some of these translations, so at least surfing the web would work properly.
Well maybe, I'm not sure, but if we retain this functionality
it should probably be configurable, not hard coded.

Karl Dahlke

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [Edbrowse-dev] High Unicodes
  2015-01-31 16:43 [Edbrowse-dev] High Unicodes Karl Dahlke
@ 2015-01-31 18:21 ` Tyler Spivey
  2015-01-31 19:09   ` Adam Thompson
  0 siblings, 1 reply; 6+ messages in thread
From: Tyler Spivey @ 2015-01-31 18:21 UTC (permalink / raw)
  To: edbrowse-dev

I agree. Let the adapters do it.
You'll still have to transform things like “ but into their
unicode equivalents such as “.

The transforms also are being done at the wrong time. If someone writes
a UTF-8 html file,
any UTF8 characters they type won't be transformed. For example:
&#ldquo;““
turns into “““ in Firefox, but "“` in edbrowse.
They're all the same character, just encoded differently.

On 1/31/2015 8:43 AM, Karl Dahlke wrote:
> This is where speech adapters and edbrowse intersect,
> thus posted to both groups.
> Twitter, and I'm sure other social media,
> make it easy for people to embed emoticons in their messages.
> These are rendered by edbrowse, then spoken by speech software.
> An example is 😀 for a grinning face.
> So, what should we do about this?
> Right now both my worlds have a partial solution.
> Edbrowse translates a few common high unicodes into words,
> but of course this only happens when browsing html, and the resulting words are
> hard coded English.
> My jupiter speech has the right solution:
> set pronunciations in your config file.
> Other speech adapters do this as well.
> The line in my config file looks like this.
> 
> x1f600 grin
> 
> That's all you need.
> 
> I think edbrowse shouldn't be doing this at all.
> Just turn the &#codes into unicode, then utf8, then
> to the screen, and let speech software say the symbols as you wish,
> in your language.
> And this would work for all files everywhere, not just html that edbrowse browses.
> But, some have said that some speech adapters aren't this flexible,
> and wouldn't it be nice if edbrowse would continue to perform
> some of these translations, so at least surfing the web would work properly.
> Well maybe, I'm not sure, but if we retain this functionality
> it should probably be configurable, not hard coded.
> 
> Karl Dahlke
> _______________________________________________
> Edbrowse-dev mailing list
> Edbrowse-dev@lists.the-brannons.com
> http://lists.the-brannons.com/mailman/listinfo/edbrowse-dev
> 


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [Edbrowse-dev] High Unicodes
  2015-01-31 18:21 ` Tyler Spivey
@ 2015-01-31 19:09   ` Adam Thompson
  2015-01-31 22:25     ` Chris Brannon
  0 siblings, 1 reply; 6+ messages in thread
From: Adam Thompson @ 2015-01-31 19:09 UTC (permalink / raw)
  To: Tyler Spivey; +Cc: edbrowse-dev

[-- Attachment #1: Type: text/plain, Size: 652 bytes --]

On Sat, Jan 31, 2015 at 10:21:42AM -0800, Tyler Spivey wrote:
> I agree. Let the adapters do it.
> You'll still have to transform things like “ but into their
> unicode equivalents such as “.

As someone who uses espeakup (which doesn't appear to support these characters
by default) I agree, it's definitely time to let the adapters do this and give up
transforming things like this in edbrowse.
At the end of the day, with the increasing use of utf-8,
people will need to get their speech software to work correctly with it at some
stage (I know I'm regularly running into this with other command line software).

Cheers,
Adam.

[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 473 bytes --]

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [Edbrowse-dev] High Unicodes
  2015-01-31 19:09   ` Adam Thompson
@ 2015-01-31 22:25     ` Chris Brannon
  2015-02-01 16:30       ` Adam Thompson
  0 siblings, 1 reply; 6+ messages in thread
From: Chris Brannon @ 2015-01-31 22:25 UTC (permalink / raw)
  To: edbrowse-dev, acsint

Adam Thompson <arthompson1990@gmail.com> writes:

> As someone who uses espeakup (which doesn't appear to support these characters
> by default) I agree, it's definitely time to let the adapters do this and give up

Yep, this is probably a task best left to the adapter layer.  I would be
fine with scrapping the translation in edbrowse or making it
configurable.

-- Chris

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [Edbrowse-dev] High Unicodes
  2015-01-31 22:25     ` Chris Brannon
@ 2015-02-01 16:30       ` Adam Thompson
  0 siblings, 0 replies; 6+ messages in thread
From: Adam Thompson @ 2015-02-01 16:30 UTC (permalink / raw)
  To: Chris Brannon; +Cc: acsint, edbrowse-dev

[-- Attachment #1: Type: text/plain, Size: 811 bytes --]

On Sat, Jan 31, 2015 at 02:25:37PM -0800, Chris Brannon wrote:
> Adam Thompson <arthompson1990@gmail.com> writes:
>
> > As someone who uses espeakup (which doesn't appear to support these characters
> > by default) I agree, it's definitely time to let the adapters do this and give up
>
> Yep, this is probably a task best left to the adapter layer. I would be
> fine with scrapping the translation in edbrowse or making it
> configurable.

As the one who mentioned the case of adapters which don't support this,
I think it'd be better just to scrap these transformations in Edbrowse.
I say this on the basis of consistancy, i.e.
if I browse a web page I get the transforms,
however if I edit a UTF-8 text file I don't.
This is not that nice imho; I'd rather just have to make my adapter behave.

Cheers,
Adam.

[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 473 bytes --]

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Edbrowse-dev]  High Unicodes
@ 2015-02-01 16:46 Karl Dahlke
  0 siblings, 0 replies; 6+ messages in thread
From: Karl Dahlke @ 2015-02-01 16:46 UTC (permalink / raw)
  To: Edbrowse-dev

Yes, as we all seem to be in agreement here,
I am slowly pulling out these translations, trying not to break anything.
Of course I have to leave the word ones in, like &lt; <
as you have pointed out,
and there's probably a lot more word translates that I haven't done.
But I will replace each with the utf8 and be done with it.

Karl Dahlke

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2015-02-01 16:50 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-01-31 16:43 [Edbrowse-dev] High Unicodes Karl Dahlke
2015-01-31 18:21 ` Tyler Spivey
2015-01-31 19:09   ` Adam Thompson
2015-01-31 22:25     ` Chris Brannon
2015-02-01 16:30       ` Adam Thompson
2015-02-01 16:46 Karl Dahlke

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).