edbrowse-dev - development list for edbrowse
 help / color / mirror / Atom feed
* [Edbrowse-dev] Images
@ 2014-04-27 15:37 Karl Dahlke
  2014-04-28 11:54 ` Adam Thompson
  2014-04-28 15:29 ` Chris Brannon
  0 siblings, 2 replies; 4+ messages in thread
From: Karl Dahlke @ 2014-04-27 15:37 UTC (permalink / raw)
  To: Edbrowse-dev

> Webvism is both a service and a Firefox plugin for solving captchas.

I'm guessing it sends the img url to some website, maybe with yourcredentials,
perhaps as cookies, then the site posts it, some user solves it,
and returns the letters to you, which you type in,
or cut&paste in, or maybe fills in automatically.

An edbrowse function could do all of this,
if we knew how it worked inside, except, there is no easy way to get the url.

As of today, the alt text of an image is in brackets,
unless of course that image is a clickable url whence it is in braces.
I'm not thrilled about brackets since text is sometimes in brackets anyways.
Rarely though is text naturally in braces or angles <>.
So brackets are ok I guess for an image but if you have other suggestions I'm open.
Maybe ~picture of a house~.
Those are eused less frequently.

Anyways let's say this is in place, then you want the edbrowse function
to pull out the url from the src= attribute, but oops,
this is entirely lost through browsing.
It's not clickable, so I didn't particularly care about it.
Visual browsers however let you right click on a picture
to download it or whatever.
So imagine that I carried some internal tags for the image, like I do with hyperlinks
and form fields etc.
If these tags were present then the A command could bring up the url for that picture,
just like it does for links and so on.
Then the edbrowse function could grab the url and send it to Webvism
and scrape the letters off and ^ back and maybe even put them into the next input field.
Feasible, if I retained the tags and src reference for images.
Wouldn't be a big job, but not trivial and probably not our highest priority.

Karl Dahlke

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [Edbrowse-dev] Images
  2014-04-27 15:37 [Edbrowse-dev] Images Karl Dahlke
@ 2014-04-28 11:54 ` Adam Thompson
  2014-04-28 15:29 ` Chris Brannon
  1 sibling, 0 replies; 4+ messages in thread
From: Adam Thompson @ 2014-04-28 11:54 UTC (permalink / raw)
  To: Karl Dahlke; +Cc: Edbrowse-dev

[-- Attachment #1: Type: text/plain, Size: 2724 bytes --]

On Sun, Apr 27, 2014 at 03:37:22PM +0000, Karl Dahlke wrote:
> > Webvism is both a service and a Firefox plugin for solving captchas.
> 
> I'm guessing it sends the img url to some website, maybe with yourcredentials,
> perhaps as cookies, then the site posts it, some user solves it,
> and returns the letters to you, which you type in,
> or cut&paste in, or maybe fills in automatically.
> 

Either that or the image itself.  If it's used in realtime then you probably need to send the downloaded image as lots of captures generate different images each time they're loaded.

> An edbrowse function could do all of this,
> if we knew how it worked inside, except, there is no easy way to get the url.
> 
> As of today, the alt text of an image is in brackets,
> unless of course that image is a clickable url whence it is in braces.
> I'm not thrilled about brackets since text is sometimes in brackets anyways.
> Rarely though is text naturally in braces or angles <>.
> So brackets are ok I guess for an image but if you have other suggestions I'm open.

I'm fairly used to brackets, I think this comes from when I used to browse the
web with the links2 browser though so I don't mind. As long as any change is documented.
> Maybe ~picture of a house~.
> Those are eused less frequently.

That also works.

> 
> Anyways let's say this is in place, then you want the edbrowse function
> to pull out the url from the src= attribute, but oops,
> this is entirely lost through browsing.
> It's not clickable, so I didn't particularly care about it.
> Visual browsers however let you right click on a picture
> to download it or whatever.
> So imagine that I carried some internal tags for the image, like I do with hyperlinks
> and form fields etc.
> If these tags were present then the A command could bring up the url for that picture,
> just like it does for links and so on.
> Then the edbrowse function could grab the url and send it to Webvism
> and scrape the letters off and ^ back and maybe even put them into the next input field.
> Feasible, if I retained the tags and src reference for images.
> Wouldn't be a big job, but not trivial and probably not our highest priority.

It would certainly have its uses, but as you say it's not really high priority at the moment.

As for the webvism function, if it does indeed use the url rather than the
image then I wonder if one could unbrowse the page, search for the alt text,
use a regex to extract out the url and send it to the site that way?

This approach wouldn't be very robust or nice to implement but could be made to
work in cases where some details about the image's alt text are known.

Cheers,
Adam.

[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 473 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [Edbrowse-dev] Images
  2014-04-27 15:37 [Edbrowse-dev] Images Karl Dahlke
  2014-04-28 11:54 ` Adam Thompson
@ 2014-04-28 15:29 ` Chris Brannon
  1 sibling, 0 replies; 4+ messages in thread
From: Chris Brannon @ 2014-04-28 15:29 UTC (permalink / raw)
  To: Edbrowse-dev

Karl Dahlke <eklhad@comcast.net> writes:

> I'm guessing it sends the img url to some website, maybe with yourcredentials,
> perhaps as cookies, then the site posts it, some user solves it,
> and returns the letters to you, which you type in,

I don't know how it works.  I'm pretty sure it doesn't grab an
image URL.  Why?  There's too much variability.  A lot of sites probably
use the URL in conjunction with cookies or whatnot.  I suppose the
plugin could ship your cookies to webvism, but that's a security
minefield that no one really wants to enter.
My suspicion is that it extracts the images from the page somehow,
possibly using the DOM tree to find the image associated with the
current form.  Those are then sent as actual graphic files to the
service.  That would be the best and safest way to do it, I should
think.

-- Chris

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Edbrowse-dev]  Images
@ 2014-04-28  8:06 Karl Dahlke
  0 siblings, 0 replies; 4+ messages in thread
From: Karl Dahlke @ 2014-04-28  8:06 UTC (permalink / raw)
  To: Edbrowse-dev

> I'm fairly used to brackets, I think this comes from when I used to browse the
> web with the links2 browser though so I don't mind.

Wow. I knew it looked familiar but couldn't remember why.
That was it.
Good, then this convention makes sense.
Although it doesn't come up very often because most images are clickable hyperlinks, hence the braces.
Well braces are sort of enhanced brackets, so again good.
I'll scan through and see if I documented this anywhere.

Karl Dahlke

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2014-04-28 15:30 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2014-04-27 15:37 [Edbrowse-dev] Images Karl Dahlke
2014-04-28 11:54 ` Adam Thompson
2014-04-28 15:29 ` Chris Brannon
2014-04-28  8:06 Karl Dahlke

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).