From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/88133 Path: news.gmane.org!not-for-mail From: Hans Hagen Newsgroups: gmane.comp.tex.context Subject: Re: Bibliography: criterium=all Date: Wed, 16 Jul 2014 14:44:59 +0200 Message-ID: <53C673CB.9090106@wxs.nl> References: <53C58F84.6080406@wxs.nl> <53C62AC9.9000703@wxs.nl> <53C642B6.4080004@wxs.nl> <53C64B4F.4000108@wxs.nl> <20140716135919.2a643253@iram-ha-003840.extra.cea.fr> Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; Format="flowed" Content-Transfer-Encoding: 7bit X-Trace: ger.gmane.org 1405514730 31251 80.91.229.3 (16 Jul 2014 12:45:30 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 16 Jul 2014 12:45:30 +0000 (UTC) To: ntg-context@ntg.nl Original-X-From: ntg-context-bounces@ntg.nl Wed Jul 16 14:45:26 2014 Return-path: Envelope-to: gctc-ntg-context-518@m.gmane.org Original-Received: from balder.ntg.nl ([5.39.185.229]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1X7OaL-00071I-SH for gctc-ntg-context-518@m.gmane.org; Wed, 16 Jul 2014 14:45:25 +0200 Original-Received: from localhost (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id 6B4C71021E for ; Wed, 16 Jul 2014 14:45:25 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at balder.ntg.nl Original-Received: from balder.ntg.nl ([127.0.0.1]) by localhost (balder.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id aYcuw4RJVwu1 for ; Wed, 16 Jul 2014 14:45:25 +0200 (CEST) Original-Received: from balder.ntg.nl (localhost [IPv6:::1]) by balder.ntg.nl (Postfix) with ESMTP id 4E42F10220 for ; Wed, 16 Jul 2014 14:45:17 +0200 (CEST) Original-Received: from localhost (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id 9213C101E6 for ; Wed, 16 Jul 2014 14:45:13 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at balder.ntg.nl Original-Received: from balder.ntg.nl ([127.0.0.1]) by localhost (balder.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id EH3ZU7j+O1Am for ; Wed, 16 Jul 2014 14:45:07 +0200 (CEST) Original-Received: from filter3-til.mf.surf.net (filter3-til.mf.surf.net [194.171.167.219]) by balder.ntg.nl (Postfix) with ESMTP id DCC5E101E2 for ; Wed, 16 Jul 2014 14:45:07 +0200 (CEST) Original-Received: from smtp.ziggozakelijk.nl (D57D1DA2.static.ziggozakelijk.nl [213.125.29.162]) by filter3-til.mf.surf.net (8.14.3/8.14.3/Debian-9.4) with ESMTP id s6GCj5vQ015967 for ; Wed, 16 Jul 2014 14:45:06 +0200 X-Default-Received-SPF: pass (skip=loggedin (res=PASS)) x-ip-name=10.100.1.103; Original-Received: from [10.100.1.103] (unverified [10.100.1.103]) by pragma-net.nl (SurgeMail 6.5a2) with ESMTP id 249-1713362 for ; Wed, 16 Jul 2014 14:45:05 +0200 User-Agent: Mozilla/5.0 (Windows NT 6.3; WOW64; rv:24.0) Gecko/20100101 Thunderbird/24.6.0 In-Reply-To: X-Authenticated-User: hagen@controller-9 X-Bayes-Prob: 0.005 (Score 0, tokens from: ntg-context@ntg.nl, base:default, @@RPTN) X-CanIt-Geo: ip=213.125.29.162; country=NL; region=Provincie Overijssel; city=Zwolle; latitude=52.5058; longitude=6.0858; http://maps.google.com/maps?q=52.5058,6.0858&z=6 X-CanItPRO-Stream: uu:ntg-context@ntg.nl (inherits from uu:default, base:default) X-Canit-Stats-ID: 06Mr0J6Hd - 96e7d596575f - 20140716 (trained as not-spam) X-Scanned-By: CanIt (www . roaringpenguin . com) on 194.171.167.219 X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.14 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: ntg-context-bounces@ntg.nl Original-Sender: ntg-context-bounces@ntg.nl Xref: news.gmane.org gmane.comp.tex.context:88133 Archived-At: On 7/16/2014 2:24 PM, Rob Heusdens wrote: > Hello, > > This is more a 'side-comment', but anytime I see an application that wants > in fact to query data from some dataresource, I ask myself: why is that > not implemented as a real relational database? > > BibTex is some weird implementation for a problem that in facts requests a > real relational dababase approach, in which your data does not reside in a > file, but a couple of tables. Like PUBLISHER (the organisation that > publishes the article/book/journal, etc.), PUBLICATION, EDITION, > AUTHOR/CONTRIBUTOR (a person making a contribution to a PUBLICATION, like > author, co-author, editor) etc. > > For example: every author that has contributed to a publication (or > edition thereof) has only one record in the AUTHOR table, and for each > contribution to a PUBLICATION/EDITION there is a record in the > CONTRIBUTION table, with the foreign keys to the primary key of > AUTHOR/CONTRIBUTOR and to the PUBLICATION/EDITION table. > > A data-model for such must exist in the real-world, somewhere, I guess. > But Tex and other implementors have choosen to implement this in a simple > flat-file system, and which I think is part of the problem, because the > implemenation as flat file has certain limits. Esp. when more demanding > features are requested. > > Still thinking that ultimately that would be the best way to implement > BibTex, using a relational database as repository instead of flat files. > And you could implement many other 'nice' featueres, like querying other > works related to the works you want to cite (for instance at the basis of > relevant key/reference words or other). > > Would be do-able I guess (LuaTex can access dabatases), only problem is > you have to convert al these .bib files into the database format. > > But don't know if anyone has thought about implementing Bibtex as a database. the main problem there is that normally authors collect their own bibentries and that bib has become sort of a standard ... i have no problem with a database approach but it would also mean normalizing (e.g. get rid of tex stuff in bib entries) and so technically it's no problem but politically ... (similar arguments can be given for math habit and so) with context we're not too bound to such 'standards' but we can't ignore them (and therefore support them); the bib implementation is flexible enough to be extended but in order to get a database approach done a reasonable set of users need to carry it (there are some search options built in btw) > Greetings, > > Rob > >> Hans, >> >> Sorting simply by year (or, rather, year then author then title ...) >> could be useful. Take, for example, a pluriannual report having many >> references. One might wish to display a list of publications by year. >> Should this be selected using "sorttype=year"? >> >> Another possible syntax could be >> sorttype={author,year,title,page} >> and this would be the default for APA. >> Of course, the other common sorting is "sorttype=cite". >> >> Furthermore, >> sorttype={year,author,title,page} >> would be another variant, and I can even imagine using: >> sorttype={journal,volume,number,page} >> or >> sorttype={publisher,year,title} >> and I am sure that users may need something totally unanticipated. >> >> The change from the present is that "sortype=author" implies >> {author,year,title,page}. I suppose that it could default to this order >> if no other order is specified. Just like "sorttype=year" could default >> to something sane like {year,author,title,page} unless a different >> order is explicitly given, etc. >> >> A subtlety is how to handle missing fields, say in a mixture of >> articles and books where articles have journal titles (journal) and >> article titles (title) and books have book titles (title). Should >> missing journal fields (books) come before or after articles or should >> title then be taken into account, as in {journal|title}? Do we need to >> allow such logic? >> >> We can continue to discuss this with Thomas and Luigi but I mention it >> here on the mailing list as Flavien Lambert brought up the question. >> >> Alan >> >> >> >> >> On Wed, 16 Jul 2014 11:52:15 +0200 >> Hans Hagen wrote: >> >>> On 7/16/2014 11:31 AM, Flavien Lambert wrote: >>>> The sort by author is perfect. >>> >>> fyi, sorting authors is kind of complex as we need to take all name >>> elements into account (initials, firstnames, surnames, vons, >>> juniors); of course there can be multiple authors; add to the extra >>> axes year journal, title and page >>> >>> the torture test is a list with 400K author specifications (taken >>> from a bunch of bib databases) which eventually produces a 5500 page >>> document >>> >>> in practice, certainly when we have smaller databases, load time can >>> be neglected (here it is .4 sec luatex vs .3 sec luajittex); sort >>> preparation and processing of that bunch takes 35 sec for luatex and >>> 19 sec for luajittex but of course for normal cases you won't even >>> noticed it .. bib databases with 400K entries are probably not used >>> often >>> >>> (i have no clue if bibtex can handle these numbers) >>> >>>> Thanks, >>>> F. >>>> >>>> On 16 Jul 2014 17:16, "Hans Hagen" >>> > wrote: >>>> >>>> On 7/16/2014 10:10 AM, Flavien Lambert wrote: >>>> >>>> Great! Thanks a lot! >>>> >>>> And concerning sorting by date? >>>> >>>> >>>> sorttype=author >>>> >>>> sorts by author,year,journal,title,page >>>> >>>> so what do you mean with 'by year' >>>> >>>> Hans >> >> >> -- >> Alan Braslau >> CEA DSM-IRAMIS-SPEC >> CNRS URA 2464 >> Orme des Merisiers >> 91191 Gif-sur-Yvette cedex FRANCE >> tel: +33 1 69 08 73 15 >> fax: +33 1 69 08 87 86 >> mailto:alan.braslau@cea.fr >> ___________________________________________________________________________________ >> If your question is of interest to others as well, please add an entry to >> the Wiki! >> >> maillist : ntg-context@ntg.nl / >> http://www.ntg.nl/mailman/listinfo/ntg-context >> webpage : http://www.pragma-ade.nl / http://tex.aanhet.net >> archive : http://foundry.supelec.fr/projects/contextrev/ >> wiki : http://contextgarden.net >> ___________________________________________________________________________________ >> > > > ___________________________________________________________________________________ > If your question is of interest to others as well, please add an entry to the Wiki! > > maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context > webpage : http://www.pragma-ade.nl / http://tex.aanhet.net > archive : http://foundry.supelec.fr/projects/contextrev/ > wiki : http://contextgarden.net > ___________________________________________________________________________________ > -- ----------------------------------------------------------------- Hans Hagen | PRAGMA ADE Ridderstraat 27 | 8061 GH Hasselt | The Netherlands tel: 038 477 53 69 | voip: 087 875 68 74 | www.pragma-ade.com | www.pragma-pod.nl ----------------------------------------------------------------- ___________________________________________________________________________________ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : http://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___________________________________________________________________________________