RetroSearch Browse

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Showing content from https://mail.python.org/pipermail/python-dev/2005-September.txt below:

From jcarlson at uci.edu Thu Sep 1 00:55:51 2005 From: jcarlson at uci.edu (Josiah Carlson) Date: Wed, 31 Aug 2005 15:55:51 -0700 Subject: [Python-Dev] Proof of the pudding: str.partition() In-Reply-To: References: Message-ID: <20050831151628.8B17.JCARLSON@uci.edu> Steve Holden wrote: > > Fredrik Lundh wrote: > > the problem isn't the time it takes to unpack the return value, the problem is that > > it takes time to create the substrings that you don't need. > > > Indeed, and therefore the performance of rpartition is likely to get > worse as the length of the input strung increases. I don't like to think > about all those strings being created just to be garbage-collected. Pity > the poor CPU ... :-) > > > for some use cases, a naive partition-based solution is going to be a lot slower > > than the old find+slice approach, no matter how you slice, index, or unpack the > > return value. > > > Yup. Then it gets down to statistical arguments about the distribution > of use cases and input lengths. If we had a type that represented a > substring of an existing string it might avoid the stress, but I'm not > sure I see that one flying. What about buffer()? Tack on some string methods and you get a string slice-like instance with very low memory requirements. Add on actual comparisons of buffers and strings, and you can get nearly everything desired with very low memory overhead. A bit of free thought brings me to the (half-baked) idea that if string methods accepted any object which conformed to the buffer interface; mmap, buffer, array, ... instances could gain all of the really convenient methods that make strings the objects to use in many cases. If one wanted to keep string methods returning strings, and other objects with the buffer protocol which use string methods returning buffer objects, that seems reasonable (and probably a good idea). - Josiah P.S. Pardon me if the idea is pure insanity, I haven't been getting much sleep lately, and just got up from a nap that seems to have clouded my judgement (I just put milk in my juice...). From tdelaney at avaya.com Thu Sep 1 01:36:00 2005 From: tdelaney at avaya.com (Delaney, Timothy (Tim)) Date: Thu, 1 Sep 2005 09:36:00 +1000 Subject: [Python-Dev] Proof of the pudding: str.partition() Message-ID: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> Fredrik Lundh wrote: > the problem isn't the time it takes to unpack the return value, the > problem is that it takes time to create the substrings that you don't > need. I'm actually starting to think that this may be a good use case for views of strings i.e. rather than create 3 new strings, each "string" is a view onto the string that was partitioned. Most of the use cases I've seen, the partitioned bits are discarded almost as soon as the original string, and often the original string persists beyond the partitioned bits. Tim Delaney From nnorwitz at gmail.com Thu Sep 1 01:36:54 2005 From: nnorwitz at gmail.com (Neal Norwitz) Date: Wed, 31 Aug 2005 16:36:54 -0700 Subject: [Python-Dev] import exceptions Message-ID: Is there any reason to import exceptions? It's only done in 4 places: Lib/asyncore.py, Lib/shutil.py, Lib/idlelib/PyShell.py, and Lib/test/test_exceptions.py. I can understand the one in test, but should the other 3 be removed since exceptions are builtin? n From jimjjewett at gmail.com Thu Sep 1 01:56:04 2005 From: jimjjewett at gmail.com (Jim Jewett) Date: Wed, 31 Aug 2005 19:56:04 -0400 Subject: [Python-Dev] [Python-checkins] python/dist/src/Lib/test test_re.py, 1.45.6.3, 1.45.6.4 In-Reply-To: <20050831125701.F008B1E4004@bag.python.org> References: <20050831125701.F008B1E4004@bag.python.org> Message-ID: On 8/31/05, akuchling at users.sourceforge.net wrote: > Log Message: > ... the tests aren't run by default because I wanted to minimize > upheaval to the 2.3 test suite What is the reasoning behind this? It seems to me that if a (passing) test is being added, maintenance releases are the *most* important places to run them. On the other hand, I've also seen Raymond check (regression) tests into only development, so it seems to be a conscious choice. *I* just don't understand it. -jJ From guido at python.org Thu Sep 1 02:05:21 2005 From: guido at python.org (Guido van Rossum) Date: Wed, 31 Aug 2005 17:05:21 -0700 Subject: [Python-Dev] import exceptions In-Reply-To: References: Message-ID: On 8/31/05, Neal Norwitz wrote: > Is there any reason to import exceptions? It's only done in 4 places: > Lib/asyncore.py, Lib/shutil.py, Lib/idlelib/PyShell.py, and > Lib/test/test_exceptions.py. I can understand the one in test, but > should the other 3 be removed since exceptions are builtin? I'm guessing this is a remnant from a transitional period around Python 1.5. Let's get rid of it. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From t-meyer at ihug.co.nz Thu Sep 1 02:06:15 2005 From: t-meyer at ihug.co.nz (Tony Meyer) Date: Thu, 1 Sep 2005 12:06:15 +1200 Subject: [Python-Dev] setdefault's second argument Message-ID: > To save you from following that link, to this day I still mentally > translate "setdefault" to "getorset" whenever I see it. I read these out of order (so didn't see the giveaway getorsetandget) and spent some time wondering what an "orset" was. I figured it must be some obscure CS/text processing/numeric/literary term that suited this usage. So obscure that google's define couldn't find me a definition. set[with]default is maybe a terrible name, but it does have some things going for it ;) =Tony.Meyer ...perhaps it was the similarity to corset...but surely I'm too young to have "corset" spring to mind before "or set"... From janssen at parc.com Thu Sep 1 02:42:02 2005 From: janssen at parc.com (Bill Janssen) Date: Wed, 31 Aug 2005 17:42:02 PDT Subject: [Python-Dev] Proof of the pudding: str.partition() In-Reply-To: Your message of "Wed, 31 Aug 2005 13:49:23 PDT." Message-ID: <05Aug31.174209pdt."58617"@synergy1.parc.xerox.com> Reinhold Birkenfeld writes: > And it's horrible, for none of the other string methods accept a RE. I suppose it depends on your perspective as to what exactly is horrible. I tend to think it's too bad that none of the other string methods accept appropriate RE patterns. Strings are thought of as "core", whereas RE, a relatively new part of the stdlib, isn't. But it's OK -- it just gives the system more Java-ness, where you have lots of little modules, each of which does something slightly different. > There are languages which give REs too much weight by philosophy > (hint, hint), but Python isn't one of them. Interestingly, Python programmers > suffer less from the "help me, my RE doesn't work" problem. Yes, but perhaps the causative bug in those "other" languages is the confusion between string *literals* and RE *literals*, which isn't a problem in the idiom I suggested. Or perhaps if RE was more helpful in Python, Python programmers would indeed suffer from the same problem. Bill From skip at pobox.com Thu Sep 1 02:59:36 2005 From: skip at pobox.com (skip@pobox.com) Date: Wed, 31 Aug 2005 19:59:36 -0500 Subject: [Python-Dev] stat() return value (was: Re: Proof of the pudding: str.partition()) In-Reply-To: References: <005301c5ada1$4a52afc0$8832c797@oemcomputer> <4314CA3E.3020606@benjiyork.com> <4314E1A2.4060409@ronadam.com> <4314E51B.1050507@hathawaymix.org> <17173.43632.145313.858480@montanaro.dyndns.org> Message-ID: <17174.21112.763903.576402@montanaro.dyndns.org> >> In the case of stat() there is no reason other than historic for the >> results to be returned in any particular order, Terry> Which is why I wonder whether the sequence part should be dropped Terry> in 3.0. I think that would be a good idea. Return an honest-to-goodness stat object and also strip the "st_" prefixes removed from the attributes. There's no namespace collision problems from which the prefixes protect us. Skip From guido at python.org Thu Sep 1 03:05:55 2005 From: guido at python.org (Guido van Rossum) Date: Wed, 31 Aug 2005 18:05:55 -0700 Subject: [Python-Dev] stat() return value (was: Re: Proof of the pudding: str.partition()) In-Reply-To: <17174.21112.763903.576402@montanaro.dyndns.org> References: <005301c5ada1$4a52afc0$8832c797@oemcomputer> <4314CA3E.3020606@benjiyork.com> <4314E1A2.4060409@ronadam.com> <4314E51B.1050507@hathawaymix.org> <17173.43632.145313.858480@montanaro.dyndns.org> <17174.21112.763903.576402@montanaro.dyndns.org> Message-ID: On 8/31/05, skip at pobox.com wrote: > I think that would be a good idea. Return an honest-to-goodness stat object > and also strip the "st_" prefixes removed from the attributes. There's no > namespace collision problems from which the prefixes protect us. +1 on dropping the sequence. -0 on dropping the st_ prefix; these are conventional and familiar to all UNIX developers and most C programmers, and help with grepping (and these days, Googling :). -- --Guido van Rossum (home page: http://www.python.org/~guido/) From skip at pobox.com Thu Sep 1 03:23:34 2005 From: skip at pobox.com (skip@pobox.com) Date: Wed, 31 Aug 2005 20:23:34 -0500 Subject: [Python-Dev] String views (was: Re: Proof of the pudding: str.partition()) In-Reply-To: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> Message-ID: <17174.22550.862457.829100@montanaro.dyndns.org> Tim> I'm actually starting to think that this may be a good use case for Tim> views of strings i.e. rather than create 3 new strings, each Tim> "string" is a view onto the string that was partitioned. How would this work? One of the advantages of the current string is that the underlying data is NUL-terminated, so when passing strings to C routines no copying is required. Suppose I executed scheme, _, rest = "http://www.python.org/".partition(':') As a Python programmer I'd get back what look like three strings: "http", ":", and "//www.python.org/". If each of them was a view onto part of the original string, only the last one would truly refer to a NUL-terminated sequence of characters. If I then wanted to see what scheme's value compared to, the string's comparison method would have to recognize that it wasn't truly NUL-terminated, copy it, call strncmp() or whatever underlying routine is used for string comparisons. (Maybe string comparisons are done inline. I'm sure there are some examples where the underlying C string routines are called.) OTOH, maybe that would work. Perhaps we should try it. Skip From skip at pobox.com Thu Sep 1 03:46:08 2005 From: skip at pobox.com (skip@pobox.com) Date: Wed, 31 Aug 2005 20:46:08 -0500 Subject: [Python-Dev] String views (was: Re: Proof of the pudding: str.partition()) In-Reply-To: <17174.22550.862457.829100@montanaro.dyndns.org> References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> Message-ID: <17174.23904.883698.268577@montanaro.dyndns.org> Skip> OTOH, maybe that would work. Perhaps we should try it. Ah, I forgot the data is part of the PyString object itself, not stored as a separate char* array. Without a char* in the object it's kind of hard to do views. Skip From tdelaney at avaya.com Thu Sep 1 04:14:56 2005 From: tdelaney at avaya.com (Delaney, Timothy (Tim)) Date: Thu, 1 Sep 2005 12:14:56 +1000 Subject: [Python-Dev] String views (was: Re: Proof of the pudding: str.partition()) Message-ID: <2773CAC687FD5F4689F526998C7E4E5F4DB59C@au3010avexu1.global.avaya.com> skip at pobox.com wrote: > How would this work? One of the advantages of the current string is > that the underlying data is NUL-terminated, so when passing strings > to C routines no copying is required. I didn't say it would be easy. Just that it's about the first cases where I've seen there could be a real advantage to using string views. And I don't even know that. One of the big disadvantages of string views is that they need to keep the original object around, no matter how big it is. But in the case of partition, much of the time the original string survives for at least a similar period to the partitions. Tim Delaney From skip at pobox.com Thu Sep 1 04:21:12 2005 From: skip at pobox.com (skip@pobox.com) Date: Wed, 31 Aug 2005 21:21:12 -0500 Subject: [Python-Dev] String views (was: Re: Proof of the pudding: str.partition()) In-Reply-To: <2773CAC687FD5F4689F526998C7E4E5F4DB59C@au3010avexu1.global.avaya.com> References: <2773CAC687FD5F4689F526998C7E4E5F4DB59C@au3010avexu1.global.avaya.com> Message-ID: <17174.26008.116470.620022@montanaro.dyndns.org> Tim> One of the big disadvantages of string views is that they need to Tim> keep the original object around, no matter how big it is. But in Tim> the case of partition, much of the time the original string Tim> survives for at least a similar period to the partitions. Not necessarily. Presumably a string view would reference another string object's data buffer. A possible optimization would be to convert from a view to a normal string once the original strings' refcount dropped to one, particularly if the view's size was substantially smaller than that of the original string. Skip From foom at fuhm.net Thu Sep 1 04:51:18 2005 From: foom at fuhm.net (James Y Knight) Date: Wed, 31 Aug 2005 22:51:18 -0400 Subject: [Python-Dev] String views (was: Re: Proof of the pudding: str.partition()) In-Reply-To: <17174.26008.116470.620022@montanaro.dyndns.org> References: <2773CAC687FD5F4689F526998C7E4E5F4DB59C@au3010avexu1.global.avaya.com> <17174.26008.116470.620022@montanaro.dyndns.org> Message-ID: <01639B6F-7F21-43D4-AD45-0F9366EEBBC0@fuhm.net> On Aug 31, 2005, at 10:21 PM, skip at pobox.com wrote: > > Tim> One of the big disadvantages of string views is that they > need to > Tim> keep the original object around, no matter how big it is. > But in > Tim> the case of partition, much of the time the original string > Tim> survives for at least a similar period to the partitions. > > Not necessarily. Presumably a string view would reference another > string > object's data buffer. A possible optimization would be to convert > from a > view to a normal string once the original strings' refcount dropped > to one, > particularly if the view's size was substantially smaller than that > of the > original string. I suspect this would be a pessimization most of the time, as it would require keeping a list of pointers to all the views referencing the string object. James From stephen at xemacs.org Thu Sep 1 04:52:22 2005 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Thu, 01 Sep 2005 11:52:22 +0900 Subject: [Python-Dev] Revising RE docs In-Reply-To: <20050830143542.niq7a9s8bsrkc8ok@login.werra.lunarpages.com> (Michael Chermside's message of "Tue, 30 Aug 2005 14:35:42 -0700") References: <20050830143542.niq7a9s8bsrkc8ok@login.werra.lunarpages.com> Message-ID: <87hdd5o5y1.fsf@tleepslib.sk.tsukuba.ac.jp> >>>>> "Michael" == Michael Chermside writes: Michael> (2) is what we have today, but I would prefer (1) to Michael> gently encourage people to use the precompiled objects Michael> (which are distinctly faster when re-used). Didn't Fredrik Lundh strongly imply that implicitly compiled objects are cached? That's a pretty big speed up right there. Sure, the precompiled objects are faster because you don't have to find them. But you could have string objects (or a derivative) grow a "compiled_regexp" attribute internally. Then if you have a random regexp you think you might have used elsewhere, just intern it. (NB, wild guess, here, I don't know enough about current implementation to know if this is a reasonable extension.) Michael> Does anyone else think we ought to swap that around in Michael> the documentation? I'm not trying to assign more work to Michael> Fred... but if there were a python-dev consensus that Michael> this would be desirable, then perhaps someone would be Michael> encouraged to supply a patch. +1. I won't have time for some weeks, but I'd already flagged Barry's post for later consideration. I hope that doesn't stop somebody else from picking up the ball, though. -- School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN Ask not how you can "do" free software business; ask what your business can "do for" free software. From tjreedy at udel.edu Thu Sep 1 04:58:18 2005 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 31 Aug 2005 22:58:18 -0400 Subject: [Python-Dev] stat() return value (was: Re: Proof of thepudding: str.partition()) References: <005301c5ada1$4a52afc0$8832c797@oemcomputer><4314CA3E.3020606@benjiyork.com> <4314E1A2.4060409@ronadam.com><4314E51B.1050507@hathawaymix.org> <17173.43632.145313.858480@montanaro.dyndns.org> <17174.21112.763903.576402@montanaro.dyndns.org> Message-ID: "Guido van Rossum" wrote in message news:ca471dc205083118051acd7fab at mail.gmail.com... > On 8/31/05, skip at pobox.com wrote: >> I think that would be a good idea. Return an honest-to-goodness stat >> object >> and also strip the "st_" prefixes removed from the attributes. There's >> no >> namespace collision problems from which the prefixes protect us. > > +1 on dropping the sequence. Good. Another addition to PEP 3000. I was hoping this would not require a long-winded and possibly boring justification for something I suspect (without checking the archives) was in the back of some minds when the attributes were added. > -0 on dropping the st_ prefix; these are conventional and familiar to > all UNIX developers and most C programmers, and help with grepping > (and these days, Googling :). Terry J. Reedy From ldlandis at gmail.com Thu Sep 1 05:10:14 2005 From: ldlandis at gmail.com (LD "Gus" Landis) Date: Wed, 31 Aug 2005 22:10:14 -0500 Subject: [Python-Dev] Proof of the pudding: str.partition() In-Reply-To: <5.1.1.6.0.20050830233356.01b34118@mail.telecommunity.com> References: <005301c5ada1$4a52afc0$8832c797@oemcomputer> <4314CA3E.3020606@benjiyork.com> <4314E1A2.4060409@ronadam.com> <5.1.1.6.0.20050830233356.01b34118@mail.telecommunity.com> Message-ID: Hi, FTR, I was not implying the $PIECE() was an answer at all, but only suggesting it as an alternative name to .partition(). .piece() can be both a verb and a noun as can .partition(), thus overcoming Nick's objection to a "noun"ish thing doing the work of a "verb"ish thing. Also, IIRC, I did say it would need to be "Pythonified". I pointed to the official definition of $PIECE() merely to show that it was more than a .split() as it has (sort of) some of the notion of a slice. Phillip, I think, as I presented the $PIECE() thing, you were totally justified to recoil in horror. That said, it would be nice if there were a way to "save" the result of the .partition() result in a way that would not require duplicating the .partition() call (as has been suggested) making things like: ... s.partition(":").head, s.partition(":").tail unnecessary. One could get accustomed to the _,_,tail = s.partition(...) style I suppose, but it seems a bit "different", IMO. Also, it seems that the interference with i18n diminishes the appeal of that style. Cheers, --ldl On 8/30/05, Phillip J. Eby wrote: > ... > No, just to point out that you can make up whatever semantics you want, but > the semantics you show above are *not* the same as what are shown at the > page the person who posted about $PIECE cited, and on whose content I based > my reply: > > http://www.jacquardsystems.com/Examples/function/piece.htm > > If you were following those semantics, then the code you presented above is > buggy, as host.piece(':',1,2) would return the original string! > > Of course, since I know nothing of MUMPS besides what's on that page, it's > entirely possible I've misinterpreted that page in some hideously subtle > way -- as I pointed out in my original post regarding $PIECE. I like to > remind myself and others of the possibility that I *could* be wrong, even > when I'm *certain* I'm right, because it helps keep me from appearing any > more arrogant than I already do, and it also helps to keep me from looking > too stupid in those cases where I turn out to be wrong. Perhaps you might > find that approach useful as well. > > In any case, to avoid confusion, you should probably specify the semantics > of your piece() proposal in Python terms, so that those of us who don't > know MUMPS have some possibility of grasping the inner mysteries of your > proposal. > -- LD Landis - N0YRQ - from the St Paul side of Minneapolis From tjreedy at udel.edu Thu Sep 1 05:01:34 2005 From: tjreedy at udel.edu (Terry Reedy) Date: Wed, 31 Aug 2005 23:01:34 -0400 Subject: [Python-Dev] Proof of the pudding: str.partition() References: <20050831045535.4dovty96y0w0g4gg@login.werra.lunarpages.com> <5.1.1.6.0.20050831092223.01b56d98@mail.telecommunity.com> Message-ID: >> for some use cases, a naive partition-based solution is going to be a >> lot slower >> than the old find+slice approach, no matter how you slice, index, or >> unpack the >> return value. The index+slice approach will still be available for such cases. I am sure we will see relative speed versus string size benchmarks. Terry J. Reedy From skip at pobox.com Thu Sep 1 05:16:05 2005 From: skip at pobox.com (skip@pobox.com) Date: Wed, 31 Aug 2005 22:16:05 -0500 Subject: [Python-Dev] String views (was: Re: Proof of the pudding: str.partition()) In-Reply-To: <01639B6F-7F21-43D4-AD45-0F9366EEBBC0@fuhm.net> References: <2773CAC687FD5F4689F526998C7E4E5F4DB59C@au3010avexu1.global.avaya.com> <17174.26008.116470.620022@montanaro.dyndns.org> <01639B6F-7F21-43D4-AD45-0F9366EEBBC0@fuhm.net> Message-ID: <17174.29301.113939.867319@montanaro.dyndns.org> James> I suspect this would be a pessimization most of the time, as it James> would require keeping a list of pointers to all the views James> referencing the string object. I'm skeptical about performance as well, but not for that reason. A string object can have a referent field. If not NULL, it refers to another string object which is INCREFed in the usual way. At string deallocation, if the referent is not NULL, the referent is DECREFed. If the referent is NULL, ob_sval is freed. Skip From foom at fuhm.net Thu Sep 1 04:51:18 2005 From: foom at fuhm.net (James Y Knight) Date: Wed, 31 Aug 2005 22:51:18 -0400 Subject: [Python-Dev] String views (was: Re: Proof of the pudding: str.partition()) In-Reply-To: <17174.26008.116470.620022@montanaro.dyndns.org> References: <2773CAC687FD5F4689F526998C7E4E5F4DB59C@au3010avexu1.global.avaya.com> <17174.26008.116470.620022@montanaro.dyndns.org> Message-ID: <01639B6F-7F21-43D4-AD45-0F9366EEBBC0@fuhm.net> On Aug 31, 2005, at 10:21 PM, skip at pobox.com wrote: > > Tim> One of the big disadvantages of string views is that they > need to > Tim> keep the original object around, no matter how big it is. > But in > Tim> the case of partition, much of the time the original string > Tim> survives for at least a similar period to the partitions. > > Not necessarily. Presumably a string view would reference another > string > object's data buffer. A possible optimization would be to convert > from a > view to a normal string once the original strings' refcount dropped > to one, > particularly if the view's size was substantially smaller than that > of the > original string. I suspect this would be a pessimization most of the time, as it would require keeping a list of pointers to all the views referencing the string object. James From greg.ewing at canterbury.ac.nz Thu Sep 1 05:25:19 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 01 Sep 2005 15:25:19 +1200 Subject: [Python-Dev] Python 3 design principles In-Reply-To: <20050831204439.GA3775@discworld.dyndns.org> References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> Message-ID: <4316749F.6060204@canterbury.ac.nz> Charles Cazabon wrote: > Perhaps py3k could have a py2compat module. Importing it could have the > effect of (for instance) putting compile, id, and intern into the global > namespace, making print an alias for writeln, There's no way importing a module could add something that works like the old print statement, unless some serious magic is going on... Greg From greg.ewing at canterbury.ac.nz Thu Sep 1 05:32:45 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 01 Sep 2005 15:32:45 +1200 Subject: [Python-Dev] Alternative imports (Re: Python 3 design principles) In-Reply-To: <7168d65a050831132415118382@mail.gmail.com> References: <7168d65a050831132415118382@mail.gmail.com> Message-ID: <4316765D.2040306@canterbury.ac.nz> Oren Tirosh wrote: > Writing programs that run on both 2.x and 3 may require ugly > version-dependent tricks like: > > try: > compile > except NameError: > from sys import compile Just had a weird thought. What if you could write from sys or __builtin__ import compile which would be equivalent to try: from sys import compile except ImportError: from __builtin__ import compile Greg From greg.ewing at canterbury.ac.nz Thu Sep 1 05:40:56 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 01 Sep 2005 15:40:56 +1200 Subject: [Python-Dev] Proof of the pudding: str.partition() In-Reply-To: <20050831151628.8B17.JCARLSON@uci.edu> References: <20050831151628.8B17.JCARLSON@uci.edu> Message-ID: <43167848.1010907@canterbury.ac.nz> Josiah Carlson wrote: > A bit of free thought brings me to the (half-baked) idea that if string > methods accepted any object which conformed to the buffer interface; > mmap, buffer, array, ... instances could gain all of the really > convenient methods that make strings the objects to use in many cases. Not a bad idea, but they couldn't literally be string methods. They'd have to be standalone functions like we used to have in the string module before it got mercilessly deprecated. :-) Not sure what happens to this when the unicode/bytearray future arrives, though. Treating a buffer of bytes as a character string isn't going to be so straightforward then. Greg From greg.ewing at canterbury.ac.nz Thu Sep 1 05:56:11 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 01 Sep 2005 15:56:11 +1200 Subject: [Python-Dev] String views In-Reply-To: <17174.22550.862457.829100@montanaro.dyndns.org> References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> Message-ID: <43167BDB.6010002@canterbury.ac.nz> skip at pobox.com wrote: > If I then wanted to see what scheme's value > compared to, the string's comparison method would have to recognize that it > wasn't truly NUL-terminated, copy it, call strncmp() or whatever underlying > routine is used for string comparisons. Python string comparisons can't be using anything that relies on nul-termination, because Python strings can contain embedded nuls. Possibly it uses memcmp(), but that takes a length. You have a point when it comes to passing strings to other C routines, though. For those that don't have a variant which takes a maximum length, the substring type might have to keep a cached nul-terminated copy created on demand. Then the copying overhead would only be incurred if you did happen to pass a substring to such a routine. Greg From greg.ewing at canterbury.ac.nz Thu Sep 1 06:00:30 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 01 Sep 2005 16:00:30 +1200 Subject: [Python-Dev] String views In-Reply-To: <17174.23904.883698.268577@montanaro.dyndns.org> References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> <17174.23904.883698.268577@montanaro.dyndns.org> Message-ID: <43167CDE.4070206@canterbury.ac.nz> skip at pobox.com wrote: > Ah, I forgot the data is part of the PyString object itself, not stored as a > separate char* array. Without a char* in the object it's kind of hard to do > views. That wouldn't be a problem if substrings were a separate subclass of basestring with their own representation. That's probably a good idea anyway, since you wouldn't want slicing to return substrings by default -- it should be something you have to explicitly ask for. Greg From greg.ewing at canterbury.ac.nz Thu Sep 1 06:10:03 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 01 Sep 2005 16:10:03 +1200 Subject: [Python-Dev] Revising RE docs In-Reply-To: <87hdd5o5y1.fsf@tleepslib.sk.tsukuba.ac.jp> References: <20050830143542.niq7a9s8bsrkc8ok@login.werra.lunarpages.com> <87hdd5o5y1.fsf@tleepslib.sk.tsukuba.ac.jp> Message-ID: <43167F1B.6080108@canterbury.ac.nz> Stephen J. Turnbull wrote: > But you could have string objects (or a derivative) grow a > "compiled_regexp" attribute internally. That would make the core dependent on the re module, which I think would be a bad idea. Personally I like the way the compilation step is made at least somewhat explicit. Regular expressions are not strings; a string is just one way of representing a regular expression. There could potentially be other representations that compile to the same re object. Greg From greg.ewing at canterbury.ac.nz Thu Sep 1 06:30:20 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 01 Sep 2005 16:30:20 +1200 Subject: [Python-Dev] Proof of the pudding: str.partition() In-Reply-To: References: <005301c5ada1$4a52afc0$8832c797@oemcomputer> <4314CA3E.3020606@benjiyork.com> <4314E1A2.4060409@ronadam.com> <5.1.1.6.0.20050830233356.01b34118@mail.telecommunity.com> Message-ID: <431683DC.60103@canterbury.ac.nz> LD "Gus" Landis wrote: > .piece() can be both a verb and a noun Er, pardon? I don't think I've ever heard 'piece' used as a verb in English. Can you supply an example sentence? (And no, "Piece, man!" doesn't count. :-) Greg From shane at hathawaymix.org Thu Sep 1 07:17:17 2005 From: shane at hathawaymix.org (Shane Hathaway) Date: Wed, 31 Aug 2005 23:17:17 -0600 Subject: [Python-Dev] Proof of the pudding: str.partition() In-Reply-To: <431683DC.60103@canterbury.ac.nz> References: <005301c5ada1$4a52afc0$8832c797@oemcomputer> <4314CA3E.3020606@benjiyork.com> <4314E1A2.4060409@ronadam.com> <5.1.1.6.0.20050830233356.01b34118@mail.telecommunity.com> <431683DC.60103@canterbury.ac.nz> Message-ID: <43168EDD.4090001@hathawaymix.org> Greg Ewing wrote: > LD "Gus" Landis wrote: > >>.piece() can be both a verb and a noun > > > Er, pardon? I don't think I've ever heard 'piece' used > as a verb in English. Can you supply an example sentence? "After Java splintered in 20XX, diehard fans desperately pieced together the remaining fragments." Shane From stephen at xemacs.org Thu Sep 1 07:22:02 2005 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Thu, 01 Sep 2005 14:22:02 +0900 Subject: [Python-Dev] Proof of the pudding: str.partition() In-Reply-To: <431683DC.60103@canterbury.ac.nz> (Greg Ewing's message of "Thu, 01 Sep 2005 16:30:20 +1200") References: <005301c5ada1$4a52afc0$8832c797@oemcomputer> <4314CA3E.3020606@benjiyork.com> <4314E1A2.4060409@ronadam.com> <5.1.1.6.0.20050830233356.01b34118@mail.telecommunity.com> <431683DC.60103@canterbury.ac.nz> Message-ID: <87vf1lmkg5.fsf@tleepslib.sk.tsukuba.ac.jp> >>>>> "Greg" == Greg Ewing writes: Greg> Er, pardon? I don't think I've ever heard 'piece' used as a Greg> verb in English. Can you supply an example sentence? "I'll let the reader piece it together." More closely related, I've heard/seen "piece out" used for task allocation (from "piecework", maybe), and my dictionary claims you can use it in the sense of adding more pieces or filling in missing pieces. Not the connotations we want. -- School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN Ask not how you can "do" free software business; ask what your business can "do for" free software. From orent at hishome.net Thu Sep 1 07:36:55 2005 From: orent at hishome.net (Oren Tirosh) Date: Thu, 1 Sep 2005 08:36:55 +0300 Subject: [Python-Dev] Python 3 design principles In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> Message-ID: <7168d65a0508312236312f07ad@mail.gmail.com> On 9/1/05, Robert Kern wrote: > Oren Tirosh wrote: > > > While a lot of existing code will break on 3.0 it is still generally > > possible to write code that will run on both 2.x and 3.0: use only the > > "proper" forms above, do not assume the result of zip or range is a > > list, use absolute imports (and avoid static types, of course). I > > already write all my new code this way. > > > > Is this "common subset" a happy coincidence or a design principle? > > I think it's because those are the most obvious things right now. The > really radical stuff won't come up until active development on Python > 3000 actually starts. And it will, so any "common subset" will probably > not be very large. Static typing is radical stuff and doesn't hurt the common subset since it's optional. Making unicode the default is pretty radical and can be done without breaking the common subset (with the help of little tweaks like allowing str() to return unicode now like int() can return longs). Iterators and new-style classes were pretty radical changes that were managed elegantly and meet an an even stronger requirement than the common subset - they were achieved with full backward compatibility. Python 3 will most probably make big changes in the internal implementation and the C API. Perhaps it will even be generated from PyPy. I don't think keeping the common subset will really stand in the way of making big improvements. The proposed 3.x changes that break it seem more like nitpicking to me than significant improvements. Python is terrific. I find nothing I really want to change. Remove old cruft and add some brand new stuff, yes. But nothing to change. Oren From stephen at xemacs.org Thu Sep 1 07:41:45 2005 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Thu, 01 Sep 2005 14:41:45 +0900 Subject: [Python-Dev] Revising RE docs In-Reply-To: <43167F1B.6080108@canterbury.ac.nz> (Greg Ewing's message of "Thu, 01 Sep 2005 16:10:03 +1200") References: <20050830143542.niq7a9s8bsrkc8ok@login.werra.lunarpages.com> <87hdd5o5y1.fsf@tleepslib.sk.tsukuba.ac.jp> <43167F1B.6080108@canterbury.ac.nz> Message-ID: <87r7c9mjja.fsf@tleepslib.sk.tsukuba.ac.jp> >>>>> "Greg" == Greg Ewing writes: Greg> Stephen J. Turnbull wrote: >> But you could have string objects (or a derivative) grow a >> "compiled_regexp" attribute internally. Greg> That would make the core dependent on the re module, which I Greg> think would be a bad idea. Probably. Greg> Personally I like the way the compilation step is made at Greg> least somewhat explicit. Regular expressions are not Greg> strings; a string is just one way of representing a regular Greg> expression. There could potentially be other representations Greg> that compile to the same re object. I guess I agree, but I would put the emphasis elsewhere. Something like, think of the call to compile() as a declaration that this string (or other representation) represents a regular expression. The actual compilation is an accidental side effect: it could be postponed to the first call of .match() or .search(). So I guess I would prefer a nomenclature like r = re.RegExp (string) over r = re.compile (string) Not a big deal though. -- School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN Ask not how you can "do" free software business; ask what your business can "do for" free software. From jcarlson at uci.edu Thu Sep 1 08:03:22 2005 From: jcarlson at uci.edu (Josiah Carlson) Date: Wed, 31 Aug 2005 23:03:22 -0700 Subject: [Python-Dev] Proof of the pudding: str.partition() In-Reply-To: <43167848.1010907@canterbury.ac.nz> References: <20050831151628.8B17.JCARLSON@uci.edu> <43167848.1010907@canterbury.ac.nz> Message-ID: <20050831223522.8B25.JCARLSON@uci.edu> Greg Ewing wrote: > > Josiah Carlson wrote: > > > A bit of free thought brings me to the (half-baked) idea that if string > > methods accepted any object which conformed to the buffer interface; > > mmap, buffer, array, ... instances could gain all of the really > > convenient methods that make strings the objects to use in many cases. > > Not a bad idea, but they couldn't literally be string methods. > They'd have to be standalone functions like we used to have in > the string module before it got mercilessly deprecated. :-) > > Not sure what happens to this when the unicode/bytearray future > arrives, though. Treating a buffer of bytes as a character > string isn't going to be so straightforward then. Here's my thought: One could modify string methods to check the type of the input (string, unicode, or other). That check turns on a flag for whether the method returns are string, unicode, or buffers. One uses PyObject_AsBuffer() methods to pull the char* and length for any input offering the buffer protocol. Now here's the fun part: One makes the methods aware of the type of the self parameter. One sets the 'split' method for the buffer object to be 'string_split', etc. Unicode does indeed get tricky, how does one handle buffers of unicode objects? Right now, you get the raw pointer and underlying length ( len (buffer(u'hello')) == 10 ). If there was a unicode buffer (perhaps ubuffer), that would work, but I'm not sure I really like it. I notice much of the discussion on 'string views', which to me seems like another way of saying 'buffer', and if there is a 'string view', there would necessarily need to be a 'unicode view'. As for the bytes type, from what I understand, they should directly support buffers without issue. - Josiah From reinhold-birkenfeld-nospam at wolke7.net Thu Sep 1 08:18:19 2005 From: reinhold-birkenfeld-nospam at wolke7.net (Reinhold Birkenfeld) Date: Thu, 01 Sep 2005 08:18:19 +0200 Subject: [Python-Dev] Python 3 design principles In-Reply-To: <4316749F.6060204@canterbury.ac.nz> References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> Message-ID: Greg Ewing wrote: > Charles Cazabon wrote: > >> Perhaps py3k could have a py2compat module. Importing it could have the >> effect of (for instance) putting compile, id, and intern into the global >> namespace, making print an alias for writeln, > > There's no way importing a module could add something that > works like the old print statement, unless some serious > magic is going on... You'd have to enclose print arguments in parentheses. Of course, the "trailing comma" form would be lost. Reinhold -- Mail address is perfectly valid! From fredrik at pythonware.com Thu Sep 1 08:40:19 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Thu, 1 Sep 2005 08:40:19 +0200 Subject: [Python-Dev] String views (was: Re: Proof of the pudding:str.partition()) References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> Message-ID: skip at pobox.com wrote: > As a Python programmer I'd get back what look like three strings: "http", > ":", and "//www.python.org/". If each of them was a view onto part of the > original string, only the last one would truly refer to a NUL-terminated > sequence of characters. If I then wanted to see what scheme's value > compared to, the string's comparison method would have to recognize that > it > wasn't truly NUL-terminated, copy it, call strncmp() or whatever > underlying > routine is used for string comparisons. (Maybe string comparisons are > done > inline. I'm sure there are some examples where the underlying C string > routines are called.) Python strings are character buffers with a known length, not null-terminated C strings. the CPython implementation guarantees that the character buffer has a trailing NULL character, but that's mostly to make it easy to pass Python strings directly to traditional C API:s. (string views are nothing new in Python. the original Unicode string implementation supported this, but that was partially removed during integration. the type still uses a separate buffer to hold the characters, though (unlike 8-bit strings that store the characters in the string object itself)) From fredrik at pythonware.com Thu Sep 1 08:53:12 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Thu, 1 Sep 2005 08:53:12 +0200 Subject: [Python-Dev] Proof of the pudding: str.partition() References: <005301c5ada1$4a52afc0$8832c797@oemcomputer><4314CA3E.3020606@benjiyork.com> <4314E1A2.4060409@ronadam.com> <5.1.1.6.0.20050830233356.01b34118@mail.telecommunity.com> <431683DC.60103@canterbury.ac.nz> Message-ID: Greg Ewing wrote: >> .piece() can be both a verb and a noun > > Er, pardon? I don't think I've ever heard 'piece' used > as a verb in English. Can you supply an example sentence? Main Entry: 2 piece Function: transitive verb Inflected Form(s): pieced; piec·ing 1 : to repair, renew, or complete by adding pieces : PATCH 2 : to join into a whole -- often used with together - piec·er noun From kay.schluehr at gmx.net Thu Sep 1 08:55:48 2005 From: kay.schluehr at gmx.net (Kay Schluehr) Date: Thu, 01 Sep 2005 08:55:48 +0200 Subject: [Python-Dev] Python 3 design principles In-Reply-To: <7168d65a0508312236312f07ad@mail.gmail.com> References: <7168d65a050831132415118382@mail.gmail.com> <7168d65a0508312236312f07ad@mail.gmail.com> Message-ID: Oren Tirosh wrote: > Python 3 will most probably make big changes in the internal > implementation and the C API. Perhaps it will even be generated from > PyPy. Don't you think the current Python 3 "visions" becomes rather pointless with the raise of PyPy and interpreter extensions that are developed polymorphically? If the distinction between a user defined package and a language extension becomes more or less irrelevant who needs a language design committee for it's control? If someone takes the Python core in order to implement static typing it might be happen and run in a separate object space. But than, I'm almost sure, it won't be an ill-defined concept like "optional static typing" but Hindley-Milnor ( or a generalization ) which restricts dynamicity but enables type safety and static control otherwise. The idea of forking a language with a new release and thereby deevaluating older code seems somewhat archaic to me. Or the other way round: archaic materials and media like papyrus and scripture enabled communication across centurys changing slightly evolutionary and continously. Form this point of view PL development is still in a state of modernistic, youthfull irresponsibility. > I don't think keeping the common subset will really stand in the way > of making big improvements. The proposed 3.x changes that break it > seem more like nitpicking to me than significant improvements. So it seems. Kay From ncoghlan at gmail.com Thu Sep 1 11:54:23 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 01 Sep 2005 19:54:23 +1000 Subject: [Python-Dev] Python 3 design principles In-Reply-To: <7168d65a050831132415118382@mail.gmail.com> References: <7168d65a050831132415118382@mail.gmail.com> Message-ID: <4316CFCF.4030605@gmail.com> Oren Tirosh wrote: > * Replacing print with write/writeln I still hope to see this change to "make print a builtin instead of a statement". I'd hate to lose the one-line hello world example due to cruft like "from sys import stdout". Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From nick at craig-wood.com Thu Sep 1 12:00:55 2005 From: nick at craig-wood.com (Nick Craig-Wood) Date: Thu, 1 Sep 2005 11:00:55 +0100 Subject: [Python-Dev] Python 3 design principles In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <7168d65a0508312236312f07ad@mail.gmail.com> Message-ID: <20050901100054.GA5963@craig-wood.com> On Thu, Sep 01, 2005 at 08:55:48AM +0200, Kay Schluehr wrote: > The idea of forking a language with a new release and thereby > deevaluating older code seems somewhat archaic to me. Or the other way > round: archaic materials and media like papyrus and scripture enabled > communication across centurys changing slightly evolutionary and > continously. Form this point of view PL development is still in a state > of modernistic, youthfull irresponsibility. I mostly agree with that. For me personally, one of the big reasons for jumping ship from perl to python (a considerable investment in time and effort) was to avoid perl 6. Its been clear for a long time that perl 6 will be completely different to perl 5, thus making perl 5 an evolutionary dead end. Yes I know about the perl 5 on perl 6 stuff - but who wants to program in a dead language? I'm all for removing the cruft in python 3, and giving it a bit of a spring clean, but please, please don't make it feel like a different language otherwise the users will be deserting in droves (no-one likes to be told that they've been using the wrong language for all these years). If come python 3, there is a 99% accurate program which can turn your python 2.x into python 3 code, then that would ease the transition greatly. -- Nick Craig-Wood -- http://www.craig-wood.com/nick From ncoghlan at gmail.com Thu Sep 1 13:26:56 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 01 Sep 2005 21:26:56 +1000 Subject: [Python-Dev] Proof of the pudding: str.partition() In-Reply-To: <20050831172458.GB2476@discworld.dyndns.org> References: <20050831045535.4dovty96y0w0g4gg@login.werra.lunarpages.com> <20050831143046.GE522@discworld.dyndns.org> <20050831172458.GB2476@discworld.dyndns.org> Message-ID: <4316E580.3020500@gmail.com> Charles Cazabon wrote: >>also, a Boolean positional argument is a really poor clue about its meaning, >>and it's easy to misremember the sense reversed. > > > I totally agree. I therefore borrowed the time machine and modified my > proposal to suggest it should be a keyword argument, not a positional one :). The best alternative to rpartition I've encountered so far is Reinhold's proposal of a 'separator index' that selects which occurrence of the separator in the string should be used to perform the partitioning. However, even it doesn't measure up, as you will see if you read on. . . The idea is that, rather than "partition(sep)" and "rpartition(sep)", we have a single method "partition(sep, [at_sep=1])". The behaviour could be written up like this: """ Partition splits the string into three pieces (`before`, `sep`, `after`) - the part of the string before the separator, the separator itself and the part of the string after the separator. If the relevant portion of the string doesn't exist, then the corresponding element of the tuple returned is the empty string. The `at_sep` argument determines which occurence of the separator is used to perform the partitioning. The default value of 1 means the partitioning occurs at the 1st occurence of the separator. If the `at_sep` argument is negative, occurences of the separator are counted from the end of the string instead of the start. An `at_sep` value of 0 will result in the original string being returned as the part 'after' the separator. """ A concrete implementation is below. Comparing it to Raymond's examples that use rpartition, I find that the only benefit in these examples is that the use of the optional second argument is far harder to miss than the single additional letter in the method name, particularly if partition and rpartition are used close together. Interestingly, out of 31 examples in Raymond's patch, only 7 used rpartition. The implementation, however, is significantly less obvious than that for the simple version, and likely slower due to the extra conditional, the extra list created, and the need to use join. It also breaks symmetry with index/rindex and split/rsplit. Additionally, if splitting on anything other than the first or last occurence of the separator was going to be a significant use case for str.partition, wouldn't the idea have already come up in the context of str.find and str.index? I actually thought the 'at_sep' argument was a decent idea when I started writing this message, but I have found my arguments in favour of it to be wholly unconvincing, and the arguments against it perfectly sound ;) Cheers, Nick. def partition(s, sep, at_sep=1): """ Returns a three element tuple, (head, sep, tail) where: head + sep + tail == s sep == '' or sep is t bool(sep) == (t in s) # sep indicates if the string was found >>> s = 'http://www.python.org' >>> partition(s, '://') ('http', '://', 'www.python.org') >>> partition(s, '?') ('http://www.python.org', '', '') >>> partition(s, 'http://') ('', 'http://', 'www.python.org') >>> partition(s, 'org') ('http://www.python.', 'org', '') """ if not isinstance(t, basestring) or not t: raise ValueError('partititon argument must be a non-empty string') if at_sep == 0: result = ('', '', s) else: if at_sep > 0: parts = s.split(sep, at_sep) if len(parts) <= at_sep: result = (s, '', '') else: result = (sep.join(parts[:at_sep]), sep, parts[at_sep]) else: parts = s.rsplit(sep, at_sep) if len(parts) <= at_sep: result = ('', '', s) else: result = (parts[0], sep, sep.join(parts[1:])) assert len(result) == 3 assert ''.join(result) == s assert result[1] == '' or result[1] is sep return result import doctest print doctest.testmod() ================================== **** Standard lib comparisons **** ================================== =====CGIHTTPServer.py===== def run_cgi(self): """Execute a CGI script.""" dir, rest = self.cgi_info ! rest, _, query = rest.rpartition('?') ! script, _, rest = rest.partition('/') scriptname = dir + '/' + script scriptfile = self.translate_path(scriptname) if not os.path.exists(scriptfile): def run_cgi(self): """Execute a CGI script.""" dir, rest = self.cgi_info ! rest, _, query = rest.partition('?', at_sep=-1) ! script, _, rest = rest.partition('/') scriptname = dir + '/' + script scriptfile = self.translate_path(scriptname) if not os.path.exists(scriptfile): =====cookielib.py===== else: path_specified = False path = request_path(request) ! head, sep, _ = path.rpartition('/') ! if sep: if version == 0: # Netscape spec parts company from reality here ! path = head else: ! path = head + sep if len(path) == 0: path = "/" else: path_specified = False path = request_path(request) ! head, sep, _ = path.partition('/', at_sep=-1) ! if sep: if version == 0: # Netscape spec parts company from reality here ! path = head else: ! path = head + sep if len(path) == 0: path = "/" =====httplib.py===== def _set_hostport(self, host, port): if port is None: ! host, _, port = host.rpartition(':') ! if ']' not in port: # ipv6 addresses have [...] try: ! port = int(port) except ValueError: ! raise InvalidURL("nonnumeric port: '%s'" % port) else: port = self.default_port if host and host[0] == '[' and host[-1] == ']': def _set_hostport(self, host, port): if port is None: ! host, _, port = host.partition(':', at_sep=-1) ! if ']' not in port: # ipv6 addresses have [...] try: ! port = int(port) except ValueError: ! raise InvalidURL("nonnumeric port: '%s'" % port) else: port = self.default_port if host and host[0] == '[' and host[-1] == ']': =====modulefinder.py===== assert caller is parent self.msgout(4, "determine_parent ->", parent) return parent ! pname, found, _ = pname.rpartition('.') ! if found: parent = self.modules[pname] assert parent.__name__ == pname self.msgout(4, "determine_parent ->", parent) assert caller is parent self.msgout(4, "determine_parent ->", parent) return parent ! pname, found, _ = pname.partition('.', at_sep=-1) ! if found: parent = self.modules[pname] assert parent.__name__ == pname self.msgout(4, "determine_parent ->", parent) =====pdb.py===== filename = None lineno = None cond = None ! arg, found, cond = arg.partition(',') ! if found and arg: # parse stuff after comma: "condition" ! arg = arg.rstrip() ! cond = cond.lstrip() # parse stuff before comma: [filename:]lineno | function funcname = None ! filename, found, arg = arg.rpartition(':') ! if found: ! filename = filename.rstrip() f = self.lookupmodule(filename) if not f: print '*** ', repr(filename), filename = None lineno = None cond = None ! arg, found, cond = arg.partition(',') ! if found and arg: # parse stuff after comma: "condition" ! arg = arg.rstrip() ! cond = cond.lstrip() # parse stuff before comma: [filename:]lineno | function funcname = None ! filename, found, arg = arg.partition(':', at_sep=-1) ! if found: ! filename = filename.rstrip() f = self.lookupmodule(filename) if not f: print '*** ', repr(filename), ***** return if ':' in arg: # Make sure it works for "clear C:\foo\bar.py:12" ! filename, _, arg = arg.rpartition(':') try: lineno = int(arg) except: return if ':' in arg: # Make sure it works for "clear C:\foo\bar.py:12" ! filename, _, arg = arg.partition(':', at_sep=-1) try: lineno = int(arg) except: =====smtplib.py===== """ if not port and (host.find(':') == host.rfind(':')): ! host, found, port = host.rpartition(':') ! if found: try: port = int(port) except ValueError: raise socket.error, "nonnumeric port" """ if not port and (host.find(':') == host.rfind(':')): ! host, found, port = host.partition(':', at_sep=-1) ! if found: try: port = int(port) except ValueError: raise socket.error, "nonnumeric port" -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From amk at amk.ca Thu Sep 1 13:43:37 2005 From: amk at amk.ca (A.M. Kuchling) Date: Thu, 1 Sep 2005 07:43:37 -0400 Subject: [Python-Dev] python/dist/src/Lib/test test_re.py, 1.45.6.3, 1.45.6.4 In-Reply-To: References: <20050831125701.F008B1E4004@bag.python.org> Message-ID: <20050901114337.GB12006@rogue.amk.ca> On Wed, Aug 31, 2005 at 07:56:04PM -0400, Jim Jewett wrote: > What is the reasoning behind this? > > It seems to me that if a (passing) test is being added, maintenance releases > are the *most* important places to run them. In this case, it's because adding the test requires importing a new module ('pre'), which in turn would require adding a warning, which would require checking that the warning didn't mess anything else up. I thought it was a bit too much upheaval for a module that no one should be using any more. If Anthony or Guido or someone tells me to bite the bullet and make the test run, I'll do it, of course. --amk From barry at python.org Thu Sep 1 15:11:48 2005 From: barry at python.org (Barry Warsaw) Date: Thu, 01 Sep 2005 09:11:48 -0400 Subject: [Python-Dev] Python 3 design principles In-Reply-To: <4316CFCF.4030605@gmail.com> References: <7168d65a050831132415118382@mail.gmail.com> <4316CFCF.4030605@gmail.com> Message-ID: <1125580308.10343.33.camel@geddy.wooz.org> On Thu, 2005-09-01 at 05:54, Nick Coghlan wrote: > Oren Tirosh wrote: > > * Replacing print with write/writeln > > I still hope to see this change to "make print a builtin instead of a > statement". I'd hate to lose the one-line hello world example due to cruft > like "from sys import stdout". I agree. You can't get much simpler to explain or use than the current print statement. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050901/03242f50/attachment.pgp From raymond.hettinger at verizon.net Thu Sep 1 15:18:05 2005 From: raymond.hettinger at verizon.net (Raymond Hettinger) Date: Thu, 01 Sep 2005 09:18:05 -0400 Subject: [Python-Dev] C coding experiment Message-ID: <004b01c5aef7$96d936a0$4320c797@oemcomputer> If anyone wants a small, but interesting C project, let me know. The project does not require much familiarity with the CPython implementation; all that is needed are basic C coding skills and a puzzle solving mentality. The goal is to determine whether the setobject.c implementation would be improved by recoding the set_lookkey() function to optimize key insertion order using Brent's variation of Algorithm D (See Knuth vol. III, section 6.4, page 525). It has the potential to boost performance for uniquification applications with duplicate keys being identified more quickly (usually with just a single probe). The function may also result in more frequent retirement of dummy entries during insertion operations. The function can be coded from scratch or adapted from Lua's source code. Raymond From mcherm at mcherm.com Thu Sep 1 16:20:58 2005 From: mcherm at mcherm.com (Michael Chermside) Date: Thu, 01 Sep 2005 07:20:58 -0700 Subject: [Python-Dev] String views Message-ID: <20050901072058.f64zse0ondkcs08o@login.werra.lunarpages.com> Tim Delaney writes: > One of the big disadvantages of string views is that they need to keep > the original object around, no matter how big it is. But in the case of > partition, much of the time the original string survives for at least a > similar period to the partitions. Why do you say that? Didn't several of Raymond's examples use the idiom: part_1, _, s = s.partition(first_sep) part_2, _, s = s.partition(second_sep) part_3, _, s = s.partition(third_sep) --- Skip writes: > I'm skeptical about performance as well, but not for that reason. A string > object can have a referent field. If not NULL, it refers to another string > object which is INCREFed in the usual way. At string deallocation, if the > referent is not NULL, the referent is DECREFed. If the referent is NULL, > ob_sval is freed. Won't work. A string may have multiple referrents, so a single referent field isn't sufficient. --- My own contribution: I know that the Java string class has support for this. The String class contains a reference to a char array along with start and length indices. The substring() method constructs what we're calling "string views". I wonder whether there is a way to instrument a JVM to record how often the underlying buffers are shared, then run some common Java apps. Since the feature is exactly analogous to what is being proposed here, I would find such such analysis very helpful. -- Michael Chermside From guido at python.org Thu Sep 1 16:48:32 2005 From: guido at python.org (Guido van Rossum) Date: Thu, 1 Sep 2005 07:48:32 -0700 Subject: [Python-Dev] String views In-Reply-To: <43167CDE.4070206@canterbury.ac.nz> References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> <17174.23904.883698.268577@montanaro.dyndns.org> <43167CDE.4070206@canterbury.ac.nz> Message-ID: On 8/31/05, Greg Ewing wrote: > skip at pobox.com wrote: > > > Ah, I forgot the data is part of the PyString object itself, not stored as a > > separate char* array. Without a char* in the object it's kind of hard to do > > views. > > That wouldn't be a problem if substrings were a separate > subclass of basestring with their own representation. > That's probably a good idea anyway, since you wouldn't > want slicing to return substrings by default -- it > should be something you have to explicitly ask for. You all are reinventing NSString. That's the NextStep string type used by ObjC. PyObjC bridges to NSString with some difficulty. I have never used this myself, but from Donovan Preston I understand that NSString is just a base class or an interface or something like that and many different implementations / subclasses exist. Donovan has suggested that we adopt something similar for Python -- I presume in part to make his life wrapping NSString easier, but at least in part because the concept really works well in ObjC. I'm not saying to go either way yet. I'm wary of complexifications of the string implementation based on a horriffically complex implementation in ABC that was proven to be asymptotically optimal, but unfortunately was beat every time in practical applications by something much simpler, *and* the algorithm was so complex that we couldn't get the code 100% bugfree. But that was 20 years ago. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Thu Sep 1 16:58:08 2005 From: guido at python.org (Guido van Rossum) Date: Thu, 1 Sep 2005 07:58:08 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> Message-ID: [Charles Cazabon] > >> Perhaps py3k could have a py2compat module. Importing it could have the > >> effect of (for instance) putting compile, id, and intern into the global > >> namespace, making print an alias for writeln, [Greg Ewing] > > There's no way importing a module could add something that > > works like the old print statement, unless some serious > > magic is going on... [Reinhold Birkenfeld] > You'd have to enclose print arguments in parentheses. Of course, the "trailing > comma" form would be lost. And good riddance! The print statement harks back to ABC and even (unvisual) Basic. Out with it! A transitional strategy could be to start designing the new API and introduce it in Python 2.x. Here's my strawman: (1) Add two new methods the the stream (file) API and extend write(): stream.write(a1, a2, ...) -- equivalent to map(stream.write, map(str, [a1, a2, ...])) stream.writeln(a1, a2, ...) -- equivalent to stream.write(a1, a2, ..., "\n") stream.writef(fmt, a1, a2, ...) -- equivalent to stream.write(fmt % (a1, a2, ...)) (2) Add builtin functions write(), writeln(), writef() that call the corresponding method on sys.stdout. (Note: these should not just be the bound methods; assignment to sys.stdout should immediately affect those, just like for print. There's an important use case for this.) -- --Guido van Rossum (home page: http://www.python.org/~guido/) From nidoizo at yahoo.com Thu Sep 1 16:57:52 2005 From: nidoizo at yahoo.com (Nicolas Fleury) Date: Thu, 01 Sep 2005 10:57:52 -0400 Subject: [Python-Dev] Status of PEP 328 Message-ID: Hi, I would like to know what is the status of PEP 328? Can absolute_import be expected in 2.5? Any help needed? I'll be interested. Also, the content of the PEP doesn't seem to be up-to-date with the status quo, since it is mentioned support in 2.4 for "from __future__ import absolute_import". Thx and regards, Nicolas From guido at python.org Thu Sep 1 17:02:02 2005 From: guido at python.org (Guido van Rossum) Date: Thu, 1 Sep 2005 08:02:02 -0700 Subject: [Python-Dev] Python 3 design principles In-Reply-To: <20050901100054.GA5963@craig-wood.com> References: <7168d65a050831132415118382@mail.gmail.com> <7168d65a0508312236312f07ad@mail.gmail.com> <20050901100054.GA5963@craig-wood.com> Message-ID: On 9/1/05, Nick Craig-Wood wrote: > I'm all for removing the cruft in python 3, and giving it a bit of a > spring clean, but please, please don't make it feel like a different > language otherwise the users will be deserting in droves (no-one likes > to be told that they've been using the wrong language for all these > years). IMO it won't feel like a different language; syntactically, the most far-fetched change is probably dropping the print statement (on which I just opened a new thread). > If come python 3, there is a 99% accurate program which can turn your > python 2.x into python 3 code, then that would ease the transition > greatly. That might not be so easy given the desire to change most list-returning functions and methods into iterator-returning ones. This means that *most* places where you use keys() your code will still run, but *some* places you'll have to write list(d.keys()). How is the translator going to know? Worse, there's a common idiom: L = D.keys() L.sort() that should be replaced by L = sorted(D) how is the translator going to recognize that (given that there are all sorts of variations)? -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Thu Sep 1 17:06:43 2005 From: guido at python.org (Guido van Rossum) Date: Thu, 1 Sep 2005 08:06:43 -0700 Subject: [Python-Dev] Status of PEP 328 In-Reply-To: References: Message-ID: It's been approved, but AFAIK still awaiting a patch. So yes, please help! On 9/1/05, Nicolas Fleury wrote: > Hi, > > I would like to know what is the status of PEP 328? Can absolute_import > be expected in 2.5? Any help needed? I'll be interested. > > Also, the content of the PEP doesn't seem to be up-to-date with the > status quo, since it is mentioned support in 2.4 for "from __future__ > import absolute_import". > > Thx and regards, > Nicolas > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (home page: http://www.python.org/~guido/) From barry at python.org Thu Sep 1 17:25:05 2005 From: barry at python.org (Barry Warsaw) Date: Thu, 01 Sep 2005 11:25:05 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> Message-ID: <1125588305.22624.32.camel@geddy.wooz.org> On Thu, 2005-09-01 at 10:58, Guido van Rossum wrote: > [Reinhold Birkenfeld] > > You'd have to enclose print arguments in parentheses. Of course, the "trailing > > comma" form would be lost. > > And good riddance! The print statement harks back to ABC and even > (unvisual) Basic. Out with it! I have to strongly disagree. The print statement is simple, easy to understand, and easy to use. For use cases like debugging or the interactive interpreter, and even for some more complicated use cases like print>>, I think it's hard to beat the useability of print with a write() function, even if builtin. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050901/e62986a5/attachment.pgp From p.f.moore at gmail.com Thu Sep 1 18:03:55 2005 From: p.f.moore at gmail.com (Paul Moore) Date: Thu, 1 Sep 2005 17:03:55 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1125588305.22624.32.camel@geddy.wooz.org> References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> <1125588305.22624.32.camel@geddy.wooz.org> Message-ID: <79990c6b0509010903251033d5@mail.gmail.com> On 9/1/05, Barry Warsaw wrote: > On Thu, 2005-09-01 at 10:58, Guido van Rossum wrote: > > > [Reinhold Birkenfeld] > > > You'd have to enclose print arguments in parentheses. Of course, the "trailing > > > comma" form would be lost. > > > > And good riddance! The print statement harks back to ABC and even > > (unvisual) Basic. Out with it! > > I have to strongly disagree. The print statement is simple, easy to > understand, and easy to use. I agree with Barry. In particular, the behaviour of adding spaces between items is something I find very useful, and it's missing from the functional forms. print greeting, name feels much more natural to me than write(greeting, " ", name) or writef("%s %s", greeting, name) And that's even worse if the original used a literal "Hello", and only later migrated to a variable greeting - remembering to get the spaces in the right place is a pain: print "Hello", name ==> print greeting, name write("Hello ", name) ==> write(greeting, name) # oops, forgot the space or write(greeting, " ", name) # non-obvious translation OK, it's a minor thing, but what's the benefit? I've used print functions a lot in things like VBScript and Javascript, and hated them every time... Paul. From fredrik.johansson at gmail.com Thu Sep 1 18:33:18 2005 From: fredrik.johansson at gmail.com (Fredrik Johansson) Date: Thu, 1 Sep 2005 18:33:18 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> Message-ID: <3d0cebfb0509010933f4eeb47@mail.gmail.com> I like the present print statement because parentheses are inconvenient to type compared to lowercase letters, and it looks less cluttered without them. The parentheses in writeln("hello world") don't add any more meaning than a terminating semicolon would, so why are they necessary? Why not instead change the language so as to allow any function call to be written without parentheses (when this is unambiguous)? This could make Python more convenient for creating imperative-style DSLs (though I'm not sure this is a goal). In any case, I think "write" would be better than "print", because it is easier to type (at least for me; reaching for 'w' and than 'r' goes much faster than reaching for 'p'). I don't like "writeln" though, as in 9 of 10 cases I want the line break to be there. I'd rather have write add the line break, and "writeraw" or somesuch exclude it. By the way, if print has to go, then what about the assert, raise, and import statements? Should these be changed to use function call syntax as well? (By the way, assert and raise could be methods: ZeroDivisionError.assert(denom != 0). Surprising that Java doesn't do this ;-) Fredrik From reinhold-birkenfeld-nospam at wolke7.net Thu Sep 1 18:57:34 2005 From: reinhold-birkenfeld-nospam at wolke7.net (Reinhold Birkenfeld) Date: Thu, 01 Sep 2005 18:57:34 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> Message-ID: Guido van Rossum wrote: > [Charles Cazabon] >> >> Perhaps py3k could have a py2compat module. Importing it could have the >> >> effect of (for instance) putting compile, id, and intern into the global >> >> namespace, making print an alias for writeln, > > [Greg Ewing] >> > There's no way importing a module could add something that >> > works like the old print statement, unless some serious >> > magic is going on... > > [Reinhold Birkenfeld] >> You'd have to enclose print arguments in parentheses. Of course, the "trailing >> comma" form would be lost. > > And good riddance! The print statement harks back to ABC and even > (unvisual) Basic. Out with it! Here I have to agree with Barry. print is very handy, and print>> is, too. I'd rather see exec and assert becoming a function. > A transitional strategy could be to start designing the new API and > introduce it in Python 2.x. Here's my strawman: > > (1) Add two new methods the the stream (file) API and extend write(): > stream.write(a1, a2, ...) -- equivalent to map(stream.write, map(str, > [a1, a2, ...])) > stream.writeln(a1, a2, ...) -- equivalent to stream.write(a1, a2, ..., "\n") > stream.writef(fmt, a1, a2, ...) -- equivalent to stream.write(fmt % > (a1, a2, ...)) Do we really need writef()? It seems to be not much better than its %-formatting equivalent. > (2) Add builtin functions write(), writeln(), writef() that call the > corresponding method on sys.stdout. (Note: these should not just be > the bound methods; assignment to sys.stdout should immediately affect > those, just like for print. There's an important use case for this.) If write* is introduced, this is absolutely necessary. Reinhold -- Mail address is perfectly valid! From raymond.hettinger at verizon.net Thu Sep 1 19:01:15 2005 From: raymond.hettinger at verizon.net (Raymond Hettinger) Date: Thu, 01 Sep 2005 13:01:15 -0400 Subject: [Python-Dev] Proof of the pudding: str.partition() In-Reply-To: Message-ID: <001701c5af16$c3d22e40$4320c797@oemcomputer> [Steve Holden] > The collective brainpower that's been exercised on this one enhancement > already must be phenomenal, but the proposal still isn't perfect. Sure it is :-) It was never intended to replace all string parsing functions, existing or contemplated. We still have str.index() so people can do low-level index tracking, optimization, or whatnot. Likewise, str.split() and regexps remain better choices for some apps. The discussion has centered around the performance cost of returning three strings when two or fewer are actually used. >From that discussion, we can immediately eliminate the center string case as it is essentially cost-free (it is either an empty string or a reference to, not a copy of the separator argument). Another case that is no cause for concern is when one of the substrings is often, but not always empty. Consider comments stripping for example: # XXX a real parser would need to skip over # in string literals for line in open('demo.py'): line, sep, comment = line.partition('#') print line On most lines, the comment string is empty, so no time is lost copying a long substring that won't be used. On the lines with a comment, I like having it because it makes the code easier to debug/maintain (making it trivial to print, log, or store the comment string). Similar logic applies to other cases where the presence of a substring is an all or nothing proposition, such as cgi scripts extracting the command string when present: line, cmdfound, command = line.rpartition('?'). If not found, you've wasted nothing (the command string is empty). If found, you've gotten what you were going to slice-out anyway. Also, there are plenty of use cases that only involve short strings (parsing urls, file paths, splitting name/value pairs, etc). The cost of ignoring a component for these short inputs is small. That leaves the case where the strings are long and parsing is repeated with the same separator. The answer here is to simply NOT use partition(). Don't write: while s: line, _, s = s.partition(sep) . . . Instead, you almost always do better with for line in s.split(sep): . . . or with re.finditer() if memory consumption is an issue. Remember, adding partition() doesn't take away anything else you have now (even if str.find() disappears, you still have str.index()). Also, its inclusion does not preclude more specialized methods like str.before(sep) or str.after(sep) if someone is able to prove their worth. What str.partition() does do well is simplify code by encapsulating several variations of a common multi-step, low-level programming pattern. It should be accepted on that basis rather than being rejected because it doesn't also replace re.finditer() or str.split(). Because there are so many places were partition() is a clear improvement, I'm not bothered when someone concocts a case where it isn't the tool of choice. Accept it for what it is, not what it is not. Raymond From guido at python.org Thu Sep 1 19:09:30 2005 From: guido at python.org (Guido van Rossum) Date: Thu, 1 Sep 2005 10:09:30 -0700 Subject: [Python-Dev] Revising RE docs In-Reply-To: <87hdd5o5y1.fsf@tleepslib.sk.tsukuba.ac.jp> References: <20050830143542.niq7a9s8bsrkc8ok@login.werra.lunarpages.com> <87hdd5o5y1.fsf@tleepslib.sk.tsukuba.ac.jp> Message-ID: On 8/31/05, Stephen J. Turnbull wrote: > >>>>> "Michael" == Michael Chermside writes: > > Michael> (2) is what we have today, but I would prefer (1) to > Michael> gently encourage people to use the precompiled objects > Michael> (which are distinctly faster when re-used). > > Didn't Fredrik Lundh strongly imply that implicitly compiled objects > are cached? That's a pretty big speed up right there. What happened to RTSL? ("Read the Source, Luke" :) They *are* cached and there is no cost to using the functions instead of the methods unless you have so many regexps in your program that the cache is cleared (the limit is 100). -- --Guido van Rossum (home page: http://www.python.org/~guido/) From kbk at shore.net Thu Sep 1 19:12:33 2005 From: kbk at shore.net (Kurt B. Kaiser) Date: Thu, 1 Sep 2005 13:12:33 -0400 (EDT) Subject: [Python-Dev] Weekly Python Patch/Bug Summary Message-ID: <200509011712.j81HCXHG007703@bayview.thirdcreek.com> Patch / Bug Summary ___________________ Patches : 903 open (+551) / 5222 closed (+2324) / 6125 total (+2875) Bugs : 903 open (-23) / 5222 closed (+45) / 6125 total (+22) RFE : 187 open ( -3) / 184 closed ( +5) / 371 total ( +2) New / Reopened Patches ______________________ PEP 349: allow str() to return unicode (2005-08-22) http://python.org/sf/1266570 opened by Neil Schemenauer pdb: implement "until",fix for 1248119 (2005-08-23) http://python.org/sf/1267629 opened by Ilya Sandler tarfile: fix for bug #1257255 (2005-08-17) CLOSED http://python.org/sf/1262036 reopened by loewis Cache lines in StreamReader.readlines (2005-08-24) http://python.org/sf/1268314 opened by Martin v. L?wis extending os.walk to support following symlinks (2005-08-26) http://python.org/sf/1273829 opened by Erick Tryzelaar libtarfile.tex: external URL changed (2005-08-27) CLOSED http://python.org/sf/1274550 opened by Lars Gust?bel documentation fixes (2005-08-28) CLOSED http://python.org/sf/1274630 opened by George Yoshida Compile socket module under cygwin (2005-08-28) http://python.org/sf/1275079 opened by Miki Tebeka fix distutils typo "sortcut" -> "shortcut" (2005-08-29) CLOSED http://python.org/sf/1275796 opened by Wummel Adding new regrtest resource 'urlfetch' (2005-08-30) http://python.org/sf/1276356 opened by Hye-Shik Chang tarfile: adding filed that use direct device addressing (2005-08-30) http://python.org/sf/1276378 opened by Urban Purkat tkinter hello world example bug (2005-08-31) http://python.org/sf/1277677 opened by Yusuke Shinyama Patches Closed ______________ Improve %s support for unicode (2005-03-09) http://python.org/sf/1159501 closed by nascheme markupbase misses comments (bug 736659) (2004-02-20) http://python.org/sf/901369 closed by fdrake chr, ord, unichr documentation updates (2004-10-31) http://python.org/sf/1057588 closed by fdrake tarfile: fix for bug #1257255 (2005-08-17) http://python.org/sf/1262036 closed by loewis tarfile.py: set sizes of non-regular files to zero. (2005-03-22) http://python.org/sf/1168594 closed by loewis Fix pydoc crashing on unicode strings (2004-11-13) http://python.org/sf/1065986 closed by rhettinger fix for 1016880 urllib.urlretrieve silently truncates dwnld (2004-11-07) http://python.org/sf/1062060 closed by birkenfeld New tutorial tests in test_generators.py (2005-01-31) http://python.org/sf/1113421 closed by birkenfeld more __contains__ tests (2005-02-18) http://python.org/sf/1141428 closed by birkenfeld A program to scan python files and list those require coding (2003-08-06) http://python.org/sf/784089 closed by birkenfeld distutils.dir_utils.mkpath to support unicode (2005-03-21) http://python.org/sf/1167716 closed by loewis use ReleaseItanium configuration for zlib IA64 build (2005-03-09) http://python.org/sf/1160164 closed by loewis Solaris 2.5.1 _SC_PAGESIZE vs. _SC_PAGE_SIZE (2003-08-11) http://python.org/sf/786743 closed by loewis Flakey urllib2.parse_http_list (2003-11-25) http://python.org/sf/848870 closed by birkenfeld urllib2.parse_http_list bugfix 735248 (2004-02-21) http://python.org/sf/901480 closed by birkenfeld Cookie.py: One step closer to RFC 2109 (2003-11-24) http://python.org/sf/848017 closed by birkenfeld Fix for off-by-one bug in urllib.URLopener.retrieve (2003-09-21) http://python.org/sf/810023 closed by birkenfeld Allow socket.inet_aton("255.255.255.255") on Windo (2003-06-17) http://python.org/sf/756021 closed by birkenfeld libtarfile.tex: external URL changed (2005-08-27) http://python.org/sf/1274550 closed by birkenfeld documentation fixes (2005-08-27) http://python.org/sf/1274630 closed by birkenfeld fix distutils typo "sortcut" -> "shortcut" (2005-08-29) http://python.org/sf/1275796 closed by nnorwitz Patch for (Doc) bug #1212195 (2005-06-26) http://python.org/sf/1227545 closed by lemburg bltinmodule.c whitespace normalization (2005-07-21) http://python.org/sf/1242579 closed by birkenfeld fileinput openfile patch, bz2fileinput (2005-06-22) http://python.org/sf/1225466 closed by birkenfeld shutil.copytree() quits too soon after an error. (2005-07-21) http://python.org/sf/1242454 closed by birkenfeld New / Reopened Bugs ___________________ _register is not safe (2005-08-23) CLOSED http://python.org/sf/1267540 opened by Russell Owen bdist_rpm hardcodes setup.py as the script name (2005-08-23) http://python.org/sf/1267547 opened by Fernando P?rez tarfile local name is local, should be abspath (2005-08-12) CLOSED http://python.org/sf/1257255 reopened by birkenfeld crash recursive __getattr__ (2005-08-24) http://python.org/sf/1267884 opened by pinzo email.MIMEText & email.MIMEMultipart "From" in message body (2005-08-24) CLOSED http://python.org/sf/1268519 opened by Jim Kutter atexit not called for pythonservice (win32) (2005-08-24) CLOSED http://python.org/sf/1269051 opened by Hari Krishna Dara __new__ is class method (2005-08-16) CLOSED http://python.org/sf/1261229 reopened by mwh Cycle containing a Set is not GC'd [leak] (2005-08-25) CLOSED http://python.org/sf/1273504 opened by Pierre-Fr?d?ric Caillaud Encoding memory problem. (2005-08-26) CLOSED http://python.org/sf/1273892 opened by Darek Ostolski bz2module.c compiler warning (2005-08-26) http://python.org/sf/1274069 opened by Tim Peters 'setup.py install' fail on linux from read-only storage (2005-08-27) http://python.org/sf/1274324 opened by Alexander Belchenko splitunc not documented (2005-08-27) http://python.org/sf/1274828 opened by Poor Yorick dict key comparison swallows exceptions (2005-08-29) http://python.org/sf/1275608 opened by Armin Rigo add a get() method to sets (2005-08-29) CLOSED http://python.org/sf/1275677 opened by Antoine Pitrou discrepancy between str.__cmp__ and unicode.__cmp__ (2005-08-29) http://python.org/sf/1275719 opened by Antoine Pitrou error converting locale number to decimal (2005-08-30) CLOSED http://python.org/sf/1276437 opened by oswaldo 2.4.1 make fails on Solaris 10 (2005-08-30) http://python.org/sf/1276509 opened by csmuc dict('') doesn't raise a value error (2005-08-30) CLOSED http://python.org/sf/1276587 opened by Mike Foord dirutils.mkpath (verbose option does not work) (2005-08-30) http://python.org/sf/1276768 opened by gorilla_killa Sentence fragment in urlparse documentation (2005-08-30) CLOSED http://python.org/sf/1277016 opened by Chad Whitacre imaplib Imap.select() uses comparison to 'None' for boolean (2005-08-31) CLOSED http://python.org/sf/1277098 opened by Stephen Thorne Lambda and deepcopy (2005-08-31) http://python.org/sf/1277718 opened by Joshua Ginsberg logging module broken for multiple threads? (2005-08-31) http://python.org/sf/1277903 opened by Lenny G. Arbage help( ) broken, especially on Windows (2005-08-31) http://python.org/sf/1278102 opened by Bryan G. Olson invalid syntax in os.walk() first example (2005-09-01) CLOSED http://python.org/sf/1278906 opened by YoHell Bugs Closed ___________ Mistakes in decimal.Context.subtract documentation (2005-08-22) http://python.org/sf/1266296 closed by birkenfeld IDLE on Mac (2005-08-18) http://python.org/sf/1263656 closed by kbk markupbase parse_declaration cannot recognize comments (2003-05-12) http://python.org/sf/736659 closed by fdrake IDLE Freezes on Ubuntu Warty (2005-01-25) http://python.org/sf/1108992 closed by kbk Python 2.5a0 Tutorial errors and observations (2005-03-22) http://python.org/sf/1168135 closed by rhettinger _register is not safe (2005-08-24) http://python.org/sf/1267540 closed by loewis ftplib.py string index out of range (2005-03-23) http://python.org/sf/1168983 closed by vmlinuxz 'clear -1' in pdb (2005-04-29) http://python.org/sf/1192315 closed by birkenfeld 3.29 site is confusing re site-packages on Windows (2005-04-26) http://python.org/sf/1190204 closed by birkenfeld font lock keyword regular expressions (2004-12-09) http://python.org/sf/1082487 closed by rhettinger os.path.expanduser documentation wrt. empty $HOME (2005-05-02) http://python.org/sf/1193849 closed by birkenfeld mmap's resize method resizes the file in win32 but not unix (2003-04-27) http://python.org/sf/728515 closed by birkenfeld tarfile local name is local, should be abspath (2005-08-12) http://python.org/sf/1257255 closed by loewis Wrong "type()" syntax in docs (2005-01-11) http://python.org/sf/1100368 closed by rhettinger Erroneous line number error in Py2.4.1 (2005-04-07) http://python.org/sf/1178484 closed by loewis Python 2.4.1 crashes when importing the attached script (2005-08-04) http://python.org/sf/1251631 closed by loewis distutils.dir_utils not unicode compatible (2005-02-12) http://python.org/sf/1121494 closed by loewis email.MIMEText & email.MIMEMultipart "From" in message body (2005-08-24) http://python.org/sf/1268519 closed by jkutter urllib.urlretrieve silently truncates downloads (2004-08-26) http://python.org/sf/1016880 closed by birkenfeld atexit not called for pythonservice (win32) (2005-08-24) http://python.org/sf/1269051 closed by loewis urllib2 bug in proxy auth (2004-08-26) http://python.org/sf/1016563 closed by birkenfeld urllib2 parse_http_list wrong return (2003-05-09) http://python.org/sf/735248 closed by birkenfeld 8-bit string literal with iso8859 coding => crash (2004-01-01) http://python.org/sf/868864 closed by loewis add SHA256/384/512 to lib (2005-02-16) http://python.org/sf/1123660 closed by birkenfeld IDNA StreamReader broken (2005-03-14) http://python.org/sf/1163178 closed by loewis Large tarfiles cause overflow (2005-06-06) http://python.org/sf/1215928 closed by birkenfeld Crash when importing encoded file (2004-07-14) http://python.org/sf/990743 closed by loewis xrange() builtin accepts keyword arg silently (2005-02-09) http://python.org/sf/1119418 closed by birkenfeld pydoc on cgi.escape lacks info that are in www docs (2005-07-23) http://python.org/sf/1243553 closed by birkenfeld "new" not marked as deprecated in the docs (2005-07-30) http://python.org/sf/1247765 closed by birkenfeld __new__ is class method (2005-08-16) http://python.org/sf/1261229 closed by birkenfeld __new__ is class method (2005-08-16) http://python.org/sf/1261229 closed by birkenfeld minidom.py alternate newl support is broken (2005-08-17) http://python.org/sf/1262320 closed by birkenfeld Cycle containing a Set is not GC'd [leak] (2005-08-25) http://python.org/sf/1273504 closed by rhettinger shelve .sync operation not documented (2005-07-31) http://python.org/sf/1248199 closed by birkenfeld Install Error: "cannot compute sizeof (int), 77" (2005-07-16) http://python.org/sf/1239186 closed by birkenfeld Encoding memory problem. (2005-08-26) http://python.org/sf/1273892 closed by doerwalter Makefile ignores $CPPFLAGS (2005-08-14) http://python.org/sf/1258986 closed by perky error converting locale number to decimal (2005-08-30) http://python.org/sf/1276437 closed by birkenfeld Decoding with unicode_internal segfaults on UCS-4 builds (2005-08-03) http://python.org/sf/1251300 closed by doerwalter dict('') doesn't raise a value error (2005-08-30) http://python.org/sf/1276587 closed by birkenfeld Sentence fragment in urlparse documentation (2005-08-31) http://python.org/sf/1277016 closed by doerwalter imaplib Imap.select() uses comparison to 'None' for boolean (2005-08-31) http://python.org/sf/1277098 closed by pierslauder str.lower() to have an IMPORTANT NOTE or it's for magicians (2005-05-31) http://python.org/sf/1212195 closed by lemburg Seg Fault when compiling small program (2005-04-25) http://python.org/sf/1189248 closed by doerwalter dl module not installed with 2.2.3 (2003-06-30) http://python.org/sf/763007 closed by birkenfeld HTMLParser chokes on my.yahoo.com output (2003-06-26) http://python.org/sf/761452 closed by birkenfeld invalid syntax in os.walk() first example (2005-09-01) http://python.org/sf/1278906 closed by rhettinger RFE Closed __________ "with self:" statement (2004-04-01) http://python.org/sf/927543 closed by birkenfeld Keyword similar to "global" for nested scopes want (2003-11-23) http://python.org/sf/847778 closed by birkenfeld add a get() method to sets (2005-08-29) http://python.org/sf/1275677 closed by rhettinger pythonw.exe should not flash DOS windows (2003-12-04) http://python.org/sf/853698 closed by birkenfeld From raymond.hettinger at verizon.net Thu Sep 1 19:12:34 2005 From: raymond.hettinger at verizon.net (Raymond Hettinger) Date: Thu, 01 Sep 2005 13:12:34 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Message-ID: <000001c5af18$590e1c20$4320c797@oemcomputer> > Do we really need writef()? It seems to be not much better than its %- > formatting > equivalent. Actually, formatting needs to become a function. The overloading of the arithmetic mod operator has proven to be unfortunate (if only because of precedence issues). Also, the format coding scheme itself needs to be revisited. There is no shortage of people who have taken issue with the trailing s in %(myvar)s. Raymond From wtrenker at gmail.com Thu Sep 1 19:17:52 2005 From: wtrenker at gmail.com (William Trenker) Date: Thu, 1 Sep 2005 13:17:52 -0400 Subject: [Python-Dev] Proof of the pudding: str.partition() In-Reply-To: <431683DC.60103@canterbury.ac.nz> References: <005301c5ada1$4a52afc0$8832c797@oemcomputer> <4314CA3E.3020606@benjiyork.com> <4314E1A2.4060409@ronadam.com> <5.1.1.6.0.20050830233356.01b34118@mail.telecommunity.com> <431683DC.60103@canterbury.ac.nz> Message-ID: On 9/1/05, Greg Ewing wrote: > LD "Gus" Landis wrote: > > .piece() can be both a verb and a noun > > Er, pardon? I don't think I've ever heard 'piece' used > as a verb in English. Can you supply an example sentence? > - assemble: make by putting pieces together; "She pieced a quilt" - repair by adding pieces; "She pieced the china cup" wordnet.princeton.edu/perl/webwn Cheers, Bill From paolo_veronelli at libero.it Thu Sep 1 19:58:40 2005 From: paolo_veronelli at libero.it (Paolino) Date: Thu, 01 Sep 2005 19:58:40 +0200 Subject: [Python-Dev] itertools.chain should take an iterable ? Message-ID: <43174150.5080002@libero.it> Working on a tree library I've found myself writing itertools.chain(*[child.method() for child in self]). Well this happened after I tried instinctively itertools.chain(child.method() for child in self). Is there a reason for this signature ? Regards paolino From jack at performancedrivers.com Thu Sep 1 19:35:19 2005 From: jack at performancedrivers.com (Jack Diederich) Date: Thu, 1 Sep 2005 13:35:19 -0400 Subject: [Python-Dev] itertools.chain should take an iterable ? In-Reply-To: <43174150.5080002@libero.it> References: <43174150.5080002@libero.it> Message-ID: <20050901173518.GE6140@performancedrivers.com> On Thu, Sep 01, 2005 at 07:58:40PM +0200, Paolino wrote: > Working on a tree library I've found myself writing > itertools.chain(*[child.method() for child in self]). > Well this happened after I tried instinctively > itertools.chain(child.method() for child in self). > > Is there a reason for this signature ? This is more suited to comp.lang.python Consider the below examples (and remember that strings are iterable) >>> import itertools as it >>> list(it.chain('ABC', 'XYZ')) ['A', 'B', 'C', 'X', 'Y', 'Z'] >>> list(it.chain(['ABC', 'XYZ'])) ['ABC', 'XYZ'] >>> list(it.chain(['ABC'], ['XYZ'])) ['ABC', 'XYZ'] >>> Hope that helps, -jackdied From eric.nieuwland at xs4all.nl Thu Sep 1 19:37:27 2005 From: eric.nieuwland at xs4all.nl (Eric Nieuwland) Date: Thu, 1 Sep 2005 19:37:27 +0200 Subject: [Python-Dev] partition() (was: Remove str.find in 3.0?) In-Reply-To: <004c01c5ad9f$65567f60$8832c797@oemcomputer> References: <004c01c5ad9f$65567f60$8832c797@oemcomputer> Message-ID: <2cc8bde17bcdf3acd3ab8fc29d4c2095@xs4all.nl> Raymond Hettinger wrote: >> I think it's convenient but also rather odd that split() with a static >> string argument was moved from module string to a method in class str, >> while split() with a regexp has remained in module re. > > I don't see what you find odd. With str and unicode objects being > builtin, you don't need a separate module. In contrast, re is a > stand-alone extension which, of course, requires an import. That's an implementation oriented view. IMHO it is all a match-and-cut operation with fixed strings the simplest form of match expressions. From that point of view the distinction between the two is quite arbitrary. Of course, when turning from principles to daily practice again it is quite clear the distinction is useful. --eric From guido at python.org Thu Sep 1 19:56:58 2005 From: guido at python.org (Guido van Rossum) Date: Thu, 1 Sep 2005 10:56:58 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <000001c5af18$590e1c20$4320c797@oemcomputer> References: <000001c5af18$590e1c20$4320c797@oemcomputer> Message-ID: On 9/1/05, Raymond Hettinger wrote: > > Do we really need writef()? It seems to be not much better than its %- > > formatting > > equivalent. > > Actually, formatting needs to become a function. The overloading of the > arithmetic mod operator has proven to be unfortunate (if only because of > precedence issues). For me, it's not so much the precedence, but the fact that "%s" % x doesn't work as expected if x is a tuple; you'd have to write "%s" % (x,) which is tedious. > Also, the format coding scheme itself needs to be revisited. There is > no shortage of people who have taken issue with the trailing s in > %(myvar)s. Maybe the syntax used in the string.Template class is the way to go? -- --Guido van Rossum (home page: http://www.python.org/~guido/) From raymond.hettinger at verizon.net Thu Sep 1 20:20:40 2005 From: raymond.hettinger at verizon.net (Raymond Hettinger) Date: Thu, 01 Sep 2005 14:20:40 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Message-ID: <000401c5af21$dc603000$4320c797@oemcomputer> > > Actually, formatting needs to become a function. The overloading of the > > arithmetic mod operator has proven to be unfortunate (if only because of > > precedence issues). > > For me, it's not so much the precedence, but the fact that "%s" % x > doesn't work as expected if x is a tuple; you'd have to write "%s" % > (x,) which is tedious. Right. That too. > > Also, the format coding scheme itself needs to be revisited. There is > > no shortage of people who have taken issue with the trailing s in > > %(myvar)s. > > Maybe the syntax used in the class is the way to go? string.Template is a bit too simplified. But perhaps it can be adapted. We still want some way to express %r, %6.2f, etc. Since string formatting has been around since Tim was in diapers, we should probably start by looking at the solutions used by other languages. With Py3.0, we have a real opportunity to break-away from doing things the way C does it. Raymond From guido at python.org Thu Sep 1 20:32:33 2005 From: guido at python.org (Guido van Rossum) Date: Thu, 1 Sep 2005 11:32:33 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <000401c5af21$dc603000$4320c797@oemcomputer> References: <000401c5af21$dc603000$4320c797@oemcomputer> Message-ID: On 9/1/05, Raymond Hettinger wrote: > string.Template is a bit too simplified. But perhaps it can be adapted. > We still want some way to express %r, %6.2f, etc. Since string > formatting has been around since Tim was in diapers, we should probably > start by looking at the solutions used by other languages. With Py3.0, > we have a real opportunity to break-away from doing things the way C > does it. Hrm. Most other languages these days do floating point formatting the way C does it. I'm happy to look for other ways to invoke the thing, but I think that we shouldn't tinker with %6.2f. (In fact, the major complaint is about the one place where I *did* tinker with it -- %(boo)s.) Maybe the ${boo} form can be extended to allow ${boo%6.2f} ??? Unfortunately that would prevent a different extension of ${boo}: %{boo+far}. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Thu Sep 1 20:33:40 2005 From: guido at python.org (Guido van Rossum) Date: Thu, 1 Sep 2005 11:33:40 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <18a9eedaee49fc776b11f8123a652e1d@xs4all.nl> References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> <18a9eedaee49fc776b11f8123a652e1d@xs4all.nl> Message-ID: (Please don't send private replies.) On 9/1/05, Eric Nieuwland wrote: > I have a lot of code that uses read()/write() to for binary file access. > Will that break by this change? > If so, I'd like to propose writes() instead of write() as proposed. No, that's the beauty. (Assuming the file is opened in binary mode.) -- --Guido van Rossum (home page: http://www.python.org/~guido/) From janssen at parc.com Thu Sep 1 21:03:23 2005 From: janssen at parc.com (Bill Janssen) Date: Thu, 1 Sep 2005 12:03:23 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Thu, 01 Sep 2005 07:58:08 PDT." Message-ID: <05Sep1.120325pdt."58617"@synergy1.parc.xerox.com> I have to agree with Barry, Paul, Fredrik, Reinhold, etc. Removing the "print" statement would immediately break at a fundamental level 15 years of tutorials, books, and examples, many of which start with >>> print "Hello, World!" Of course, maybe that's the idea -- this is not your father's Python! (Though that slogan apparently didn't work out so well for Oldsmobile...) Is there a syntax trick here? Suppose start-of-the-line function names not followed by an open-paren, but followed by comma-separated lists of expressions, were treated as if the rest of the line were arguments to a function. That is, suppose print "foo", 3, dir(sys) was automagically converted to print ("foo", 3, dir(sys)) Not sure I like this kind of trickyness, though. Though it might be useful for "assert", "raise", maybe "exec", too. I also don't quite see the point of adding new top-level reserved words or built-in functions like "write". It clutters the namespace without being much of an improvement over "sys.stdout.write", IMO. Some kind of printf would be nice to have, but with Python's forgiving syntax is easy enough to add yourself. > Maybe the syntax used in the string.Template class is the way to go? If you'd consider extending the Template syntax to positional parameters ($1, $2, etc.), then perhaps "print" could be modified to use the template for formatting, if it occurs as the first argument: print string.Template("arg $1, arg $2"), arg1, arg2 with an alternate form printf "arg $1, arg $2", arg1, arg2 where the first arg is required to be a template pattern string. This is a nice improvement over C printf in that you can re-use arguments. What happens to Template() when the string module goes away? Do we write "arg $1, arg $2".Template() instead? Bill From janssen at parc.com Thu Sep 1 21:39:51 2005 From: janssen at parc.com (Bill Janssen) Date: Thu, 1 Sep 2005 12:39:51 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Thu, 01 Sep 2005 07:58:08 PDT." Message-ID: <05Sep1.124000pdt."58617"@synergy1.parc.xerox.com> > And good riddance! The print statement harks back to ABC and even > (unvisual) Basic. Out with it! Guido, After reviewing the PEP 3000 notes, I can find no justification there for removing "print" other than your statement here -- that it has served honorably and well in many programming languages for many years, a curious reason for abandoning it. There's a pointer to Python Regrets, but that document contains no justification for the change. (Actually, using pointers to Powerpoint slides to explain/justify anything is, er... -- what's a polite euphemism for "a sign of a weak mind"? :-) I agree that "print" is already a bit peculiar, but so what? If we wanted Scheme, we'd be programming in Scheme, not Python. The only other parts of PEP 3000 I take issue with are the removal of "reduce" (a little) and "lambda" (a bit more seriously). I use reduce a lot, but it's easy enough to cobble together oneself, given the changes in Python over the last 10 years. Bill From shane at hathawaymix.org Thu Sep 1 22:02:39 2005 From: shane at hathawaymix.org (Shane Hathaway) Date: Thu, 01 Sep 2005 14:02:39 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <000401c5af21$dc603000$4320c797@oemcomputer> Message-ID: <43175E5F.6070808@hathawaymix.org> Guido van Rossum wrote: > On 9/1/05, Raymond Hettinger wrote: > >>string.Template is a bit too simplified. But perhaps it can be adapted. >>We still want some way to express %r, %6.2f, etc. Since string >>formatting has been around since Tim was in diapers, we should probably >>start by looking at the solutions used by other languages. With Py3.0, >>we have a real opportunity to break-away from doing things the way C >>does it. > > > Hrm. Most other languages these days do floating point formatting the > way C does it. I'm happy to look for other ways to invoke the thing, > but I think that we shouldn't tinker with %6.2f. (In fact, the major > complaint is about the one place where I *did* tinker with it -- > %(boo)s.) > > Maybe the ${boo} form can be extended to allow ${boo%6.2f} ??? > > Unfortunately that would prevent a different extension of ${boo}: %{boo+far}. May I also suggest the following shortcut for creating and evaluating a string template. (Ever since I thought of this, I've actually used this in code without thinking... it's just too natural): message = $"Hello, $name!" Shane From guido at python.org Thu Sep 1 22:07:39 2005 From: guido at python.org (Guido van Rossum) Date: Thu, 1 Sep 2005 13:07:39 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <-2729100304010349131@unknownmsgid> References: <-2729100304010349131@unknownmsgid> Message-ID: On 9/1/05, Bill Janssen wrote: > After reviewing the PEP 3000 notes, I can find no justification there > for removing "print" other than your statement here -- that it has > served honorably and well in many programming languages for many > years, a curious reason for abandoning it. Some reasons to drop it have to do with the arcane syntax: (a) the trailing comma (is there anyone who likes that?), and (b) the optional ">>file" part. While I've always defended the latter against powerful opposition, that was only because print was already a statement and I find it important to have a way to do whatever print does to an arbitrary file. Of course, if print() were a function, we wouldn't need special syntax, we could just use stream.print() with the same signature; so that's one argument for dropping the syntax. Another real problem with print is that, while the automatic insertion of spaces is nice for beginners, it often gets in the way, and what you have to do to avoid this is pretty nasty: either drop print altogether in favor of sys.stdout.write(), or use string concatenation or a format string, assuming you have all the pieces available at the same time (which often you don't). Surely you don't want to suggest an extension, for example doubling the comma could make the extra space go away... :-) It looks to me like most arguments for keeping print are motivated by backwards compatibility (in its many guises, like the existence of 15 years of tutorials) and not by what would be best if we were to design a language from scratch. It seems to me that, as long as write() and writeln() were built-ins taking multiple args, teaching a beginner to use >>> writeln("The answer is: ", 4+4) is perfectly clear (and might be a gentle introduction to function calls as well). I've been thinking about some ancient Python history recently, which reminded me that one theme in Python's design is to have a minimalist syntax without being syntax-free like Lisp. (In a very early version of Python, 'dir' was a statement, just so that I wouldn't have to type the parentheses. Glad I dropped that one!) I really believe that dropping print in favor of a few built-in functions is an improvement -- backwards compatibility be damned! -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Thu Sep 1 22:15:03 2005 From: guido at python.org (Guido van Rossum) Date: Thu, 1 Sep 2005 13:15:03 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <3d0cebfb0509010933f4eeb47@mail.gmail.com> References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> <3d0cebfb0509010933f4eeb47@mail.gmail.com> Message-ID: On 9/1/05, Fredrik Johansson wrote: > Why not instead change the language so as to allow any function call > to be written without parentheses (when this is unambiguous)? This > could make Python more convenient for creating imperative-style DSLs > (though I'm not sure this is a goal). Given all the other syntax it would be too ambiguous. If you really want this, please sit down and design a grammar. If you can't do that, just believe me that it would be too nasty (with too many exceptional cases) to bother. > In any case, I think "write" would be better than "print", because it > is easier to type (at least for me; reaching for 'w' and than 'r' goes > much faster than reaching for 'p'). I don't like "writeln" though, as > in 9 of 10 cases I want the line break to be there. I'd rather have > write add the line break, and "writeraw" or somesuch exclude it. Yuck. Also, write() and writeln() have a long history going back to Pascal. Java has print() and println(). Plus stream.write("abc") already has a meaning, and the elegance of my proposal is that that meaning remains unchanged. > By the way, if print has to go, then what about the assert, raise, and > import statements? Should these be changed to use function call syntax > as well? (By the way, assert and raise could be methods: > ZeroDivisionError.assert(denom != 0). Surprising that Java doesn't do > this ;-) It can't work for import because it defines a new name; if import were a function, then import(foo) would necessarily mean to evaluate foo first, which isn't what you want. It could work for raise (and even for break and continue) but I'd rather keep control flow as statements; you never know what the compiler could do with the information that a particular block doesn't contain a raise statement. It can't work for assert because you don't want the argument to be evaluated in -O mode. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From steven.bethard at gmail.com Thu Sep 1 22:23:28 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Thu, 1 Sep 2005 14:23:28 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <79990c6b0509010903251033d5@mail.gmail.com> References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> <1125588305.22624.32.camel@geddy.wooz.org> <79990c6b0509010903251033d5@mail.gmail.com> Message-ID: [Guido van Rossum] > And good riddance! The print statement harks back to ABC and even > (unvisual) Basic. Out with it! [Barry Warsaw] > I have to strongly disagree. The print statement is simple, easy to > understand, and easy to use. [Paul Moore] > I agree with Barry. In particular, the behaviour of adding spaces > between items is something I find very useful, and it's missing from > the functional forms. While I agree that mostly the print statement is "simple, easy to understand, and easy to use", I've seen the trailing-comma version cause confusion for a lot of newbies. I wouldn't mind at all if the trailing-comma version disappeared in Python 3.0 -- if you need this kind of complicated output, you can always use sys.stdout.write and/or string formatting. The spaces-between-items point that Paul Moore makes is IMHO the best argument against the proposed write*() functions. I think we *do* need a statement or function of some sort that does the most basic task: writing a line to sys.stdout that calls str() on each of the elements and joins them with spaces. That is, I think we need to keep *something* with functionality like: def XXX(*args): sys.stdout.write('%s\n' % ' '.join(str(a) for a in args)) Note that this would keep the Hello World example simple: XXX(greeting, name) STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From janssen at parc.com Thu Sep 1 22:30:14 2005 From: janssen at parc.com (Bill Janssen) Date: Thu, 1 Sep 2005 13:30:14 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Thu, 01 Sep 2005 13:07:39 PDT." Message-ID: <05Sep1.133021pdt."58617"@synergy1.parc.xerox.com> I don't use "print" myself much, but for the occasional 3-line script. But I think the user-friendliness of it is a good point, and makes up for the weirdness of it all. There's something nice about being able to write print "the answer is", 3*4+10 which is one of the reasons ABC and BASIC have it that way. > Another real problem with print is that, while the automatic insertion > of spaces is nice for beginners, it often gets in the way I agree; why not just drop that feature for Python 3.0? > It looks to me like most arguments for keeping print are motivated by > backwards compatibility (in its many guises, like the existence of 15 > years of tutorials) and not by what would be best if we were to design > a language from scratch. Well, heck, if we were designing a language from scratch, would we start with Python? I rather liked SchemeXerox. This is Python 3.0, after all, not BizarroLang 1.0. IMO the novice usability of it, combined with the existence of other alteratives for experienced programmers, combined with a tip of the hat to Python's noble history (what you refer to as "backwards compatibility"), keeps it in. Bill From python at discworld.dyndns.org Thu Sep 1 22:42:52 2005 From: python at discworld.dyndns.org (Charles Cazabon) Date: Thu, 1 Sep 2005 14:42:52 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <05Sep1.124000pdt."58617"@synergy1.parc.xerox.com> References: <05Sep1.124000pdt."58617"@synergy1.parc.xerox.com> Message-ID: <20050901204252.GB12384@discworld.dyndns.org> Bill Janssen wrote: > > And good riddance! The print statement harks back to ABC and even > > (unvisual) Basic. Out with it! I'm with Guido on this, BTW. > After reviewing the PEP 3000 notes, I can find no justification there > for removing "print" Well, how about the fact that basically all of Python's statements are for implementing logic (if, while, etc), controlling flow (return, yield, try, etc), and defining structure (def, class, etc). `print` stands pretty much alone as a statement which does none of these things -- in fact, it does nothing for the program but merely has the interesting side-effect of writing to stdout. It's an anomaly. It stands out in the language as a sore thumb waiting for Guido's hammer. Charles -- ----------------------------------------------------------------------- Charles Cazabon GPL'ed software available at: http://pyropus.ca/software/ ----------------------------------------------------------------------- From janssen at parc.com Thu Sep 1 22:43:06 2005 From: janssen at parc.com (Bill Janssen) Date: Thu, 1 Sep 2005 13:43:06 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Thu, 01 Sep 2005 12:03:23 PDT." <05Sep1.120325pdt."58617"@synergy1.parc.xerox.com> Message-ID: <05Sep1.134316pdt."58617"@synergy1.parc.xerox.com> I see this is Fredrik's earlier suggestion. Bill I (reduntantly) wrote: > Is there a syntax trick here? Suppose start-of-the-line function > names not followed by an open-paren, but followed by comma-separated > lists of expressions, were treated as if the rest of the line were > arguments to a function. That is, suppose > > print "foo", 3, dir(sys) > > was automagically converted to > > print ("foo", 3, dir(sys)) From python at discworld.dyndns.org Thu Sep 1 22:46:13 2005 From: python at discworld.dyndns.org (Charles Cazabon) Date: Thu, 1 Sep 2005 14:46:13 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <05Sep1.133021pdt."58617"@synergy1.parc.xerox.com> References: <05Sep1.133021pdt."58617"@synergy1.parc.xerox.com> Message-ID: <20050901204613.GC12384@discworld.dyndns.org> Bill Janssen wrote: > I don't use "print" myself much, but for the occasional 3-line script. > But I think the user-friendliness of it is a good point, and makes up > for the weirdness of it all. There's something nice about being able > to write > > print "the answer is", 3*4+10 > > which is one of the reasons ABC and BASIC have it that way. Providing you can live with adding a pair of parentheses to that, you can have: def print(*args): sys.stdout.write(' '.join(args) + '\n') I think the language would be cleaner if it lacked this weird exception for `print`. Charles -- ----------------------------------------------------------------------- Charles Cazabon GPL'ed software available at: http://pyropus.ca/software/ ----------------------------------------------------------------------- From rrr at ronadam.com Thu Sep 1 23:00:51 2005 From: rrr at ronadam.com (Ron Adam) Date: Thu, 01 Sep 2005 17:00:51 -0400 Subject: [Python-Dev] Python 3 design principles In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> Message-ID: <43176C03.7090904@ronadam.com> Reinhold Birkenfeld wrote: > Greg Ewing wrote: > >>Charles Cazabon wrote: >> >> >>>Perhaps py3k could have a py2compat module. Importing it could have the >>>effect of (for instance) putting compile, id, and intern into the global >>>namespace, making print an alias for writeln, >> >>There's no way importing a module could add something that >>works like the old print statement, unless some serious >>magic is going on... > > > You'd have to enclose print arguments in parentheses. Of course, the "trailing > comma" form would be lost. > > Reinhold The trailing comma is convenient, but I don't think it's that big of a deal to have two methods. ui.write() ui.writeln() # or ui.print() I'm +1 on making it a method of a "user interface object". Not just a function. I want to be able to import an interface, then communicate to it in a consistent way even though it may look quite different on the screen. Having a set of standard io methods moves in that direction I think. import console ui = console() ui.write("Hello World\n") howami = ui.input("How are you today? %s") import popup ui = popup('YesNo') # Create a 'YesNo' popup. ok = ui.input('Ok to proceed?') # Open it and wait for it. ok2 = ui.input('Are you sure?') # Reopen it and reuse it. if ok == ok2 == 'Yes': ... Some possible common methods... ui.write(data) # non blocking print/output, doesn't wait ui.send() # non echo write; passwords, config, etc.. ui.input(prompt) # output something and wait for return value ui.get() # non echo wait for value, or io.next() ui.read() # non blocking get As for functions without '()'s. (Just a thought) You could use '<<' or '<<<' (or other symbol) as a way to move data between objects. ui.write <<< 'Hello World/n' # ui.write('Hello World/n') ui.writeln <<< counter # ui.writeln(counter.next()) ok = ui.input <<< 'press a key:' # ok = ui.input('press a key:') The requirement could be that the item on the left is a callable, and the item on the right is a sequence or generator. Cheers, Ron From bjourne at gmail.com Thu Sep 1 23:11:16 2005 From: bjourne at gmail.com (=?ISO-8859-1?Q?BJ=F6rn_Lindqvist?=) Date: Thu, 1 Sep 2005 23:11:16 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <20050901204613.GC12384@discworld.dyndns.org> References: <20050901204613.GC12384@discworld.dyndns.org> Message-ID: <740c3aec05090114115a9bc56c@mail.gmail.com> Something I've noticed from teaching C++ to newbies, is that you should NOT (never ever) start with "cout << "Hello, world!" << endl;". You should start with "printf("Hello, world\n");" The cout thingy is impossible to explain to a newbie because it uses much underlying "magic" and has a very different behaviour from everything else a newbie sees in C++. It therefore causes lots of confusion. I wonder if the magic of "print" might have the same effect on newcomers to Python, whos first experience is "print 'Hello, world!'"... It would be very interesting to know. -- mvh Bj?rn From jack at performancedrivers.com Thu Sep 1 23:11:21 2005 From: jack at performancedrivers.com (Jack Diederich) Date: Thu, 1 Sep 2005 17:11:21 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <20050901204613.GC12384@discworld.dyndns.org> References: <05Sep1.133021pdt."58617"@synergy1.parc.xerox.com> <20050901204613.GC12384@discworld.dyndns.org> Message-ID: <20050901211121.GF6140@performancedrivers.com> On Thu, Sep 01, 2005 at 02:46:13PM -0600, Charles Cazabon wrote: > Bill Janssen wrote: > > I don't use "print" myself much, but for the occasional 3-line script. > > But I think the user-friendliness of it is a good point, and makes up > > for the weirdness of it all. There's something nice about being able > > to write > > > > print "the answer is", 3*4+10 > > > > which is one of the reasons ABC and BASIC have it that way. I don't use print much. For online applications I call a socket write or for web apps store up all the HTML in a buffer and only write it out at the end (to allow code anywhere to raise a Redirect exception). I don't use print for quick and dirty debugging, but this def dump(*args): sys.stderr.write('%s\n' % (repr(args))) > Providing you can live with adding a pair of parentheses to that, you can > have: > > def print(*args): > sys.stdout.write(' '.join(args) + '\n') > > I think the language would be cleaner if it lacked this weird exception for > `print`. Me too, for real usage. Tutorials would get messier but how quickly do people move on from those anyway? -jackdied From janssen at parc.com Thu Sep 1 23:13:41 2005 From: janssen at parc.com (Bill Janssen) Date: Thu, 1 Sep 2005 14:13:41 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Thu, 01 Sep 2005 13:46:13 PDT." <20050901204613.GC12384@discworld.dyndns.org> Message-ID: <05Sep1.141341pdt."58617"@synergy1.parc.xerox.com> > Providing you can live with adding a pair of parentheses to that, you can > have: > > def print(*args): > sys.stdout.write(' '.join(args) + '\n') > > I think the language would be cleaner if it lacked this weird exception for > `print`. Charles, I agree that it would be cleaner. I just don't think cleanliness is all that interesting -- usefulness trumps it every time. And if cleanliness was the answer, there would be larger changes to make -- like removing all the syntax variations by standardizing on a common syntax like Lisp's. Bill From fredrik at pythonware.com Thu Sep 1 23:12:57 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Thu, 1 Sep 2005 23:12:57 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <05Sep1.124000pdt."58617"@synergy1.parc.xerox.com> <20050901204252.GB12384@discworld.dyndns.org> Message-ID: Charles Cazabon wrote: > in fact, it does nothing for the program but merely has the interesting > side-effect of writing to stdout. yeah, real programmers don't generate output. From reinhold-birkenfeld-nospam at wolke7.net Thu Sep 1 23:15:04 2005 From: reinhold-birkenfeld-nospam at wolke7.net (Reinhold Birkenfeld) Date: Thu, 01 Sep 2005 23:15:04 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <000001c5af18$590e1c20$4320c797@oemcomputer> References: <000001c5af18$590e1c20$4320c797@oemcomputer> Message-ID: Raymond Hettinger wrote: >> Do we really need writef()? It seems to be not much better than its %- >> formatting >> equivalent. > > Actually, formatting needs to become a function. The overloading of the > arithmetic mod operator has proven to be unfortunate (if only because of > precedence issues). But then, a format() function would be necessary (equivalent to sprintf) > Also, the format coding scheme itself needs to be revisited. There is > no shortage of people who have taken issue with the trailing s in > %(myvar)s. That's a nuisance, right. Reinhold -- Mail address is perfectly valid! From jack at performancedrivers.com Thu Sep 1 23:27:39 2005 From: jack at performancedrivers.com (Jack Diederich) Date: Thu, 1 Sep 2005 17:27:39 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050901204252.GB12384@discworld.dyndns.org> Message-ID: <20050901212739.GG6140@performancedrivers.com> On Thu, Sep 01, 2005 at 11:12:57PM +0200, Fredrik Lundh wrote: > Charles Cazabon wrote: > > > in fact, it does nothing for the program but merely has the interesting > > side-effect of writing to stdout. > > yeah, real programmers don't generate output. > I'd say: yeah, real programmers don't generate output _to stdout_ sockets, GUI widgets, buffers? sure. stdout? Almost never. Most of these don't have write() methods so I've never had a reason to use the "print >>" syntax. -jackdied From steven.bethard at gmail.com Thu Sep 1 23:29:32 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Thu, 1 Sep 2005 15:29:32 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <000001c5af18$590e1c20$4320c797@oemcomputer> Message-ID: Reinhold Birkenfeld wrote: > Raymond Hettinger wrote: > > Actually, formatting needs to become a function. The overloading of the > > arithmetic mod operator has proven to be unfortunate (if only because of > > precedence issues). > > But then, a format() function would be necessary (equivalent to sprintf) Does it have to be a function? I'd expect it to be a method, like string.Template. E.g >>> '%s: %i'.substitute('badger', 42) badger: 42 >>> '%(name)s: %(count)i'.substitute(name='badger', count=42) badger: 42 BTW, I'm quite happy with the current string formatting format. I certainly haven't "taken issue with the trailing s in %(myvar)s". If it wasn't there, when it is for %(count)i and %(ratio)f, I'd probably wonder why. STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From python at discworld.dyndns.org Thu Sep 1 23:46:20 2005 From: python at discworld.dyndns.org (Charles Cazabon) Date: Thu, 1 Sep 2005 15:46:20 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050901204252.GB12384@discworld.dyndns.org> Message-ID: <20050901214620.GA12839@discworld.dyndns.org> Fredrik Lundh wrote: > Charles Cazabon wrote: > > > in fact, it does nothing for the program but merely has the interesting > > side-effect of writing to stdout. > > yeah, real programmers don't generate output. That wasn't quite my point - I meant that the rest of Python's statements (to a one) all have a quite fundamental impact on what the code in question means. `print` doesn't. I write data filters in Python all the time -- but I virtually never use `print`. stdout.write() is more consistent /and/ parallel to stdin.read(). `print` should go away, at least as a statement. Charles -- ----------------------------------------------------------------------- Charles Cazabon GPL'ed software available at: http://pyropus.ca/software/ ----------------------------------------------------------------------- From bob at redivi.com Thu Sep 1 23:49:56 2005 From: bob at redivi.com (Bob Ippolito) Date: Thu, 1 Sep 2005 14:49:56 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <20050901212739.GG6140@performancedrivers.com> References: <20050901204252.GB12384@discworld.dyndns.org> <20050901212739.GG6140@performancedrivers.com> Message-ID: <7C58F4DB-E15B-497A-A4F5-563B5CF566FF@redivi.com> On Sep 1, 2005, at 2:27 PM, Jack Diederich wrote: > On Thu, Sep 01, 2005 at 11:12:57PM +0200, Fredrik Lundh wrote: > >> Charles Cazabon wrote: >> >> >>> in fact, it does nothing for the program but merely has the >>> interesting >>> side-effect of writing to stdout. >>> >> >> yeah, real programmers don't generate output. >> >> > I'd say: > yeah, real programmers don't generate output _to stdout_ > > sockets, GUI widgets, buffers? sure. stdout? Almost never. > Most of these don't have write() methods so I've never had a reason to > use the "print >>" syntax. That is absolutely true, print is becoming less and less useful in the context of GUI or web applications. Even in Just Debugging scenarios, you're probably better off using something with more flexibility, such as the logging module. Additionally, the fact that sys.stdout is for bytes and not a text (unicode) makes it even more complicated. You can, of course, replace sys.stdout with an encoding-aware wrapper via codecs.getwriter (), but that's often inconvenient. -bob From listsub at wickedgrey.com Thu Sep 1 23:56:40 2005 From: listsub at wickedgrey.com (Eli Stevens (WG.c)) Date: Thu, 01 Sep 2005 14:56:40 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <-2729100304010349131@unknownmsgid> Message-ID: <43177918.10504@wickedgrey.com> Guido van Rossum wrote: > It seems to me that, as long as write() and writeln() were built-ins > taking multiple args, teaching a beginner to use > > >>>>writeln("The answer is: ", 4+4) > > > is perfectly clear (and might be a gentle introduction to function > calls as well). Hello, I'm Eli Stevens, and this is my first real post to python-dev (bio below). I've got two ideas relating to formatting that I'd thought I'd air out. The first is to add a separator argument to write[ln]: >>> def w(*args, **kwargs): ... nextsep = "" ... sep = kwargs.get("sep","") ... output = "" ... for item in args: ... output += nextsep ... nextsep = sep ... output += str(item) ... print output ... >>> w("foo", "bar") foobar >>> w("foo", "bar", sep=" ") foo bar >>> w("foo", "bar", sep="\n") foo bar >>> w("foo", "bar", sep="\t") foo bar >>> I'd have found this handy just this week at work. Not a huge deal to work around, obviously, but it would have been handy. The second is something along the lines of: >>> f = 3.14159 >>> str(f) '3.14159' >>> str(f, ".2") # calls f.__str__(".2") which can do whatever it wants '3.14' >>> str(f, "%.2") # the percent is ignored? '3.14' Thoughts? Eli Stevens Bio: Geek since I saw my first computer in 5th grade at school. Programmed (poorly) from middle school through high school. Graduated from Univ. of Missouri, Columbia with a bachelors in CS. C++ fan at the time. Worked a startup in the valley for 3 years, heavily in Java. Mildly disliked it (from the start), found python, loved it, got acquired by Cisco, but still doing Java. I cashed out. Now at Yahoo doing Python almost full time. Happy. No accepted patches to an open source project (yet). Prefer the MIT license for my code (assuming any of it gets to a point where I can release it :). Whew. ;) From jimjjewett at gmail.com Fri Sep 2 01:33:36 2005 From: jimjjewett at gmail.com (Jim Jewett) Date: Thu, 1 Sep 2005 19:33:36 -0400 Subject: [Python-Dev] String views Message-ID: Tim Delaney writes: > One of the big disadvantages of string views is that they need to keep > the original object around, no matter how big it is. But in the case of > partition, much of the time the original string survives for at least a > similar period to the partitions. Michael Chermside writes: > Didn't several of Raymond's examples use the idiom: > part_1, _, s = s.partition(first_sep) > part_2, _, s = s.partition(second_sep) > part_3, _, s = s.partition(third_sep) Yes, but in those cases, generally the entire original string was being kept by at least some part_#, so there really wasn't any wasted space. The problem only really shows up when a single 5-byte string keeps a 10K buffer alive. If it supports 2000 such strings, then everything is fine. Skip writes: > I'm skeptical about performance as well, but not for that reason. A string > object can have a referent field. If not NULL, it refers to another string > object which is INCREFed in the usual way. At string deallocation, if the > referent is not NULL, the referent is DECREFed. If the referent is NULL, > ob_sval is freed. Michael Chermside writes: > Won't work. A string may have multiple referrents, so a single referent > field isn't sufficient. I think you're looking at it backwards. A string would use a reference to a (series of characters) instead of ob_sval, just as dictionaries point to a table instead of small_table. The catch (as Tim mentioned) is that the underlying series of characters might be much larger than *this* string needs. If it isn't shared, then the extra is wasted. One way to deal with this might be have the strings clean up when they're called. If the string's length multiplied by the number of references to the buffer is much less than the size of the buffer, then the string should make its own small copy. Whether the complication is worth it, I don't know. -jJ From jimjjewett at gmail.com Fri Sep 2 01:44:02 2005 From: jimjjewett at gmail.com (Jim Jewett) Date: Thu, 1 Sep 2005 19:44:02 -0400 Subject: [Python-Dev] Python 3 design principles Message-ID: Nick Craig-Wood wrote: > If come python 3, there is a 99% accurate program which can turn your > python 2.x into python 3 code, then that would ease the transition > greatly. Guido wrote: > That might not be so easy given the desire to change most > list-returning functions and methods into iterator-returning ones. I assume part of the cleanup will include adding a choke point for import hooks. That way people could do the conversion on modules that they aren't sure about. There would be a performance penalty, but things would still work, and could be sped up as it was justified. > This means that *most* places where you use keys() your code will > still run, but *some* places you'll have to write list(d.keys()). How > is the translator going to know? So do it everywhere, in the auto-import. > Worse, there's a common idiom: > L = D.keys() > L.sort() > that should be replaced by > L = sorted(D) L = list(D.keys()) L = sorted(L) Not as efficient. Not as pretty. With work, even a mechanical importer could do better. But the old code would still run correctly. -jJ From jimjjewett at gmail.com Fri Sep 2 02:12:11 2005 From: jimjjewett at gmail.com (Jim Jewett) Date: Thu, 1 Sep 2005 20:12:11 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 Message-ID: Guido van Rossum suggested: > stream.write(a1, a2, ...) > stream.writeln(a1, a2, ...) > stream.writef(fmt, a1, a2, ...) > builtin functions write(), writeln(), writef() that call the > corresponding method on sys.stdout. These seem good, except that write typically matches read, and I'm not sure how well the equivalents would work. (They can be defined; they just feel a little too fragile, like C's input.) > Another real problem with print is that, while the > automatic insertion of spaces is nice for beginners, > it often gets in the way, and what you have to do to > avoid this is pretty nasty: either drop print altogether > in favor of sys.stdout.write(), or use string concatenation > or a format string, assuming you have all the pieces > available at the same time (which often you don't). I usually take "I need to get rid of spaces" as an indication that I care about exact (not just readable, but exact) formatting, and *should* use either write or a format string (possibly waiting to collect the data). Putting the spaces back in (without a format string) would be even worse. Charles Cazabon's pointed out that it *could* be as simple as writeln(' '.join( ... )) but if there isn't a builtin alias, people (at least those not intimidated by the magic required to get simple output) *will* do things at least as bad as writeln(a, " ", b, " ", c) or as bugprone as # oops, format string and debug vars got out of sync writef(" Current Vals:%s %d %d%s", curval, i, k, name, age) -jJ From rrr at ronadam.com Fri Sep 2 02:30:31 2005 From: rrr at ronadam.com (Ron Adam) Date: Thu, 01 Sep 2005 20:30:31 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: Message-ID: <43179D27.5080805@ronadam.com> Jim Jewett wrote: > >>Another real problem with print is that, while the >>automatic insertion of spaces is nice for beginners, >>it often gets in the way, and what you have to do to >>avoid this is pretty nasty: either drop print altogether >>in favor of sys.stdout.write(), or use string concatenation >>or a format string, assuming you have all the pieces >>available at the same time (which often you don't). > > I usually take "I need to get rid of spaces" as an indication > that I care about exact (not just readable, but exact) > formatting, and *should* use either write or a format string > (possibly waiting to collect the data). > > Putting the spaces back in (without a format string) would > be even worse. Charles Cazabon's pointed out that it *could* > be as simple as > > writeln(' '.join( ... )) Why not just offer an addition method ? examine(x,y,z) # print with spaces Or some other suitable name. Cheers, Ron From skip at pobox.com Fri Sep 2 04:57:29 2005 From: skip at pobox.com (skip@pobox.com) Date: Thu, 1 Sep 2005 21:57:29 -0500 Subject: [Python-Dev] String views (was: Re: Proof of the pudding:str.partition()) In-Reply-To: References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> Message-ID: <17175.49049.572339.647470@montanaro.dyndns.org> Fredrik> Python strings are character buffers with a known length, not Fredrik> null-terminated C strings. the CPython implementation Fredrik> guarantees that the character buffer has a trailing NULL Fredrik> character, but that's mostly to make it easy to pass Python Fredrik> strings directly to traditional C API:s. I'm obviously missing something that's been there all along. Since Python strings can contain NULs, why do we bother to NUL-terminate them? Clearly, any tradition C API that expects to operate on NUL-terminated strings would break with a string containing an embedded NUL. Skip From greg.ewing at canterbury.ac.nz Fri Sep 2 05:00:09 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 02 Sep 2005 15:00:09 +1200 Subject: [Python-Dev] Python 3 design principles In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> Message-ID: <4317C039.2090406@canterbury.ac.nz> Reinhold Birkenfeld wrote: > Greg Ewing wrote: > >>There's no way importing a module could add something that >>works like the old print statement, unless some serious >>magic is going on... > > You'd have to enclose print arguments in parentheses. Of course, the "trailing > comma" form would be lost. But you'd still have to rewrite old code to work with it, in which case you might as well change it to whatever the new way is in 3.0. -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From skip at pobox.com Fri Sep 2 05:00:51 2005 From: skip at pobox.com (skip@pobox.com) Date: Thu, 1 Sep 2005 22:00:51 -0500 Subject: [Python-Dev] Python 3 design principles In-Reply-To: <1125580308.10343.33.camel@geddy.wooz.org> References: <7168d65a050831132415118382@mail.gmail.com> <4316CFCF.4030605@gmail.com> <1125580308.10343.33.camel@geddy.wooz.org> Message-ID: <17175.49251.919806.636862@montanaro.dyndns.org> >> I still hope to see this change to "make print a builtin instead of a >> statement". I'd hate to lose the one-line hello world example due to >> cruft like "from sys import stdout". Barry> I agree. You can't get much simpler to explain or use than the Barry> current print statement. Then why remove it at all? Skip From fdrake at acm.org Fri Sep 2 05:09:38 2005 From: fdrake at acm.org (Fred L. Drake, Jr.) Date: Thu, 1 Sep 2005 23:09:38 -0400 Subject: [Python-Dev] Python 3 design principles In-Reply-To: <17175.49251.919806.636862@montanaro.dyndns.org> References: <7168d65a050831132415118382@mail.gmail.com> <1125580308.10343.33.camel@geddy.wooz.org> <17175.49251.919806.636862@montanaro.dyndns.org> Message-ID: <200509012309.38845.fdrake@acm.org> On Thursday 01 September 2005 23:00, skip at pobox.com wrote: > Then why remove it at all? Bingo. I don't see any need to remove it. I could live with removing the trailing-comma semi-wart, but there just isn't any need to remove it. -Fred -- Fred L. Drake, Jr. From skip at pobox.com Fri Sep 2 05:14:52 2005 From: skip at pobox.com (skip@pobox.com) Date: Thu, 1 Sep 2005 22:14:52 -0500 Subject: [Python-Dev] String views In-Reply-To: <20050901072058.f64zse0ondkcs08o@login.werra.lunarpages.com> References: <20050901072058.f64zse0ondkcs08o@login.werra.lunarpages.com> Message-ID: <17175.50092.92305.537909@montanaro.dyndns.org> >> I'm skeptical about performance as well, but not for that reason. A >> string object can have a referent field. If not NULL, it refers to >> another string object which is INCREFed in the usual way. At string >> deallocation, if the referent is not NULL, the referent is DECREFed. >> If the referent is NULL, ob_sval is freed. Michael> Won't work. A string may have multiple referrents, so a single Michael> referent field isn't sufficient. Hmmm... I implemented it last night (though it has yet to be tested). I suspect it will work. Here's my PyStringObject struct: typedef struct { PyObject_VAR_HEAD long ob_shash; int ob_sstate; PyObject *ob_referent; char *ob_sval; } PyStringObject; (minus the invariants which I have yet to check). Suppose url is a string object whose value is "http://www.python.org/", and that it has a reference count of 1 and isn't a view onto another string. Its ob_referent field would be NULL. (Maybe it would be better named "ob_target".) If we then execute before, sep, after = url.partition(":") upon return before, sep and after would be string objects whose ob_referent field refers to url and url's reference count would be 4. Their ob_sval fields would point to the start of their piece of url. When the reference counts of before, sep and after reach zero, they are reclaimed. Since they each have a non-NULL ob_referent field, the target object is DECREFed, but the ob_sval field is not freed. In the case of url, when its reference count reaches zero, since its ob_referent field is NULL, its ob_sval field is freed. The only tricky business was PyString_AsString. If the argument object is a view you have to "un-view" it by copying the interesting bits and DECREFing the ob_referent. This is because of the NUL termination guarantee. I wonder if the use of views would offset the overhead of returning to a double-malloc allocation. Skip From skip at pobox.com Fri Sep 2 05:20:04 2005 From: skip at pobox.com (skip@pobox.com) Date: Thu, 1 Sep 2005 22:20:04 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1125588305.22624.32.camel@geddy.wooz.org> References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> <1125588305.22624.32.camel@geddy.wooz.org> Message-ID: <17175.50404.856427.685737@montanaro.dyndns.org> >> And good riddance! The print statement harks back to ABC and even >> (unvisual) Basic. Out with it! Barry> I have to strongly disagree. The print statement is simple, easy Barry> to understand, and easy to use. I'm with Barry. Even for non-debug use the print statement is suitable for the majority of my output. Skip From greg.ewing at canterbury.ac.nz Fri Sep 2 05:49:55 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 02 Sep 2005 15:49:55 +1200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <43175E5F.6070808@hathawaymix.org> References: <000401c5af21$dc603000$4320c797@oemcomputer> <43175E5F.6070808@hathawaymix.org> Message-ID: <4317CBE3.60805@canterbury.ac.nz> Shane Hathaway wrote: > May I also suggest the following shortcut for creating and evaluating a > string template. (Ever since I thought of this, I've actually used this > in code without thinking... it's just too natural): > > message = $"Hello, $name!" As I recall, this has been considered before, and rejected on the grounds that it's too visually confusing having $ signs both inside and outside the quotes. -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From jcarlson at uci.edu Fri Sep 2 05:55:05 2005 From: jcarlson at uci.edu (Josiah Carlson) Date: Thu, 01 Sep 2005 20:55:05 -0700 Subject: [Python-Dev] String views In-Reply-To: <17175.50092.92305.537909@montanaro.dyndns.org> References: <20050901072058.f64zse0ondkcs08o@login.werra.lunarpages.com> <17175.50092.92305.537909@montanaro.dyndns.org> Message-ID: <20050901204905.8B29.JCARLSON@uci.edu> skip at pobox.com wrote: > >> I'm skeptical about performance as well, but not for that reason. A > >> string object can have a referent field. If not NULL, it refers to > >> another string object which is INCREFed in the usual way. At string > >> deallocation, if the referent is not NULL, the referent is DECREFed. > >> If the referent is NULL, ob_sval is freed. > > Michael> Won't work. A string may have multiple referrents, so a single > Michael> referent field isn't sufficient. > > Hmmm... I implemented it last night (though it has yet to be tested). I > suspect it will work. Here's my PyStringObject struct: *cough* buffers with string methods *cough* Seriously. I know people don't seem to like them much, but a buffer is a string view, an array view, an mmap view, ... It does /exactly/ what you suggest string views should do, and it's already in Python. With minor wrappers, one could use string methods almost directly, or with modification of string methods, buffers and strings could share methods. - Josiah From greg.ewing at canterbury.ac.nz Fri Sep 2 06:05:10 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 02 Sep 2005 16:05:10 +1200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> <1125588305.22624.32.camel@geddy.wooz.org> <79990c6b0509010903251033d5@mail.gmail.com> Message-ID: <4317CF76.3030308@canterbury.ac.nz> Steven Bethard wrote: > I think we *do* > need a statement or function of some sort that does the most basic > task: writing a line to sys.stdout that calls str() on each of the > elements and joins them with spaces. Hypertalk (the programming language of Apple's Hypercard) had an interesting way of doing this. There were two string concatenation operators: a regular one, and a "concatenate with a space between" operator. Using these, you could build up strings for output quite nicely. It helped somewhat that Hypertalk really only had strings as a data type. A Python version of this operator would need to be willing to convert either or both operands to strings. -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From steve at holdenweb.com Fri Sep 2 06:30:42 2005 From: steve at holdenweb.com (Steve Holden) Date: Thu, 01 Sep 2005 23:30:42 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> <1125588305.22624.32.camel@geddy.wooz.org> <79990c6b0509010903251033d5@mail.gmail.com> Message-ID: <4317D572.8000202@holdenweb.com> Steven Bethard wrote: > [Guido van Rossum] > >>And good riddance! The print statement harks back to ABC and even >>(unvisual) Basic. Out with it! > > > [Barry Warsaw] > >>I have to strongly disagree. The print statement is simple, easy to >>understand, and easy to use. > > > [Paul Moore] > >>I agree with Barry. In particular, the behaviour of adding spaces >>between items is something I find very useful, and it's missing from >>the functional forms. > ... as proposed, but ... > > While I agree that mostly the print statement is "simple, easy to > understand, and easy to use", I've seen the trailing-comma version > cause confusion for a lot of newbies. I wouldn't mind at all if the > trailing-comma version disappeared in Python 3.0 -- if you need this > kind of complicated output, you can always use sys.stdout.write and/or > string formatting. > ... the trailing-comma version is indeed BASIC voodoo of ancient heritage, and not something I'd personally miss. > The spaces-between-items point that Paul Moore makes is IMHO the best > argument against the proposed write*() functions. I think we *do* > need a statement or function of some sort that does the most basic > task: writing a line to sys.stdout that calls str() on each of the > elements and joins them with spaces. That is, I think we need to keep > *something* with functionality like: > > def XXX(*args): > sys.stdout.write('%s\n' % ' '.join(str(a) for a in args)) > > Note that this would keep the Hello World example simple: > > XXX(greeting, name) > Of course, for Python 3.0 if we lose the keyword there's nothing to stop us calling the convenience function "print". With the removal of the trailing-comma functionality we'd only have to add parentheses to 2.X print statements to have them work :-) Next question: could the function have a sensible return value, or is None the best possible result? hesitating-to-suggest-minus-one-ly y'rs - steve -- Steve Holden +44 150 684 7255 +1 800 494 3119 Holden Web LLC http://www.holdenweb.com/ From martin.blais at gmail.com Fri Sep 2 06:40:42 2005 From: martin.blais at gmail.com (Martin Blais) Date: Fri, 2 Sep 2005 00:40:42 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <-886891881552191728@unknownmsgid> References: <20050901204613.GC12384@discworld.dyndns.org> <-886891881552191728@unknownmsgid> Message-ID: <8393fff05090121405f9a237a@mail.gmail.com> On 9/1/05, Bill Janssen wrote: > > Providing you can live with adding a pair of parentheses to that, you can > > have: > > > > def print(*args): > > sys.stdout.write(' '.join(args) + '\n') > > > > I think the language would be cleaner if it lacked this weird exception for > > `print`. > > Charles, > > I agree that it would be cleaner. I just don't think cleanliness is > all that interesting -- usefulness trumps it every time. And if Talking about cleanliness, I'm not sure which is cleaner:: print >> sys.stderr, "This is a long sentence that I " \ "had to cut in two." print("This is a long sentence that I " "had to cut in two.", stream=sys.stderr) Sometimes I'll do this because I don't like the backslashes:: print >> sys.stderr, ("This is a long sentence that " "Had to cut in two.") Also, I find the ">>" syntax has always bothered me. I find it useful but so out-of-place in the language. +1 for removing the print statement. From steve at holdenweb.com Fri Sep 2 06:50:04 2005 From: steve at holdenweb.com (Steve Holden) Date: Thu, 01 Sep 2005 23:50:04 -0500 Subject: [Python-Dev] String views In-Reply-To: <43167BDB.6010002@canterbury.ac.nz> References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> <43167BDB.6010002@canterbury.ac.nz> Message-ID: Greg Ewing wrote: > skip at pobox.com wrote: > >>If I then wanted to see what scheme's value >>compared to, the string's comparison method would have to recognize that it >>wasn't truly NUL-terminated, copy it, call strncmp() or whatever underlying >>routine is used for string comparisons. > > > Python string comparisons can't be using anything that > relies on nul-termination, because Python strings can > contain embedded nuls. Possibly it uses memcmp(), but > that takes a length. > > You have a point when it comes to passing strings to > other C routines, though. For those that don't have a > variant which takes a maximum length, the substring type > might have to keep a cached nul-terminated copy created > on demand. Then the copying overhead would only be > incurred if you did happen to pass a substring to such > a routine. > Since Python strings *can* contain embedded NULs, doesn't that rather poo on the idea of passing pointers to their data to C functions as things stand? regards Steve -- Steve Holden +44 150 684 7255 +1 800 494 3119 Holden Web LLC http://www.holdenweb.com/ From stephen at xemacs.org Fri Sep 2 07:59:45 2005 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Fri, 02 Sep 2005 14:59:45 +0900 Subject: [Python-Dev] String views In-Reply-To: (Steve Holden's message of "Thu, 01 Sep 2005 23:50:04 -0500") References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> <43167BDB.6010002@canterbury.ac.nz> Message-ID: <87ek88ko1a.fsf@tleepslib.sk.tsukuba.ac.jp> >>>>> "Steve" == Steve Holden writes: Steve> Since Python strings *can* contain embedded NULs, doesn't Steve> that rather poo on the idea of passing pointers to their Steve> data to C functions as things stand? I think it's a "consenting adults" issue. Ie, C programmers always face the issue of "Do I dare strfry() this char[]?" I don't see what difference it makes that the C program in question is being linked with Python, or that the source of the data is a Python string. He's chosen to program in C, let him get on with it. -- School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN Ask not how you can "do" free software business; ask what your business can "do for" free software. From fredrik at pythonware.com Fri Sep 2 08:36:45 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Fri, 2 Sep 2005 08:36:45 +0200 Subject: [Python-Dev] String views (was: Re: Proof of the pudding:str.partition()) References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com><17174.22550.862457.829100@montanaro.dyndns.org> <17175.49049.572339.647470@montanaro.dyndns.org> Message-ID: skip at pobox.com wrote: > Fredrik> Python strings are character buffers with a known length, not > Fredrik> null-terminated C strings. the CPython implementation > Fredrik> guarantees that the character buffer has a trailing NULL > Fredrik> character, but that's mostly to make it easy to pass Python > Fredrik> strings directly to traditional C API:s. > > I'm obviously missing something that's been there all along. Since Python > strings can contain NULs, why do we bother to NUL-terminate them? Clearly, > any tradition C API that expects to operate on NUL-terminated strings would > break with a string containing an embedded NUL. sure, but that doesn't mean that such an API would break on a string that *doesn't* contain an embedded NUL. in practice, this is the difference between the "s" and "s#" argument specifiers; the former requires a NUL-free string, the latter can handle any byte string: >>> f = open("myfile\0") Traceback (most recent call last): File " ", line 1, in ? TypeError: file() argument 1 must be (encoded string without NULL bytes), not str >>> f = open("myfile") >>> f From paul at pfdubois.com Fri Sep 2 09:18:37 2005 From: paul at pfdubois.com (Paul F. Dubois) Date: Fri, 02 Sep 2005 00:18:37 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: Message-ID: <4317FCCD.80702@pfdubois.com> Remove the print statement....I laughed until my sides hurt. Hello? Try dating girls and talking to normal people, geek boys. We scientists still use these for debugging. We never 'move on' very far from the tutorial. The salient feature about print statements is that they live to be put in and commented out 10 minutes later, without some import being required or other enabling object being around. Easy things should be easy. Hard things should be possible. I don't believe the person who said the trailing comma case mixed up anybody, not for more than 10 seconds anyway. OK, now that I've offended everyone, I'll go back into retirement. But I *am* laughing at you. From fredrik at pythonware.com Fri Sep 2 10:07:29 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Fri, 2 Sep 2005 10:07:29 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <4317FCCD.80702@pfdubois.com> Message-ID: Paul F. Dubois wrote: > Remove the print statement....I laughed until my sides hurt. Hello? Try > dating girls and talking to normal people, geek boys. > > We scientists still use these for debugging. We never 'move on' very far > from the tutorial. The salient feature about print statements is that > they live to be put in and commented out 10 minutes later, without some > import being required or other enabling object being around. > > Easy things should be easy. Hard things should be possible. I don't > believe the person who said the trailing comma case mixed up anybody, > not for more than 10 seconds anyway. > > OK, now that I've offended everyone, I'll go back into retirement. But I > *am* laughing at you. Amen. From p.f.moore at gmail.com Fri Sep 2 10:18:10 2005 From: p.f.moore at gmail.com (Paul Moore) Date: Fri, 2 Sep 2005 09:18:10 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <43179D27.5080805@ronadam.com> References: <43179D27.5080805@ronadam.com> Message-ID: <79990c6b05090201181f556cf1@mail.gmail.com> On 9/2/05, Ron Adam wrote: > Jim Jewett wrote: > > Putting the spaces back in (without a format string) would > > be even worse. Charles Cazabon's pointed out that it *could* > > be as simple as > > > > writeln(' '.join( ... )) > > Why not just offer an addition method ? > > examine(x,y,z) # print with spaces Because we're now up to *four* stream methods, plus the same number of builtins, to do what one statement currently does? (BTW, the ' '.join() idiom has a minor disadvantage in that it *builds* the output string, whereas print doesn't. Not a major issue, given the typical sizes of strings to be output, but it's another cost nevertheless...) Paul. From bronger at physik.rwth-aachen.de Fri Sep 2 10:58:15 2005 From: bronger at physik.rwth-aachen.de (Torsten Bronger) Date: Fri, 02 Sep 2005 10:58:15 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <4317FCCD.80702@pfdubois.com> Message-ID: <87slwnyhg8.fsf@wilson.rwth-aachen.de> Hall?chen! "Paul F. Dubois" writes: > [...] > > We scientists still use these for debugging. We never 'move on' > very far from the tutorial. The salient feature about print > statements is that they live to be put in and commented out 10 > minutes later, without some import being required or other > enabling object being around. Being a natural scientist myself, I plan to use Python for such purposes, too, and surely print will be part of it. I also agree that at least for the not professionally trained programmer, print is a very handy debugging helper. However, an even more important kind of Python programs are the utilities one creates for making life easier. They are usually short and simple with respect to their I/O. I really love the print statement with its comma notation here. Typically it's used frequently in my programs and produces lucid lines of code. Additionally, print is positive for Python advocacy in my opinion. It strengthens the beginner's impression that Python has a gentle syntax. (Again, I may speak for the non-CS folks.) I think that print's purpose is important enough for Python's target group that it deserves to remain as it is. Tsch?, Torsten. -- Torsten Bronger, aquisgrana, europa vetus ICQ 264-296-646 From k33rni at gmail.com Fri Sep 2 11:25:41 2005 From: k33rni at gmail.com (Krzysztof Zych) Date: Fri, 2 Sep 2005 11:25:41 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> Message-ID: <7d8f053305090202252ea42cf0@mail.gmail.com> On 01/09/05, Guido van Rossum wrote: > And good riddance! The print statement harks back to ABC and even > (unvisual) Basic. Out with it! I disagree strongly. I can't count the number of times I've been p*ssed having to write something like System.out.println("point(" + this.x + "," + this.y +")") in Java. (Strangely though, I don't object to having printf() in C, but I know it doesn't work any other way). This is what I liked about Python, it offered a no-frills way to get the job done (TSOOWTDI and the like). I agree it's mostly used for debugging purposes, to do quick-and dirty calculations, etc. Nothing can beat it. We don't want the language to be pure, we want it to be useful. Isn't "Practicality beats purity" in the Zen of Python? Last time I checked (2.4.1) it was there, and updating Zen isn't in PEP 3000 ;) -1 on removal of print. From mozbugbox at yahoo.com.au Fri Sep 2 11:40:09 2005 From: mozbugbox at yahoo.com.au (JustFillBug) Date: Fri, 2 Sep 2005 09:40:09 +0000 (UTC) Subject: [Python-Dev] Python 3 design principles References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> <43176C03.7090904@ronadam.com> Message-ID: On 2005-09-01, Ron Adam wrote: > As for functions without '()'s. (Just a thought) You could use '<<' or > '<<<' (or other symbol) as a way to move data between objects. > > ui.write <<< 'Hello World/n' # ui.write('Hello World/n') > > ui.writeln <<< counter # ui.writeln(counter.next()) > > ok = ui.input <<< 'press a key:' # ok = ui.input('press a key:') > > The requirement could be that the item on the left is a callable, and > the item on the right is a sequence or generator. > Please don't abuse symbols. Perl's ways of symbols all the way without intuitive meaning is bad. Use descriptive methods and functions please. From T.A.Meyer at massey.ac.nz Fri Sep 2 04:54:19 2005 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Fri, 2 Sep 2005 14:54:19 +1200 Subject: [Python-Dev] Replacement for print in Python 3.0 Message-ID: [Guido] > The print statement harks back to ABC and even > (unvisual) Basic. Out with it! [Barry] > I have to strongly disagree. As would I. From observing recent discussions here, it would be helpful if everyone else that agrees could come up with a list (a wiki page on python.org, perhaps?) of simple, to-the-point, reasons why losing print is a bad idea. Once Guido sees the huge list of reasons in favour of keeping it, versus the one or two reasons against it (and ruminates on it while 2.5 through 2.9 are released) I'm sure he'll see reason. FWIW, I wouldn't really care if >> or the trailing comma was lost. [Barry] > The print statement is simple, easy to understand, and > easy to use. For use cases like debugging or the interactive > interpreter [...] I think it's hard to beat the useability > of print with a write() function, even if builtin. ISTM that Barry nails the key reasons here. One of the real strengths of Python is that it can be used in a wide range of applications, many of which don't need to be burdened with a complex logging strategy, don't have a GUI, aren't inside a web browser, and so on. "print" is the best example I can think of for "practicality beats purity". Writing to stdout is as common in the code I write as loops - it's worth keeping such basic functionality as elegant, simple, easy to understand, and easy to use as possible. (This is certainly my motiviation, not any concern about backwards compatibility). With standard English keyboards, at least, the '(' and ')' keys are also inconvenient to type, compared to lower-case English characters. Fundamental actions like writing to stdout deserve simplicity. =Tony.Meyer From hoffman at ebi.ac.uk Fri Sep 2 12:02:52 2005 From: hoffman at ebi.ac.uk (Michael Hoffman) Date: Fri, 2 Sep 2005 11:02:52 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <20050901212739.GG6140@performancedrivers.com> References: <20050901204252.GB12384@discworld.dyndns.org> <20050901212739.GG6140@performancedrivers.com> Message-ID: On Thu, 1 Sep 2005, Jack Diederich wrote: > On Thu, Sep 01, 2005 at 11:12:57PM +0200, Fredrik Lundh wrote: >> yeah, real programmers don't generate output. >> > I'd say: > yeah, real programmers don't generate output _to stdout_ > > sockets, GUI widgets, buffers? sure. stdout? Almost never. Almost every program I write produces its output mainly to stdout. And I probably use print half the time to produce this output (the rest is done mostly with csv). GUI widgets? Who needs 'em? -- Michael Hoffman European Bioinformatics Institute From gmccaughan at synaptics-uk.com Fri Sep 2 12:40:35 2005 From: gmccaughan at synaptics-uk.com (Gareth McCaughan) Date: Fri, 2 Sep 2005 11:40:35 +0100 Subject: [Python-Dev] Revising RE docs In-Reply-To: References: <20050830143542.niq7a9s8bsrkc8ok@login.werra.lunarpages.com> <87hdd5o5y1.fsf@tleepslib.sk.tsukuba.ac.jp> Message-ID: <200509021140.36101.gmccaughan@synaptics-uk.com> On Thursday 2005-09-01 18:09, Guido van Rossum wrote: > They *are* cached and there is no cost to using the functions instead > of the methods unless you have so many regexps in your program that > the cache is cleared (the limit is 100). Sure there is; the cost of looking them up in the cache. >>> import re,timeit >>> timeit.re=re >>> timeit.Timer("""re.search(r"(\d*).*(\d*)", "abc123def456")""").timeit(1000000) 7.6042091846466064 >>> timeit.r = re.compile(r"(\d*).*(\d*)") >>> timeit.Timer("""r.search("abc123def456")""").timeit(1000000) 2.6358869075775146 >>> timeit.Timer().timeit(1000000) 0.091850996017456055 So in this (highly artificial toy) application it's about 7.5/2.5 = 3 times faster to use the methods instead of the functions. -- g From gmccaughan at synaptics-uk.com Fri Sep 2 13:14:11 2005 From: gmccaughan at synaptics-uk.com (Gareth McCaughan) Date: Fri, 2 Sep 2005 12:14:11 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <4317FCCD.80702@pfdubois.com> References: <4317FCCD.80702@pfdubois.com> Message-ID: <200509021214.12531.gmccaughan@synaptics-uk.com> > We scientists still use these for debugging. We never 'move on' very far > from the tutorial. The salient feature about print statements is that > they live to be put in and commented out 10 minutes later, without some > import being required or other enabling object being around. > > Easy things should be easy. Hard things should be possible. I don't > believe the person who said the trailing comma case mixed up anybody, > not for more than 10 seconds anyway. Damn right. No, I mean: damn "write" :-). I've used Python for teaching beginner programmers, for quick-hack scripts, for interactive diddling about, for scientific computation, for algorithmic experimentation, for GUI applications. I'd appreciably miss "print" for *all* of these, even the last. (My GUI applications sometimes have bugs. How about yours?) So far as I can see, two arguments against "print" have been proposed. 1. It has some ugly features, like the trailing-comma hack. 2. It's a statement that does something "ordinary" and could be replaced by a function. Against which, we have 3. It's convenient for debugging, interactive use, simple scripts, and various other things. 4. It's beginner-friendly. Now, I'm sure I remember hearing something that was relevant to this. "Pragmatism beats purification"? No, that's not quite it. "Practice beats perfection?" No. Ah yes, I remember: "Practicality beats purity". But, of course, that wasn't talking about Python 3000. :-) -- g From paolo_veronelli at libero.it Fri Sep 2 14:16:42 2005 From: paolo_veronelli at libero.it (Paolino) Date: Fri, 02 Sep 2005 14:16:42 +0200 Subject: [Python-Dev] itertools.chain should take an iterable ? In-Reply-To: <20050901173518.GE6140@performancedrivers.com> References: <43174150.5080002@libero.it> <20050901173518.GE6140@performancedrivers.com> Message-ID: <431842AA.2050405@libero.it> Jack Diederich wrote: > On Thu, Sep 01, 2005 at 07:58:40PM +0200, Paolino wrote: > >>Working on a tree library I've found myself writing >>itertools.chain(*[child.method() for child in self]). >>Well this happened after I tried instinctively >>itertools.chain(child.method() for child in self). >> >>Is there a reason for this signature ? > > > This is more suited to comp.lang.python > Why ? I'm not asking for help ,I'm asking why itertools library is implemented like that and if it is possible to clean it. > Consider the below examples (and remember that strings are iterable) > > >>>>import itertools as it >>>>list(it.chain('ABC', 'XYZ')) > > ['A', 'B', 'C', 'X', 'Y', 'Z'] > >>>>list(it.chain(['ABC', 'XYZ'])) > > ['ABC', 'XYZ'] > >>>>list(it.chain(['ABC'], ['XYZ'])) > > ['ABC', 'XYZ'] > What if I want to chain an infinite list of iterables? Shouldn't itertools.chain be built to handle that? I don't think it is a problem to accept only the second case you paste and produce TypeError on the others. Hope this explains and to get other reasons. Regards Paolino > > Hope that helps, > > -jackdied > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/paolo_veronelli%40libero.it > From barry at python.org Fri Sep 2 13:50:48 2005 From: barry at python.org (Barry Warsaw) Date: Fri, 02 Sep 2005 07:50:48 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <7C58F4DB-E15B-497A-A4F5-563B5CF566FF@redivi.com> References: <20050901204252.GB12384@discworld.dyndns.org> <20050901212739.GG6140@performancedrivers.com> <7C58F4DB-E15B-497A-A4F5-563B5CF566FF@redivi.com> Message-ID: <1125661848.12804.16.camel@geddy.wooz.org> On Thu, 2005-09-01 at 17:49, Bob Ippolito wrote: > That is absolutely true, print is becoming less and less useful in > the context of GUI or web applications. I know we're dinosaurs, but some of us still write console apps in Python! > Even in Just Debugging > scenarios, you're probably better off using something with more > flexibility, such as the logging module. The logging module is great, but logging and debugging are two different things (although that fact is obscured when you don't have a console). print is useful in scenarios other than debugging. And while I do occasionally use it, I wouldn't be too heartbroken if the trailing comma form were lost. I /would/ mourn the loss of print>> though -- not necessarily the syntax, which was clearly a compromise, but the functionality. If we could have spelled it "print to sys.stderr" we would have. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050902/9fda4a2f/attachment.pgp From barry at python.org Fri Sep 2 13:59:48 2005 From: barry at python.org (Barry Warsaw) Date: Fri, 02 Sep 2005 07:59:48 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <-2729100304010349131@unknownmsgid> Message-ID: <1125662388.12802.24.camel@geddy.wooz.org> On Thu, 2005-09-01 at 16:07, Guido van Rossum wrote: > Another real problem with print is that, while the automatic insertion > of spaces is nice for beginners, it often gets in the way, OTOH, print's automatic space insertion is often the reason why I'll reach for it instead of stream.write(). Maybe we should be thinking of this differently. What on the surface appears to be many varieties of one use case, screaming out for TOOWTDI+options is really (at least) two use cases urging us to different solutions appropriate for the problem. I have no qualms with adding writeln() or writefmt() or whatever -- those seem like useful additions I'm sure I'd use. But I don't think that therefore (or under the principles of TOOWTDI or cleanliness) demands the removal of print. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050902/7cc92195/attachment.pgp From barry at python.org Fri Sep 2 14:03:22 2005 From: barry at python.org (Barry Warsaw) Date: Fri, 02 Sep 2005 08:03:22 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <8393fff05090121405f9a237a@mail.gmail.com> References: <20050901204613.GC12384@discworld.dyndns.org> <-886891881552191728@unknownmsgid> <8393fff05090121405f9a237a@mail.gmail.com> Message-ID: <1125662602.12805.29.camel@geddy.wooz.org> On Fri, 2005-09-02 at 00:40, Martin Blais wrote: > Talking about cleanliness, I'm not sure which is cleaner:: > > print >> sys.stderr, "This is a long sentence that I " \ > "had to cut in two." > > print("This is a long sentence that I " > "had to cut in two.", stream=sys.stderr) > > Sometimes I'll do this because I don't like the backslashes:: > > print >> sys.stderr, ("This is a long sentence that " > "Had to cut in two.") Or maybe print >> sys.stderr, "\ This is a long sentence that I didn't have to cut in two." A bit yucky, but easily extended to TQS when your message gets longer and longer. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050902/094d121d/attachment.pgp From nyamatongwe at gmail.com Fri Sep 2 14:06:17 2005 From: nyamatongwe at gmail.com (Neil Hodgson) Date: Fri, 2 Sep 2005 22:06:17 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <200509021214.12531.gmccaughan@synaptics-uk.com> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> Message-ID: <50862ebd050902050650c0a83@mail.gmail.com> Gareth McCaughan: > 3. It's convenient for debugging, interactive use, simple scripts, > and various other things. Interactive use is its own mode and works differently to the base language. To print the value of something, just type an expression. Python will evaluate and print the value of the expression. Much easier than adding 'print '. Extended interactive modes like ipython include other conveniences that don't belong in the python language. The problem with print is it becomes a barrier to extending a script into something more ambitious. This then leads to ugly 'features' like '>>' and trailing commas. By all means provide a simple syntax for i/o with the standard streams but ensure it is something that is a firm basis for extension. Neil From barry at python.org Fri Sep 2 13:44:19 2005 From: barry at python.org (Barry Warsaw) Date: Fri, 02 Sep 2005 07:44:19 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <4317FCCD.80702@pfdubois.com> References: <4317FCCD.80702@pfdubois.com> Message-ID: <1125661459.12804.8.camel@geddy.wooz.org> On Fri, 2005-09-02 at 03:18, Paul F. Dubois wrote: > Remove the print statement....I laughed until my sides hurt. Hello? Try > dating girls and talking to normal people, geek boys. > > We scientists still use these for debugging. We never 'move on' very far > from the tutorial. The salient feature about print statements is that > they live to be put in and commented out 10 minutes later, without some > import being required or other enabling object being around. > > Easy things should be easy. Hard things should be possible. I don't > believe the person who said the trailing comma case mixed up anybody, > not for more than 10 seconds anyway. > > OK, now that I've offended everyone, I'll go back into retirement. But I > *am* laughing at you. Thank you Paul! Don't stay retired for long. :) -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050902/f33b7b98/attachment.pgp From raymond.hettinger at verizon.net Fri Sep 2 14:12:41 2005 From: raymond.hettinger at verizon.net (Raymond Hettinger) Date: Fri, 02 Sep 2005 08:12:41 -0400 Subject: [Python-Dev] itertools.chain should take an iterable ? In-Reply-To: <431842AA.2050405@libero.it> Message-ID: <002501c5afb7$9ec65080$9207a044@oemcomputer> [Paolino] > >>Well this happened after I tried instinctively > >>itertools.chain(child.method() for child in self). As Jack's note points out, your proposed signature is incompatible with the one we have now. I recommend creating your own version: def paolino_chain(iterables): for it in iterables: for element in it: yield element >>> list(chain(c+c for c in string.ascii_uppercase)) ['A', 'A', 'B', 'B', 'C', 'C', 'D', 'D', 'E', 'E', 'F', 'F', 'G', 'G', 'H', 'H', 'I', 'I', 'J', 'J', 'K', 'K', 'L', 'L', 'M', 'M', 'N', 'N', 'O', 'O', 'P', 'P', 'Q', 'Q', 'R', 'R', 'S', 'S', 'T', 'T', 'U', 'U', 'V', 'V', 'W', 'W', 'X', 'X', 'Y', 'Y', 'Z', 'Z'] > >>Is there a reason for this signature ? It was handy for the use cases I had in mind when creating the function. Also it was styled after a version in another language where it had proven successful. > > This is more suited to comp.lang.python > > > Why ? I'm not asking for help ,I'm asking why itertools library is > implemented like that and if it is possible to clean it. The newsgroup would have guided you to the solution listed above. If you want to request a new feature, please use SourceForge. Raymond From amk at amk.ca Fri Sep 2 14:50:06 2005 From: amk at amk.ca (A.M. Kuchling) Date: Fri, 2 Sep 2005 08:50:06 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <4317FCCD.80702@pfdubois.com> Message-ID: <20050902125006.GB5962@rogue.amk.ca> On Fri, Sep 02, 2005 at 10:07:29AM +0200, Fredrik Lundh wrote: > > OK, now that I've offended everyone, I'll go back into retirement. But I > > *am* laughing at you. > > Amen. Seconded. --amk From tzot at mediconsa.com Fri Sep 2 14:45:42 2005 From: tzot at mediconsa.com (Christos Georgiou) Date: Fri, 2 Sep 2005 15:45:42 +0300 Subject: [Python-Dev] itertools.chain should take an iterable ? References: <43174150.5080002@libero.it><20050901173518.GE6140@performancedrivers.com> <431842AA.2050405@libero.it> Message-ID: "Paolino" wrote in message news:431842AA.2050405 at libero.it... > What if I want to chain an infinite list of iterables? > Shouldn't itertools.chain be built to handle that? Raymond already suggested a four-line function that does exactly that. Create your own personal-library modules containing the functions you find useful as building blocks, and when you have a large sw base using them, present your building blocks along with their use cases as arguments for inclusion in the standard library. > I don't think it is a problem to accept only the second case you paste > and produce TypeError on the others. It would break compatibility with the current uses of itertools.chain . I like it (and have used it) as it is. From fredrik at pythonware.com Fri Sep 2 14:55:56 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Fri, 2 Sep 2005 14:55:56 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <4317FCCD.80702@pfdubois.com><200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> Message-ID: Neil Hodgson wrote: > Interactive use is its own mode and works differently to the base > language. To print the value of something, just type an expression. > Python will evaluate and print the value of the expression. Much > easier than adding 'print '. print and "echo" prints different things, for many values of "something". From skip at pobox.com Fri Sep 2 15:11:47 2005 From: skip at pobox.com (skip@pobox.com) Date: Fri, 2 Sep 2005 08:11:47 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <50862ebd050902050650c0a83@mail.gmail.com> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> Message-ID: <17176.20371.368005.307905@montanaro.dyndns.org> Neil> The problem with print is it becomes a barrier to extending a Neil> script into something more ambitious. This then leads to ugly Neil> 'features' like '>>' and trailing commas. By all means provide a Neil> simple syntax for i/o with the standard streams but ensure it is Neil> something that is a firm basis for extension. I don't find either the trailing comma or >> redirection ugly. If I have a long print line that's hard to read because it extends past column 80 (the print statement, not the output), it's easy to hit NL at an intermediate comma, then just type "print ", perhaps followed by another output redirector. The two print statements' output still falls on a single line. The trailing comma on the previous line gives me a space between the two output chunks. Skip From steven.bethard at gmail.com Fri Sep 2 16:04:07 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Fri, 2 Sep 2005 08:04:07 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <17176.20371.368005.307905@montanaro.dyndns.org> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> Message-ID: skip at pobox.com wrote: > I don't find either the trailing comma or >> redirection ugly. If I have a > long print line that's hard to read because it extends past column 80 (the > print statement, not the output), it's easy to hit NL at an intermediate > comma, then just type "print ", perhaps followed by another output > redirector. The two print statements' output still falls on a single > line. The trailing comma on the previous line gives me a space between the > two output chunks. But that would be just as easy with a print() function. In the current syntax: print 'foo:', foo, 'bar:', bar, 'baz:', baz, print 'frobble', frobble In my proposed function: print('foo:', foo, 'bar:', bar, 'baz:', baz, 'frobble', frobble) To my (admittedly biased) eyes, the second version more obviously prints to a single line. STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From fredrik at pythonware.com Fri Sep 2 16:11:20 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Fri, 2 Sep 2005 16:11:20 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <4317FCCD.80702@pfdubois.com><200509021214.12531.gmccaughan@synaptics-uk.com><50862ebd050902050650c0a83@mail.gmail.com><17176.20371.368005.307905@montanaro.dyndns.org> Message-ID: Steven Bethard wrote: > But that would be just as easy with a print() function. In the current syntax: > > print 'foo:', foo, 'bar:', bar, 'baz:', baz, > print 'frobble', frobble > > In my proposed function: > > print('foo:', foo, 'bar:', bar, 'baz:', baz, > 'frobble', frobble) > > To my (admittedly biased) eyes, the second version more obviously > prints to a single line. next use case: print 'foo:', foo, 'bar:', bar, 'baz:', baz, if frobble > 0: print 'frobble', frobble else: print 'no frobble today' From python at discworld.dyndns.org Fri Sep 2 16:20:44 2005 From: python at discworld.dyndns.org (Charles Cazabon) Date: Fri, 2 Sep 2005 08:20:44 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: Message-ID: <20050902142044.GA18622@discworld.dyndns.org> Fredrik Lundh wrote: > > > > print('foo:', foo, 'bar:', bar, 'baz:', baz, > > 'frobble', frobble) > > > > To my (admittedly biased) eyes, the second version more obviously > > prints to a single line. > > next use case: > > print 'foo:', foo, 'bar:', bar, 'baz:', baz, > if frobble > 0: > print 'frobble', frobble > else: > print 'no frobble today' The need to print /and/ not add a newline isn't nearly as common. print() could take a keyword parameter to skip the newline, or ... print('foo:', foo, 'bar:', bar, 'baz:', baz, frobble and 'frobble: ' + frobble or 'no frobble today') Or the user can just use stdout.write and have full control. Charles -- ----------------------------------------------------------------------- Charles Cazabon GPL'ed software available at: http://pyropus.ca/software/ ----------------------------------------------------------------------- From steven.bethard at gmail.com Fri Sep 2 16:46:58 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Fri, 2 Sep 2005 08:46:58 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <20050902142044.GA18622@discworld.dyndns.org> References: <20050902142044.GA18622@discworld.dyndns.org> Message-ID: Charles Cazabon wrote: > Fredrik Lundh wrote: > > next use case: > > > > print 'foo:', foo, 'bar:', bar, 'baz:', baz, > > if frobble > 0: > > print 'frobble', frobble > > else: > > print 'no frobble today' > > The need to print /and/ not add a newline isn't nearly as common. print() > could take a keyword parameter to skip the newline, or ... > > print('foo:', foo, 'bar:', bar, 'baz:', baz, > frobble and 'frobble: ' + frobble or 'no frobble today') > > Or the user can just use stdout.write and have full control. Or you can easily refactor your code to do the print in one line: if frobble > 0: frobble_str = 'frobble: ' + frobble else: frobble_str = 'no frobble today' print('foo:', foo, 'bar:', bar, 'baz:', baz, frobble_str) or similarly: if frobble > 0: rest = ['frobble', frobble] else: rest = ['no frobble today'] print('foo:', foo, 'bar:', bar, 'baz:', baz, *rest) I don't know which refactoring you'd prefer, but there are at least a few options here. In the first one you have to be careful to add the extra space yourself. In the second one, you have to know how *args work. But I would claim that the extra mental burden of manually adding a space or understanding *args is about equivalent to the current mental burden of print's trailing-comma behavior. I also find it more obvious in both refactored examples that the print produces exactly one line. Of course, there are examples that don't refactor so easily. Here's one: for i, obj in enumerate(objs): # do stuff print i, obj, # do more stuff print If the "do stuff" and "do more stuff" sections are empty, you can write it as something like: print(*[item for tup in enumerate(objs) for item in tup]) But it's clearly not as beginner-friendly, requiring knowledge of *args and list comprehensions. OTOH, I'd claim that if you need such exacting format, you're not doing beginner stuff anyway. But YMMV. STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From gmccaughan at synaptics-uk.com Fri Sep 2 16:52:20 2005 From: gmccaughan at synaptics-uk.com (Gareth McCaughan) Date: Fri, 2 Sep 2005 15:52:20 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <50862ebd050902050650c0a83@mail.gmail.com> References: <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> Message-ID: <200509021552.21149.gmccaughan@synaptics-uk.com> > > 3. It's convenient for debugging, interactive use, simple scripts, > > and various other things. > > Interactive use is its own mode and works differently to the base > language. To print the value of something, just type an expression. Doesn't do the same thing. > The problem with print is it becomes a barrier to extending a > script into something more ambitious. This then leads to ugly > 'features' like '>>' and trailing commas. By all means provide a > simple syntax for i/o with the standard streams but ensure it is > something that is a firm basis for extension. Do you have any suggestion that's as practically usable as "print"? -- g From skip at pobox.com Fri Sep 2 16:53:31 2005 From: skip at pobox.com (skip@pobox.com) Date: Fri, 2 Sep 2005 09:53:31 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> Message-ID: <17176.26475.644454.492490@montanaro.dyndns.org> Steven> print 'foo:', foo, 'bar:', bar, 'baz:', baz, Steven> print 'frobble', frobble Steven> In my proposed function: Steven> print('foo:', foo, 'bar:', bar, 'baz:', baz, Steven> 'frobble', frobble) Steven> To my (admittedly biased) eyes, the second version more Steven> obviously prints to a single line. Yes, you're right. My bad. So, is the proposal that you would need an explicit "\n" to terminate the output or not? Skip From skip at pobox.com Fri Sep 2 16:59:28 2005 From: skip at pobox.com (skip@pobox.com) Date: Fri, 2 Sep 2005 09:59:28 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <20050902142044.GA18622@discworld.dyndns.org> References: <20050902142044.GA18622@discworld.dyndns.org> Message-ID: <17176.26832.44077.299214@montanaro.dyndns.org> Charles> Or the user can just use stdout.write and have full control. Don't forget that those of us who are arguing in favor of keeping print are fully aware of stream.write's existence. It's just that in the common case the print statement is more convenient. Maybe a print builtin wouldn't kill me. In that case I'd want both output redirection and newline suppression though. I guess you'd have to use a keyword arg to specify an alternate stream. Perhaps if the last non-keyword argument was exactly one space, the newline could be suppressed, e.g.: print("foo", "bar", "baz", " ", stream=sys.stderr) That seems a bit like magic, but probably no less magic than the current trailing comma. Skip From steven.bethard at gmail.com Fri Sep 2 17:00:12 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Fri, 2 Sep 2005 09:00:12 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <17176.26475.644454.492490@montanaro.dyndns.org> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> Message-ID: On 9/2/05, skip at pobox.com wrote: > > Steven> print 'foo:', foo, 'bar:', bar, 'baz:', baz, > Steven> print 'frobble', frobble > > Steven> In my proposed function: > > Steven> print('foo:', foo, 'bar:', bar, 'baz:', baz, > Steven> 'frobble', frobble) > > Steven> To my (admittedly biased) eyes, the second version more > Steven> obviously prints to a single line. > > Yes, you're right. My bad. > > So, is the proposal that you would need an explicit "\n" to terminate the > output or not? Well, my proposal (which differs from Guidos) is that the print function (or whatever it ends up getting called) would have the semantics: def print(*args): sys.stdout.write(' '.join(str(arg) for arg in args)) sys.stdout.write('\n') STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From skip at pobox.com Fri Sep 2 17:05:49 2005 From: skip at pobox.com (skip@pobox.com) Date: Fri, 2 Sep 2005 10:05:49 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <17176.26832.44077.299214@montanaro.dyndns.org> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> Message-ID: <17176.27213.972940.773135@montanaro.dyndns.org> skip> print("foo", "bar", "baz", " ", stream=sys.stderr) skip> That seems a bit like magic, but probably no less magic than the skip> current trailing comma. Make that no *more* magic ... Skip From steven.bethard at gmail.com Fri Sep 2 17:12:07 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Fri, 2 Sep 2005 09:12:07 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <17176.26832.44077.299214@montanaro.dyndns.org> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> Message-ID: skip at pobox.com wrote: > the print statement is more convenient. Maybe a print builtin wouldn't kill > me. In that case I'd want both output redirection and newline suppression > though. I guess you'd have to use a keyword arg to specify an alternate > stream. Perhaps if the last non-keyword argument was exactly one space, the > newline could be suppressed, e.g.: > > print("foo", "bar", "baz", " ", stream=sys.stderr) I think, instead, the stream API should grow a "print" method (or whatever it ends up getting called). The example would then look like: sys.stderr.print("foo", "bar", "baz", " ") It would probably be nice to provide a FileMixin object too. (Actually, this would be nice now, so that if I implement read(), I don't have to implement readline(), readlines(), etc.) The FileMixin object would make it easy for user-defined file-like objects to also support the print() method: class FileMixin(object): """Adds the file methods. Requires: read() write() Adds: __iter__() next() readline() readlines() writelines() print() # or whatever it gets called """ ... STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From steven.bethard at gmail.com Fri Sep 2 17:18:00 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Fri, 2 Sep 2005 09:18:00 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <17176.26832.44077.299214@montanaro.dyndns.org> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> Message-ID: skip at pobox.com wrote: > Perhaps if the last non-keyword argument was exactly one space, the > newline could be suppressed, e.g.: > > print("foo", "bar", "baz", " ", stream=sys.stderr) Sorry, I missed the newline-suppression idea in my first reply. I think the rule above is too confusing. I'm also still not convinced that the print function needs to support newline-suppression. Since the print function seems to be intended mainly for newbies and simple debugging, I'm having trouble coming up with examples where this is really necessary. I'd like to see a few examples where it's crucial that the final newline is suppressed. If it *has* to be supported, I'd add it as a keyword argument, so that your example above reads like: sys.stderr.print("foo", "bar", "baz", newline=False) I guess that's not too bad actually. Kinda nice that it has to be the last thing in the function... STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From tim.peters at gmail.com Fri Sep 2 17:20:57 2005 From: tim.peters at gmail.com (Tim Peters) Date: Fri, 2 Sep 2005 11:20:57 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <4317FCCD.80702@pfdubois.com> References: <4317FCCD.80702@pfdubois.com> Message-ID: <1f7befae05090208207fb8decf@mail.gmail.com> [Paul F. Dubois] > Remove the print statement....I laughed until my sides hurt. Hello? Try > dating girls and talking to normal people, geek boys. I tried talking to both, and in this case all said "What's a 'print statement'? You mean like a bank statement -- or what?" ;-) > We scientists still use these for debugging. We never 'move on' very far > from the tutorial. The salient feature about print statements is that > they live to be put in and commented out 10 minutes later, without some > import being required or other enabling object being around. In fairness, Guido suggested adding builtin functions as replacements, so in his view you still wouldn't need to import anything. OTOH, I'd keep print, but (a) remove the inscrutable softspace gimmick, so that a comma always meant "one space"; and, (b) add even more special sytnax, so there was also an easy way to separate print items without forcing a space between them in the output. > Easy things should be easy. Hard things should be possible. I don't > believe the person who said the trailing comma case mixed up anybody, > not for more than 10 seconds anyway. Indeed, you can't even start to spell "practicality beats purity" without first duplicating the first two letters of "print" . > OK, now that I've offended everyone, I'll go back into retirement. But I > *am* laughing at you. Providing entertainment for retirees is one of the PSF's missions. I wonder whether we could get AARP to kick back $10 to the PSF for each of their members? For 350 million dollars a year, I'll be happy to maintain a parallel P3K with a "print" statement until I die. From p.f.moore at gmail.com Fri Sep 2 17:36:39 2005 From: p.f.moore at gmail.com (Paul Moore) Date: Fri, 2 Sep 2005 16:36:39 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> Message-ID: <79990c6b05090208367372f705@mail.gmail.com> On 9/2/05, Steven Bethard wrote: [...] > Since > the print function seems to be intended mainly for newbies and simple > debugging, I think there have been quite a few comments here from people who *don't* see the print statement [1] as "mainly for newbies and simple debugging". But just to be absolutely clear, I find the print statement useful in general code - not just debugging code, not trivial scripts. I don't consider myself a "newbie". I hesitate to speak for others, but I believe that I'm not the only one in this situation. [1] Sorry, you said "print function". But you seem to be aiming the function as able to address everyone's concerns over features they like in the print *statement* so I think my point stands. Oh, yes - I also know about the "warts" in print (the >> syntax, and the trailing comma). They don't bother me - I use them on occasion and find them useful, rather than annoying. Yes, I know about stream.write and its variations. I know I can write a print function which adds spaces. Nevertheless, I still find the print statement more convenient, easier to understand and read, and frankly, not as ugly. *This is a personal opinion*. You aren't going to change my mind, and nor are you under any obligation to try. I won't discard Python if print is dropped, but I will be a little saddened. > I'm having trouble coming up with examples where this is > really necessary. I'd like to see a few examples where it's crucial > that the final newline is suppressed. No-one is saying "crucial". We're just expressing opinions. But so are those (even Guido!) who want to remove the print statement. No-one has come up with a genuine, objective benefit to removing it (that I can see). If there isn't one, then we're left with preferences, and Guido's trumps everyone else's. You (as someone who agrees with Guido) don't have anything to prove. Those of us who want to change Guido's mind need to impress him with the strength of our opinions :-). Sorry about that - I just get a bit tired of feeling like everyone's characterising me as either a newbie, or as not writing "real" code... Paul. From fredrik at pythonware.com Fri Sep 2 17:51:42 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Fri, 2 Sep 2005 17:51:42 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <20050902142044.GA18622@discworld.dyndns.org><17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> Message-ID: Paul Moore wrote: > Sorry about that - I just get a bit tired of feeling like everyone's > characterising me as either a newbie, or as not writing "real" code... Hey, I'm a newbie, and I only write simple things, but Python is for people like me, too! From rrr at ronadam.com Fri Sep 2 18:53:16 2005 From: rrr at ronadam.com (Ron Adam) Date: Fri, 02 Sep 2005 12:53:16 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <79990c6b05090201181f556cf1@mail.gmail.com> References: <43179D27.5080805@ronadam.com> <79990c6b05090201181f556cf1@mail.gmail.com> Message-ID: <4318837C.5080300@ronadam.com> Paul Moore wrote: > On 9/2/05, Ron Adam wrote: > >>Jim Jewett wrote: >> >>>Putting the spaces back in (without a format string) would >>>be even worse. Charles Cazabon's pointed out that it *could* >>>be as simple as >>> >>> writeln(' '.join( ... )) >> >>Why not just offer an addition method ? >> >>examine(x,y,z) # print with spaces > > > Because we're now up to *four* stream methods, plus the same number of > builtins, to do what one statement currently does? I'm not sure having one statement that can do several things with multiple syntax's is better than having multiple methods each with a single syntax. How is this different than having two methods in the case of partition() and rpartiion(). Ron From paolo_veronelli at libero.it Fri Sep 2 20:15:55 2005 From: paolo_veronelli at libero.it (Paolino) Date: Fri, 02 Sep 2005 20:15:55 +0200 Subject: [Python-Dev] itertools.chain should take an iterable ? In-Reply-To: References: <43174150.5080002@libero.it><20050901173518.GE6140@performancedrivers.com> <431842AA.2050405@libero.it> Message-ID: <431896DB.6070901@libero.it> Christos Georgiou wrote: > "Paolino" wrote in message > news:431842AA.2050405 at libero.it... > > >>What if I want to chain an infinite list of iterables? >>Shouldn't itertools.chain be built to handle that? > > > Raymond already suggested a four-line function that does exactly that. > > Create your own personal-library modules containing the functions you find > useful as building blocks, and when you have a large sw base using them, > present your building blocks along with their use cases as arguments for > inclusion in the standard library. > > >>I don't think it is a problem to accept only the second case you paste >>and produce TypeError on the others. > > > It would break compatibility with the current uses of itertools.chain . I > like it (and have used it) as it is. I see ,I just thought itertools was young and important enough to be investigated and eventually changed, but probably this is not the place to talk about that.I will submit the feature request to SF. I must add that the inverse story would have been def handy_chain(*args): return itertools.chain(iter(args)) a two-line function (ex lambda). Generally speaking, having a star-signature in a base library function is not a good choice. This is a proof of a case. Thanks all and have a nice summer. From steven.bethard at gmail.com Fri Sep 2 19:23:39 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Fri, 2 Sep 2005 11:23:39 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <79990c6b05090208367372f705@mail.gmail.com> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> Message-ID: Paul Moore wrote: > On 9/2/05, Steven Bethard wrote: > [...] > > Since > > the print function seems to be intended mainly for newbies and simple > > debugging, > > I think there have been quite a few comments here from people who > *don't* see the print statement [1] as "mainly for newbies and simple > debugging". Sorry for the confusion. I wasn't trying to imply anyone was a newbie here, only that the earlier messages in this thread suggested that these were the print statement's main audience. (Hence "seems to be".) Obviously print is used by the rest of us too -- I count around 5000 instances in my installation. However, I only count around 400 instances where a "print" line ends with a comma. > [1] Sorry, you said "print function". But you seem to be aiming the > function as able to address everyone's concerns over features they > like in the print *statement* so I think my point stands. Yes, that was the intention. If the print function doesn't meet most of the same needs that the print statement needs, then it's not doing its job. > > I'm having trouble coming up with examples where this is > > really necessary. I'd like to see a few examples where it's crucial > > that the final newline is suppressed. > > No-one is saying "crucial". We're just expressing opinions. I understand that. I'd just like to see the opinions backed up with real code. ;-) Personally, I still use print a fair bit for debugging purposes. But as I don't use it for much else, I can't judge too well what other needs people have for it. Steve -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From tim.peters at gmail.com Fri Sep 2 19:45:34 2005 From: tim.peters at gmail.com (Tim Peters) Date: Fri, 2 Sep 2005 13:45:34 -0400 Subject: [Python-Dev] setdefault's second argument In-Reply-To: <4314C08C.6060302@python.org> References: <1f7befae05083009146a9c35ce@mail.gmail.com> <003301c5ad80$c72c1020$8832c797@oemcomputer> <1f7befae05083009565974978c@mail.gmail.com> <4314C08C.6060302@python.org> Message-ID: <1f7befae050902104577b93824@mail.gmail.com> [Tim Peters] >> Dang! I may have just found a use, in Zope's >> lib/python/docutils/parsers/rst/directives/images.py (which is part >> of docutils, not really part of Zope): >> >> figwidth = options.setdefault('figwidth') >> figclass = options.setdefault('figclass') >> del options['figwidth'] >> del options['figclass'] [David Goodger] > If a feature is available, it *will* eventually be used! > Whose law is that? This is a different law, about design mistakes getting used by people who should know better ;-) > The code needs to store the values of certain dict entries, then > delete them. This is because the "options" dict is passed on to > another function, where those entries are not welcome. The code above > is simply shorter than this: > > if options.has_key('figwidth'): > figwidth = options['figwidth'] > del options['figwidth'] > # again for 'figclass' > > Alternatively, > > try: > figwidth = options['figwidth'] > del options['figwidth'] > except KeyError: > pass Those wouldn't work in context, because they leave figwidth unbound if it's not a key in options. Later code unconditionally references fidgwidth. > It saves between one line and three lines of code per entry. But > since those entries are probably not so common, it would actually be > faster to use one of the above patterns. Changing figwidth = options.setdefault('figwidth') figclass = options.setdefault('figclass') to figwidth = options.setdefault('figwidth', None) figclass = options.setdefault('figclass', None) is a minimal semantics-neutral edit to avoid the unloved 1-argument case. >> Assuming options is a dict-like thingie, it probably meant to do: >> >> figwidth = options.pop('figwidth', None) >> figclass = options.pop('figclass', None) > Yes, but the "pop" method was only added in Python 2.3. Docutils > currently maintains compatibility with Python 2.1, so that's RIGHT > OUT! Oh, stop torturing yourself. Nobody uses Python 2.1 anymore ;-) >> David, are you married to that bizarre use of setdefault ? > No, not at all. In fact, I will vehemently deny that I ever wrote > such code, and will continue to do so until someone looks up its > history and proves that I'm guilty, which I probably am. No, I checked, and this code was actually added by an Asian spammer, who polluted the docutils codebase with thousandsd of porn links hidden in triple-quoted strings. Google reveals that 1-argument setdefault() is a favorite of Asian porn spammers. So you should add a second argument just to avoid getting in trouble with Interpol ;-) From steve at holdenweb.com Fri Sep 2 21:22:43 2005 From: steve at holdenweb.com (Steve Holden) Date: Fri, 02 Sep 2005 14:22:43 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1f7befae05090208207fb8decf@mail.gmail.com> References: <4317FCCD.80702@pfdubois.com> <1f7befae05090208207fb8decf@mail.gmail.com> Message-ID: <4318A683.60600@holdenweb.com> Tim Peters wrote: > [Paul F. Dubois] > >>Remove the print statement....I laughed until my sides hurt. Hello? Try >>dating girls and talking to normal people, geek boys. [...] > > > Providing entertainment for retirees is one of the PSF's missions. I > wonder whether we could get AARP to kick back $10 to the PSF for each > of their members? For 350 million dollars a year, I'll be happy to > maintain a parallel P3K with a "print" statement until I die. No you wouldn't, that's a lie (whether you know it or not). With that kind of money at your disposal you would soon realise there's more to life than writing software and smoking by the office door. You might even start smoking things other than tobacco. Maintaining software would become a tedious drudge, you would be stultified by the obligation to do it, and be driven to suicide by the enforced interactions with the other developers who had nothing but scorn for the lowly print statement. On the other hand, with that kind of money you could probably hire enough geeks to do the maintenance for you. first-in-line-for-the-job-ly y'rs - steve PS: For what little it's worth I'd keep print too. -- Steve Holden +44 150 684 7255 +1 800 494 3119 Holden Web LLC http://www.holdenweb.com/ From p.f.moore at gmail.com Fri Sep 2 21:45:05 2005 From: p.f.moore at gmail.com (Paul Moore) Date: Fri, 2 Sep 2005 20:45:05 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> Message-ID: <79990c6b05090212453f3b7c77@mail.gmail.com> On 9/2/05, Steven Bethard wrote: > Sorry for the confusion. I wasn't trying to imply anyone was a newbie > here, only that the earlier messages in this thread suggested that > these were the print statement's main audience. No problem - I was more joking than serious. But I don't see the same implication in earlier messages as you do - to me, the general impression is that people use the print statement in many different ways, and debugging and trivial scripts are far from the only use. > Obviously print is used by the rest of us too -- I count around > 5000 instances in my installation. I find it hard to reconcile that with your comment that newbies/debigging are the only real uses for the print statement... > However, I only count around 400 > instances where a "print" line ends with a comma. Yes, generally my uses of print are to produce complete lines of output. > I understand that. I'd just like to see the opinions backed up with > real code. ;-) Personally, I still use print a fair bit for debugging > purposes. But as I don't use it for much else, I can't judge too well > what other needs people have for it. Fair enough. I'll try to review where I use the print statement: - Debugging, most definitely. Adding a quick print "a =", a is often all that's needed. - Logging, sometimes. When I just want some basic output, and don't want to deal with the complexity of the logging package. - Unix-style command-line utilities, where textual output to stdout is the norm. - Error and help messages, often with print >>sys.stderr (The last two are obviously the ones I'd emphasize most when arguing that print should stay). Frankly, pretty much anything where the output is to go to stdout/stderr (console, redirected file or pipe) and it's line-oriented in nature. Yes, a stream.writeln() method could do what I want, but the print statement just *feels* more natural. Interestingly enough, the other languages I use most (C, Java, VB(Script) and Javascript (under Windows Scripting Host)) all use functions for output. Except for C, I uniformly dislike the resulting code - the output structure gets hopelessly lost under the weight of string concatenation and explicitly added spaces. With C, this is mitigated by printf, which implies to me that if Python goes this route, C-style string formatting will become far more prevalent in code. But I'm really still just speculating. No-one's really going to know if it's a bad idea until it happens. Personally, I'm just arguing against taking that risk in the absence of any clear benefits beyond "purity"... Paul. From steven.bethard at gmail.com Fri Sep 2 22:26:51 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Fri, 2 Sep 2005 14:26:51 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <79990c6b05090212453f3b7c77@mail.gmail.com> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> Message-ID: Paul Moore wrote: > Interestingly enough, the other languages I use most (C, Java, > VB(Script) and Javascript (under Windows Scripting Host)) all use > functions for output. Except for C, I uniformly dislike the resulting > code - the output structure gets hopelessly lost under the weight of > string concatenation and explicitly added spaces. Are your complaints about Guido's proposal or mine? The complaint above doesn't quite seem relevant to my proposal, which retains the space-insertion. Basically, my proposal suggests that files (and other streams) gain a print method like: class file(object): ... def print(self, *args): self.write(' '.join(str(arg) for arg in args)) self.write('\n') and the print statement becomes the builtin print() function, defined like: def print(*args): sys.stdout.print(*args) Looking at your use cases, this seems to cover them pretty well: > - Debugging, most definitely. Adding a quick print "a =", a is often > all that's needed. Use the builtin print(): print('a =', a) > - Logging, sometimes. When I just want some basic output, and don't > want to deal with the complexity of the logging package. Use the builtin print(): print('some logging message', foo) > - Unix-style command-line utilities, where textual output to stdout is the norm. Use the builtin print(): print('line of output') > - Error and help messages, often with print >>sys.stderr Use the print() method of sys.stderr: sys.stderr.print('error or help message') STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From john at hazen.net Fri Sep 2 23:18:14 2005 From: john at hazen.net (John Hazen) Date: Fri, 2 Sep 2005 14:18:14 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> Message-ID: <20050902211814.GA30031@gate2.hazen.net> I like the elegance of python, and read py-dev for my own edification. Since I believe I still have somewhat of a "beginner's mind" regarding python, I'll chime in with my opinions. First of all, I dislike 'writeln', for two reasons: 1) The name. I always want to mentally pronounce it 'ritt-linn'. If we *must* have this function, I'd prefer 'writeline'. 2) 'writeln' is a convenience function. A convenience function should be convenient! It seems to me that the most common (and convenient) use case is adding spaces between the arguments /and/ adding the newline. Since this is the same as the current default behavior of 'print', I suggest we use that name. So, after reading all the messages, it turns out my proposal is the same as STeVe's: all streams grow a 'print' method, and the builtin print function just be an alias to sys.stdout.print. Originally, I thought I preferred the statement version of print, but as long as the basic behavior of print is kept, I could get used to adding parens in my typing. The consistency calling the print builtin function with the stream.print method is nicer than the keystroke savings of no parens required by the statement. Having print as a function removes the need for ">>" too ('stream.print(foo)' instead of 'print >>stream foo'). I'm OK with losing the trailing-comma behavior, as I think 'write' should be used for anything beyond the basic default usecase. To summarize: +1 STeVe's proposal (stream.print for all streams, print builtin which maps to sys.stdout.print) +0 status quo -1 Guido's proposal (stream.write and stream.writeln for all streams, write and writeln builtins which map to sys.stdout) -John For reference: * Steven Bethard [2005-09-02 13:06]: > > Basically, my proposal suggests that files (and > other streams) gain a print method like: > > class file(object): > ... > def print(self, *args): > self.write(' '.join(str(arg) for arg in args)) > self.write('\n') > > and the print statement becomes the builtin print() function, defined like: > > def print(*args): > sys.stdout.print(*args) From skip at pobox.com Sat Sep 3 00:12:53 2005 From: skip at pobox.com (skip@pobox.com) Date: Fri, 2 Sep 2005 17:12:53 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> Message-ID: <17176.52837.144428.881211@montanaro.dyndns.org> Steven> Obviously print is used by the rest of us too -- I count around Steven> 5000 instances in my installation. However, I only count around Steven> 400 instances where a "print" line ends with a comma. I took a quick look at my own code: 980 active print statements 110 active print statements with trailing commas 67 active print statements with output redirection 6 active print statements with trailing commas and output redirection 64 inactive print statements 4 inactive print statements with trailing commas 6 inactive print statements with output redirection 1 inactive print statement with trailing commas and output redirection so more than 10% of the print statements in my code use the trailing comma feature and more than 5% use output redirection. I suspect the discrepancy between use of the two features would be less if output redirection had been available from the start. Skip From martin.blais at gmail.com Sat Sep 3 01:07:16 2005 From: martin.blais at gmail.com (Martin Blais) Date: Fri, 2 Sep 2005 19:07:16 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <20050902142044.GA18622@discworld.dyndns.org> References: <20050902142044.GA18622@discworld.dyndns.org> Message-ID: <8393fff05090216075c6242e4@mail.gmail.com> On 9/2/05, Charles Cazabon wrote: > Fredrik Lundh wrote: > > > > > > print('foo:', foo, 'bar:', bar, 'baz:', baz, > > > 'frobble', frobble) > > > > > > To my (admittedly biased) eyes, the second version more obviously > > > prints to a single line. > > > > next use case: > > > > print 'foo:', foo, 'bar:', bar, 'baz:', baz, > > if frobble > 0: > > print 'frobble', frobble > > else: > > print 'no frobble today' > > The need to print /and/ not add a newline isn't nearly as common. print() > could take a keyword parameter to skip the newline, or ... > > print('foo:', foo, 'bar:', bar, 'baz:', baz, > frobble and 'frobble: ' + frobble or 'no frobble today') Ouf, I'm just feeling an evil idea creeping up just now: print('foo:', foo, 'bar:', bar, 'baz:', baz,) Just kidding, really... Funny enough, the syntax does not barf and goes undetected: >>> def foo( a, b, c ): ... print a, b, c ... >>> foo(1, 2, 3,) 1 2 3 >>> From p.f.moore at gmail.com Sat Sep 3 01:17:11 2005 From: p.f.moore at gmail.com (Paul Moore) Date: Sat, 3 Sep 2005 00:17:11 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> Message-ID: <79990c6b050902161715cf829b@mail.gmail.com> On 9/2/05, Steven Bethard wrote: > Paul Moore wrote: > > Interestingly enough, the other languages I use most (C, Java, > > VB(Script) and Javascript (under Windows Scripting Host)) all use > > functions for output. Except for C, I uniformly dislike the resulting > > code - the output structure gets hopelessly lost under the weight of > > string concatenation and explicitly added spaces. > > Are your complaints about Guido's proposal or mine? The complaint > above doesn't quite seem relevant to my proposal, which retains the > space-insertion. Basically, my proposal suggests that files (and > other streams) gain a print method [...] Mainly Guido's, I guess. As you point out, your pint method is pretty close to the print statement. Some comments, though: 1. The print statement applies to *all* streams, where a print method won't. (You can't ensure that all 3rd-party file-like objects will be updated, unfortunately...) [A mildly obscure example - the Vim interface to Python binds sys.stdout to a pseudo stream which puts output in Vim's message area. Fail to update that code, and the print builtin won't work in Vim...] 2. There are still a confusing number of methods/builtins involved. Your print method isn't enough by itself, you still need write (and presumably a write builtin). Would you reject Guido's writeln? What about writef (again, as proposed by Guido)? I'm not at all clear precisely how many new methods and builtins you are proposing. 3. I am still hoping that someone will articulate a clear benefit for removing the print statement. Without that, I still see all cost and no benefit (even if I accept that your print function is an entirely adequate replacement for the print statement, that doesn't count as a benefit, just as the avoidance of yet another cost...) Paul. From martin.blais at gmail.com Sat Sep 3 01:34:31 2005 From: martin.blais at gmail.com (Martin Blais) Date: Fri, 2 Sep 2005 19:34:31 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> Message-ID: <8393fff050902163428c791a3@mail.gmail.com> On 9/2/05, Steven Bethard wrote: > Paul Moore wrote: > > Interestingly enough, the other languages I use most (C, Java, > > VB(Script) and Javascript (under Windows Scripting Host)) all use > > functions for output. Except for C, I uniformly dislike the resulting > > code - the output structure gets hopelessly lost under the weight of > > string concatenation and explicitly added spaces. > > Are your complaints about Guido's proposal or mine? The complaint > above doesn't quite seem relevant to my proposal, which retains the > space-insertion. Basically, my proposal suggests that files (and > other streams) gain a print method like: > > class file(object): > ... > def print(self, *args): > self.write(' '.join(str(arg) for arg in args)) > self.write('\n') > > and the print statement becomes the builtin print() function, defined like: > > def print(*args): > sys.stdout.print(*args) > > Looking at your use cases, this seems to cover them pretty well: > > > - Debugging, most definitely. Adding a quick print "a =", a is often > > all that's needed. > > Use the builtin print(): > > print('a =', a) > > > - Logging, sometimes. When I just want some basic output, and don't > > want to deal with the complexity of the logging package. > > Use the builtin print(): > > print('some logging message', foo) > > > - Unix-style command-line utilities, where textual output to stdout is the norm. > > Use the builtin print(): > > print('line of output') > > > - Error and help messages, often with print >>sys.stderr > > Use the print() method of sys.stderr: > > sys.stderr.print('error or help message') > Wow, that's so cool actually, you make the concept of "print'ing" even more regular (on all file objects, and then builtin print is just like general print'ing, except for sys.stdout), we don't need a keyword argument for the stream anymore, and the special statement goes away. And if you like concise, then you could do something like this:: perr = sys.stderr.print ... perr("Error: comfobulator failed to initialize doogledigook.") I like it so much that my mind is wandering about hacking my sitecustomize.py to inject it in __builtin__ so I can start using it right now... +1 Also, you're making a point that I think seem to be missing: it's REALLY just about a couple of parentheses. Print statement without parens, print with parens.... same stuff. It's a builtin, it's still always there, and so it's still a convenient as before, except you have to type parentheses. On the upside: one less quirk/exception in the language (one more tiny step towards lisp, me love that, simple is good). I don't think that it would make it harder on the newbies either: less stuff to learn, it's just a function! Someone above proposed a string of one char as last argument would trigger the no-newline case. Why not use an empty string instead? print("Incomplete line", '') Seems like the thing that would disrupt print the least is an empty string... The special meaning is implied. Or if you want more verbose, a special symbol/value in a convenient namespace: print("Incomplete line", print.cont) cheers, From ncoghlan at gmail.com Sat Sep 3 01:54:43 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 03 Sep 2005 09:54:43 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <17175.50404.856427.685737@montanaro.dyndns.org> References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> <1125588305.22624.32.camel@geddy.wooz.org> <17175.50404.856427.685737@montanaro.dyndns.org> Message-ID: <4318E643.2020102@gmail.com> skip at pobox.com wrote: > >> And good riddance! The print statement harks back to ABC and even > >> (unvisual) Basic. Out with it! > > Barry> I have to strongly disagree. The print statement is simple, easy > Barry> to understand, and easy to use. > > I'm with Barry. Even for non-debug use the print statement is suitable for > the majority of my output. 99.9% of my Python code is test harnesses to run low-level hardware control interface tests, for which printing to stdout works perfectly. Maybe one day I'll stick a GUI on the front end of them, but even then I will probably just be using subprocess to invoke the command line versions behind the scenes. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From ncoghlan at gmail.com Sat Sep 3 03:02:43 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 03 Sep 2005 11:02:43 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> Message-ID: <4318F633.6050501@gmail.com> Steven Bethard wrote: > Well, my proposal (which differs from Guidos) is that the print > function (or whatever it ends up getting called) would have the > semantics: > > def print(*args): > sys.stdout.write(' '.join(str(arg) for arg in args)) > sys.stdout.write('\n') I'd rather see a signature with the expressiveness of the current print statement (full implementation below). I've called it 'output' rather than 'print' so that copy and pasting it into a 2.4 interactive session will work (I also think the symmetry with input is cute). With this function: print 1, 2, 3 => output(1, 2, 3) print >> sys.stderr, 1, 2, 3 => output(1, 2, 3, stream=sys.stderr) print "%d%d%d" % (1, 2, 3) => output(1, 2, 3, sep='') print 1, 2, 3, => output(1, 2, 3, end='') Printing a tab-separated or comma separated list becomes trivial: print "%d, %d, %d" % (1, 2, 3) => output(1, 2, 3, sep=', ') print "%d\t%d\t%d" % (1, 2, 3) => output(1, 2, 3, sep='\t') Printing the items in a sequence also becomes straightforward: print " ".join(map(str, range(10))) => output(*range(10)) Playing well with generator expressions comes for free, too: print " ".join(str(x*x) for x in range(10)) => output(*(x*x for x in range(10))) The capabilities of the current print statement reveal a lot of collective wisdom regarding what is useful - the only real issues are that the syntax is somewhat ugly and unique to the print statement, rather than using standard function call syntax, and that as a result of the first issue, it is difficult to control the behaviour of the separator. Turning it into a proper function with three keyword arguments (sep, end, stream) would allow both of these issues to be addressed, and also provide a whole lot of fringe benefits relating to printing of sequences as described above. Cheers, Nick. The sample implementation: def output(*args, **kwds): """Functional replacement for the print statement >>> output(1, 2, 3) 1 2 3 >>> output(1, 2, 3, sep='') 123 >>> output(1, 2, 3, sep=', ') 1, 2, 3 >>> output(1, 2, 3, end='Alternate line ending') 1 2 3Alternate line ending >>> import sys >>> output(1, 2, 3, stream=sys.stderr) 1 2 3 >>> output(*range(10)) 0 1 2 3 4 5 6 7 8 9 >>> output(*(x*x for x in range(10))) 0 1 4 9 16 25 36 49 64 81 """ # Parse the keyword-only optional arguments defaults = { "sep": " ", "end": "\n", "stream": sys.stdout, } for name, default in defaults.items(): item = None try: item = kwds[name] except KeyError: pass if item is None: kwds[name] = default sep, end, stream = kwds["sep"], kwds["end"], kwds["stream"] # Perform the print operation without building the whole string for arg in args[:1]: stream.write(str(arg)) for arg in args[1:]: stream.write(sep) stream.write(str(arg)) stream.write(end) -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From ncoghlan at gmail.com Sat Sep 3 03:13:21 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 03 Sep 2005 11:13:21 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <79990c6b05090208367372f705@mail.gmail.com> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> Message-ID: <4318F8B1.9090701@gmail.com> Paul Moore wrote: > No-one is saying "crucial". We're just expressing opinions. But so are > those (even Guido!) who want to remove the print statement. No-one has > come up with a genuine, objective benefit to removing it (that I can > see). If there isn't one, then we're left with preferences, and > Guido's trumps everyone else's. You (as someone who agrees with Guido) > don't have anything to prove. Those of us who want to change Guido's > mind need to impress him with the strength of our opinions :-). "Print as statement" => printing sequences nicely is a pain "Print as function" => extended call syntax deals with sequences nicely "Print as statement" => can't easily change the separator "Print as function" => keyword argument handles the separator nicely "Print as statement" => trailing comma suppresses newline by magic "Print as function" => keyword argument handles the line terminator nicely "Print as statement" => redirection is via a magic symbol "Print as function" => keyword argument handles redirection nicely "Print as statement" => can't easily save 'settings' for re-use "Print as function" => can use functional.partial to create custom version See my other post where I present a Python 2.4 implementation of a function called "output" which does everything I describe above. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From ncoghlan at gmail.com Sat Sep 3 03:19:59 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 03 Sep 2005 11:19:59 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <8393fff05090216075c6242e4@mail.gmail.com> References: <20050902142044.GA18622@discworld.dyndns.org> <8393fff05090216075c6242e4@mail.gmail.com> Message-ID: <4318FA3F.1000709@gmail.com> Martin Blais wrote: > Funny enough, the syntax does not barf and goes undetected: Python generally allows trailing commas so that it is easier to write sequence literals which are appended to later. There's also the fact that a trailing comma is used to make a 1-element tuple - so it could be said that the exception is actually that the comma after the last item can be optionally left out when there is more than one item in the sequence :) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From pje at telecommunity.com Sat Sep 3 03:26:10 2005 From: pje at telecommunity.com (Phillip J. Eby) Date: Fri, 02 Sep 2005 21:26:10 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <4318F633.6050501@gmail.com> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> Message-ID: <5.1.1.6.0.20050902212100.0339ccf0@mail.telecommunity.com> At 11:02 AM 9/3/2005 +1000, Nick Coghlan wrote: >Printing the items in a sequence also becomes straightforward: > >print " ".join(map(str, range(10))) => output(*range(10)) > >Playing well with generator expressions comes for free, too: > >print " ".join(str(x*x) for x in range(10)) > => output(*(x*x for x in range(10))) An implementation issue: that generator expression will get expanded into a tuple, so you shouldn't use that for outputting large sequences. I don't much care for 'output' as the name, or 'end' as the end-of-line arguments, but for the most part I like the semantics; being able to drop the separator or change the end-of-line string make lots of use cases straightforward, and perhaps almost worth the parentheses. My inclination would be to call the function 'print', though, and rename 'end' to 'trailer'. From guido at python.org Sat Sep 3 03:42:10 2005 From: guido at python.org (Guido van Rossum) Date: Fri, 2 Sep 2005 18:42:10 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <4318F8B1.9090701@gmail.com> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> Message-ID: Wow. With so many people expressing a gut response and not saying what in the proposal they don't like, it's hard to even start a response. Is it... - Going from statement to function? - Losing the automatically inserted space? - Having to write more to get a newline appended? - Losing the name 'print'? Some responses seemed to have missed (or perhaps for stronger rhetorical effect intentionally neglected) that I was proposing builtins in addition to the stream methods, so that all those debug prints would be just as easy to add as before. And I don't think I ever said print was only for newbies! I'd like to be flexible on all points *except* the syntax -- I really want to get rid of print as a *statement*. Consider this: if Python *didn't* have a print statement, but it had a built-in function with the same functionality (including, say, keyword parameters to suppress the trailing newline or the space between items); would anyone support a proposal to make it a statement instead? -- --Guido van Rossum (home page: http://www.python.org/~guido/) From martin.blais at gmail.com Sat Sep 3 03:45:57 2005 From: martin.blais at gmail.com (Martin Blais) Date: Fri, 2 Sep 2005 21:45:57 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <5.1.1.6.0.20050902212100.0339ccf0@mail.telecommunity.com> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <5.1.1.6.0.20050902212100.0339ccf0@mail.telecommunity.com> Message-ID: <8393fff0509021845427b0ba3@mail.gmail.com> On 9/2/05, Phillip J. Eby wrote: > At 11:02 AM 9/3/2005 +1000, Nick Coghlan wrote: > >Printing the items in a sequence also becomes straightforward: > > > >print " ".join(map(str, range(10))) => output(*range(10)) > > > >Playing well with generator expressions comes for free, too: > > > >print " ".join(str(x*x) for x in range(10)) > > => output(*(x*x for x in range(10))) > > An implementation issue: that generator expression will get expanded into a > tuple, so you shouldn't use that for outputting large sequences. Then how about:: output(*(x*x for x in range(10)), iter=1) Where all given iterable parameters are automatically iterated? From ncoghlan at iinet.net.au Sat Sep 3 03:47:18 2005 From: ncoghlan at iinet.net.au (Nick Coghlan) Date: Sat, 03 Sep 2005 11:47:18 +1000 Subject: [Python-Dev] New Wiki page - PrintAsFunction Message-ID: <431900A6.6000406@iinet.net.au> All, I put up a Wiki page for the idea of replacing the print statement with an easier to use builtin: http://wiki.python.org/moin/PrintAsFunction Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From tjreedy at udel.edu Sat Sep 3 03:47:46 2005 From: tjreedy at udel.edu (Terry Reedy) Date: Fri, 2 Sep 2005 21:47:46 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <4317FCCD.80702@pfdubois.com> Message-ID: "Paul F. Dubois" wrote in message news:4317FCCD.80702 at pfdubois.com... > Remove the print statement....I laughed until my sides hurt. Hello? Try > dating girls and talking to normal people, geek boys. > > We scientists still use these for debugging. We never 'move on' very far > from the tutorial. The salient feature about print statements is that > they live to be put in and commented out 10 minutes later, without some > import being required or other enabling object being around. > > Easy things should be easy. Hard things should be possible. I don't > believe the person who said the trailing comma case mixed up anybody, > not for more than 10 seconds anyway. > > OK, now that I've offended everyone, I'll go back into retirement. But I > *am* laughing at you. > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > http://mail.python.org/mailman/options/python-dev/python-python-dev%40m.gmane.org > From ncoghlan at gmail.com Sat Sep 3 03:51:24 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 03 Sep 2005 11:51:24 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <5.1.1.6.0.20050902212100.0339ccf0@mail.telecommunity.com> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <5.1.1.6.0.20050902212100.0339ccf0@mail.telecommunity.com> Message-ID: <4319019C.6010207@gmail.com> Phillip J. Eby wrote: > At 11:02 AM 9/3/2005 +1000, Nick Coghlan wrote: > >> Printing the items in a sequence also becomes straightforward: >> >> print " ".join(map(str, range(10))) => output(*range(10)) >> >> Playing well with generator expressions comes for free, too: >> >> print " ".join(str(x*x) for x in range(10)) >> => output(*(x*x for x in range(10))) > > > An implementation issue: that generator expression will get expanded > into a tuple, so you shouldn't use that for outputting large sequences. Agreed - but using join with print suffers from a similar problem, in that it builds the large string in memory before displaying it. I actually hope that extended function call syntax in Py3k will use iterators rather than tuples so that this problem goes away. > I don't much care for 'output' as the name, or 'end' as the end-of-line > arguments, but for the most part I like the semantics; being able to > drop the separator or change the end-of-line string make lots of use > cases straightforward, and perhaps almost worth the parentheses. > > My inclination would be to call the function 'print', though, and rename > 'end' to 'trailer'. 'print' is Py24 incompatible though, which is why I didn't use it for the sample code. The version I put on the wiki now uses 'term' for the line terminator keyword, but I'm not too worried about the exact names at this point. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From skip at pobox.com Sat Sep 3 04:17:15 2005 From: skip at pobox.com (skip@pobox.com) Date: Fri, 2 Sep 2005 21:17:15 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> Message-ID: <17177.1963.69703.689791@montanaro.dyndns.org> Guido> Is it... Guido> - Going from statement to function? Guido> - Losing the automatically inserted space? Guido> - Having to write more to get a newline appended? Guido> - Losing the name 'print'? You forgot - gratuitous breakage? I realize you're talking about Py3K, so breakage is allowed, but the advantages of a print function/method over the current print statement don't seem sufficient to warrant the level of code change (probably simple but tedious) that will be necessary to convert 2.x to 3.x. Guido> Consider this: if Python *didn't* have a print statement, but it Guido> had a built-in function with the same functionality (including, Guido> say, keyword parameters to suppress the trailing newline or the Guido> space between items); would anyone support a proposal to make it Guido> a statement instead? Nope, but there is a large body of code out there that does use print statements already. Again, I know you're prepared for breakage, but that doesn't necessarily mean a completely blank sheet of paper. Skip From nnorwitz at gmail.com Sat Sep 3 05:14:54 2005 From: nnorwitz at gmail.com (Neal Norwitz) Date: Fri, 2 Sep 2005 20:14:54 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <17177.1963.69703.689791@montanaro.dyndns.org> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <17177.1963.69703.689791@montanaro.dyndns.org> Message-ID: On 9/2/05, skip at pobox.com wrote: > > Nope, but there is a large body of code out there that does use print > statements already. Again, I know you're prepared for breakage, but that > doesn't necessarily mean a completely blank sheet of paper. Ideally I very much prefer that print become a function. However, the major backlash has swayed me some, if for no other reason that people are so strongly against changing it. What if a tool existed that did the conversion? I realize that the tool is unlikely to be perfect, but what if it could do 99.9% of the job? I'm not thinking about just fixing print, but also converting iterkeys/itervalues/iteritems, xrange -> range, raw_input -> input, warning about use of input(), etc. I'm sure this tool wouldn't be perfect, but if it did most of the work, would that change opinions? n From ncoghlan at gmail.com Sat Sep 3 05:33:03 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 03 Sep 2005 13:33:03 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <17177.1963.69703.689791@montanaro.dyndns.org> Message-ID: <4319196F.1060405@gmail.com> Neal Norwitz wrote: > I'm sure this tool wouldn't be perfect, but if it did most of the > work, would that change opinions? To me, the main objection seems to revolve around the fact that people would like to be able to "future-proof" Python 2.x code so that it will also run on Py3k. We're steadily accumulating collections of "old ways" and "new ways", and the Py3k transition should mainly be about deleting the "old ways". That is, if the way something is going to be done is to change in Py3k, the new alternative should already be in place towards the end of the 2.x series, so that Py3k is only a potential problem if people are still using the "old ways". Maybe "from __future__ import py3k" would do the trick ;) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From tjreedy at udel.edu Sat Sep 3 05:33:08 2005 From: tjreedy at udel.edu (Terry Reedy) Date: Fri, 2 Sep 2005 23:33:08 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <20050902142044.GA18622@discworld.dyndns.org><17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com><4318F8B1.9090701@gmail.com> Message-ID: "Guido van Rossum" wrote in message news:ca471dc20509021842e586aa3 at mail.gmail.com... > With so many people expressing a gut response and not saying what in > the proposal they don't like, it's hard to even start a response. > Is it... For me a bit of several things though with quite variable intensity. First, print, as an abbreviation of looped writes, works fine for its appointed task. It gives me don't-care-about-the-format info on values with near minimal overhead. So change needs clear justification. > - Going from statement to function? Minor. For quickly adding debug prints, two extra ()s are a small burden, but if the function were called 'out', then there would still be just five keystrokes. Nick's output() convinced me that there are compensations to the function form. Besides, having used the argument of aesthetic consistency elsewhere, I can hardly deny it to you ;-). > - Losing the automatically inserted space? Major. This is an essential plus of print. > - Having to write more to get a newline appended? Near major. See above. I believe that two people have reported that around 85% of their prints use these defaults, so I think adding a keyword for something other would be the way to go. > - Losing the name 'print'? You gave one reason for this as disassociating from Basic. I can see how a CS grad would want to do so, but Basic once was the vehicle for CP4E (computer programming for everyone) that you want Python to become. In fact, I think PSF should promote Python as the 'Basic for the 21st Century' that should be on most desktops the way Basic once was. So I would prefer to see a different reason for a name change. > - [dash added] Some responses seemed to have missed [snip] > that I was proposing builtins in addition to the stream methods, My objections here are first the plural, which does not seem really necessary, and second the longer (in chars and syllables) and also old name 'writeln' from Pascal for the one that does what print does. > I'd like to be flexible on all points *except* the syntax -- I really > want to get rid of print as a *statement*. [snip] > would anyone support a proposal to make it a statement instead? Good question. Most Python statements benefit from statement syntax because their function syntax equivalent would be a little to hughly more awkward. This is mainly because parts of the statement are implicitly quoted. (Lisp does this with special forms and macros, but I prefer the Python way.) The two syntax tricks in print are different in that they are easily replaced by keywords. So I strongly suspect 'no'. Terry J. Reedy From ncoghlan at gmail.com Sat Sep 3 06:45:51 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 03 Sep 2005 14:45:51 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org><17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com><4318F8B1.9090701@gmail.com> Message-ID: <43192A7F.9070508@gmail.com> Terry Reedy wrote: > "Guido van Rossum" wrote in message >>- Going from statement to function? > > > Minor. For quickly adding debug prints, two extra ()s are a small burden, > but if the function were called 'out', then there would still be just five > keystrokes. Nick's output() convinced me that there are compensations to > the function form. Besides, having used the argument of aesthetic > consistency elsewhere, I can hardly deny it to you ;-). I've added a bit to the wiki to look at different names that have been suggested. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From steve at holdenweb.com Sat Sep 3 08:58:23 2005 From: steve at holdenweb.com (Steve Holden) Date: Sat, 03 Sep 2005 01:58:23 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <4318FA3F.1000709@gmail.com> References: <20050902142044.GA18622@discworld.dyndns.org> <8393fff05090216075c6242e4@mail.gmail.com> <4318FA3F.1000709@gmail.com> Message-ID: Nick Coghlan wrote: > Martin Blais wrote: > > Python generally allows trailing commas so that it is easier to write sequence > literals which are appended to later. > > There's also the fact that a trailing comma is used to make a 1-element tuple > - so it could be said that the exception is actually that the comma after the > last item can be optionally left out when there is more than one item in the > sequence :) > Given >>> (1,2,3,) (1, 2, 3) >>> (1,2,3,,) File " ", line 1 (1,2,3,,) ^ SyntaxError: invalid syntax >>> in Python 2.4, could the double-comma be imbued with some additional mystical meaning that the print() function/method could recognise as indicating a requirement to terminate output with a space rather than a newline? Python 3.0.6 (#17, Aug 13, 2008, 18:02:40) [CC 4.6.2 (cygwin special)] on cygwin Type "help", "copyright", "credits" or "license" for more information. . . . >>> f = open("myfile.txt", "w") >>> f.print('foo:', foo, 'bar:', bar, 'baz:', baz,,) >>> if frobble > 0: ... f.print('frobble', frobble) ... else: ... f.print('no frobble today') ... What other uses might this exciting new syntax (;-) find? Perhaps there could also be special meanings for three and four training commas, and a double-semicolon. Maybe it's time to consult Larry Wall? I am aware this response seems flippant. Sorry. I'm not against the introduction of the suggested new API, but that's adding to the language rather than simplifying it, so I'm not sure I understand the reason why the print statement must go (except to counter the addition of the new API), particularly since Guido's original venomous outburst arrived in the middle of a thread about Python 3.0 design principles: > > [Reinhold Birkenfeld] > >>> You'd have to enclose print arguments in parentheses. Of course, the "trailing >>> comma" form would be lost. > > > And good riddance! The print statement harks back to ABC and even > (unvisual) Basic. Out with it! > Is the principle here "Python must be different from ABC and BASIC"? In that case I suppose we'd better start thinking about what to use instead of "if" and "for". What did the print statement do to us that it must be cast out in this way? I suspect the fundamental problem is that the commas do something more than delimit sequence members. In which case we should say so rather than belittling Python's ancient predecessors. regards Steve -- Steve Holden +44 150 684 7255 +1 800 494 3119 Holden Web LLC http://www.holdenweb.com/ From fredrik at pythonware.com Sat Sep 3 10:05:15 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Sat, 3 Sep 2005 10:05:15 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <20050902142044.GA18622@discworld.dyndns.org><17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> Message-ID: Steven Bethard wrote: >> - Error and help messages, often with print >>sys.stderr > > Use the print() method of sys.stderr: > > sys.stderr.print('error or help message') so who's going to add print methods to all file-like objects? From radeex at gmail.com Sat Sep 3 10:24:44 2005 From: radeex at gmail.com (Christopher Armstrong) Date: Sat, 3 Sep 2005 18:24:44 +1000 Subject: [Python-Dev] Asynchronous use of Traceback objects Message-ID: <60ed19d40509030124730b8f5b@mail.gmail.com> With the implementation and soon release of PEP 342, I've been thinking more about traceback objects lately. First I'll give you some background information for my problem. I've implemented a module for integrating Twisted's Deferreds with the new yield expression, that allows you to do stuff like: @defgen def foo(): x = yield deferredOperation() print x Basically, defgen checks for Deferreds coming out of the generator, and when it finds one, adds a callback to that Deferred which will resume the generator, sending the result in. Since Deferreds have error support, it also allows code like this: @defgen def foo(userinput): try: x = yield deferredOperation(userinput) except ValueError: "Crap, wrong user input!" We have this object in Twisted called the "Failure", which is used for conveniently passing around exceptions asynchronously, and Deferreds use them to represent errors in deferred operations. The Failure objects have a reference to an exception object and the traceback that was associated with the original raising of that exception. However, we can only hold on to that traceback for so long, because traceback objects have references to so many things and can so easily cause horrible GC issues. The loss of this traceback object isn't *usually* a problem because we store enough other information in the Failure object to print representations of tracebacks nicely. However, if we try to re-raise the exception, we lose that traceback information. defgen re-raises the exception from a Failure into a defgen-using function with g.throw(). Unfortunately, quite often the traceback has been cleaned from the Failure object, and this means you'll get exceptions in defgen-using code with very bizarre and uninformative tracebacks. I had the idea to create a fake Traceback object in Python that doesn't hold references to any frame objects, but is still able to be passed to 'raise' and formatted as tracebacks are, etc. Unfortunately, raise does a type check on its third argument and, besides, it seems the traceback formatting functions are very reliant on the internal structure of traceback objects, so that didn't work. It does seem that I would be able to construct a passable fake Traceback object from C code -- one that had references to fake frames. These fake objects would only remember the original line numbers, filenames and so forth so that traceback printing could still work. I can try implementing this soon, but I'd just like to make sure I'm on the right track. For example, perhaps a better idea would be to change the traceback-printing functions to use Python attribute lookup instead of internal structure lookup, and then change raise to accept arbitrary Python objects as its third argument, as long as it matches the traceback interface. That would probably mean much more work, though. One concern is that I really don't like requiring C modules to use Twisted; all of the ones currently in there are optional. What's the likelihood of such a traceback-constructor getting its way into CPython if I do implement it? It may seem niche, but I expect many Twisted users would like to use PEP 342 defgen (many users are already using the defgen I wrote for python 2.2 generators). Thanks for any help, and have fun, -- Twisted | Christopher Armstrong: International Man of Twistery Radix | -- http://radix.twistedmatrix.com | Release Manager, Twisted Project \\\V/// | -- http://twistedmatrix.com |o O| | w----v----w-+ From rrr at ronadam.com Sat Sep 3 10:28:33 2005 From: rrr at ronadam.com (Ron Adam) Date: Sat, 03 Sep 2005 04:28:33 -0400 Subject: [Python-Dev] New Wiki page - PrintAsFunction In-Reply-To: <431900A6.6000406@iinet.net.au> References: <431900A6.6000406@iinet.net.au> Message-ID: <43195EB1.3090406@ronadam.com> Nick Coghlan wrote: > All, > > I put up a Wiki page for the idea of replacing the print statement with an > easier to use builtin: > > http://wiki.python.org/moin/PrintAsFunction > > Cheers, > Nick. Looks like a good start, much better than just expressing opinions. :-) How about making it a class? There are several advantages such as persistent separators and being able to have several different instances active at once. Cheers, Ron import sys class Print(object): newline = '\n' sep = ' ' def __init__(self, out=sys.stdout): self.out = out def __call__(self, *args, **kwds): savesep = self.sep try: self.sep = kwds['sep'] except KeyError: pass for arg in args[:1]: self.out.write(str(arg)) for arg in args[1:]: self.out.write(self.sep) self.out.write(str(arg)) self.sep = savesep def ln(self, *args, **kwds): self(*args, **kwds) self.out.write(self.newline) # default "builtin" instance write = Print() # could be print in place of write in python 3k. # standard printing write.ln(1, 2, 3) # print without spaces write.ln(1, 2, 3, sep='') # print comma separated write.ln(1, 2, 3, sep=', ') # or write.sep = ', ' # remain until changed write.ln(1, 2, 3) write.ln(4, 5, 6) write.sep = ' ' # print without trailing newline write(1, 2, 3) # print to a different stream printerr = Print(sys.stderr) printerr.ln(1, 2, 3) # print a simple sequence write.ln(*range(10)) # Print a generator expression write.ln(*(x*x for x in range(10))) # print to file f = open('printout.txt','w') fileprint = Print(f) fileprint("hello world\n") f.close() From paolo_veronelli at libero.it Sat Sep 3 13:31:26 2005 From: paolo_veronelli at libero.it (Paolino) Date: Sat, 03 Sep 2005 13:31:26 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <8393fff0509021845427b0ba3@mail.gmail.com> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <5.1.1.6.0.20050902212100.0339ccf0@mail.telecommunity.com> <8393fff0509021845427b0ba3@mail.gmail.com> Message-ID: <4319898E.2070604@libero.it> Martin Blais wrote: > On 9/2/05, Phillip J. Eby wrote: > >>At 11:02 AM 9/3/2005 +1000, Nick Coghlan wrote: >> >>>Printing the items in a sequence also becomes straightforward: >>> >>>print " ".join(map(str, range(10))) => output(*range(10)) >>> >>>Playing well with generator expressions comes for free, too: >>> >>>print " ".join(str(x*x) for x in range(10)) >>> => output(*(x*x for x in range(10))) >> >>An implementation issue: that generator expression will get expanded into a >>tuple, so you shouldn't use that for outputting large sequences. > > > Then how about:: > > output(*(x*x for x in range(10)), iter=1) > Illegal in python2.4.(Wrongly ?) And makes the star solution half unuseful. >>> def f(*args,**kwargs): ... pass ... >>> f(*(1,2,3),iter=True) File " ", line 1 f(*(1,2,3),iter=True) Leaving out what I just asserted in the previous thread :( I suppose you meant output((x*x for x in range(10)), iter=1) f(1,[2,3],(_ for _ in (4,5)),iter=True) Regards Paolino From ncoghlan at gmail.com Sat Sep 3 12:36:52 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 03 Sep 2005 20:36:52 +1000 Subject: [Python-Dev] New Wiki page - PrintAsFunction In-Reply-To: <43195EB1.3090406@ronadam.com> References: <431900A6.6000406@iinet.net.au> <43195EB1.3090406@ronadam.com> Message-ID: <43197CC4.8020005@gmail.com> Ron Adam wrote: > Nick Coghlan wrote: > >>All, >> >>I put up a Wiki page for the idea of replacing the print statement with an >>easier to use builtin: >> >>http://wiki.python.org/moin/PrintAsFunction >> >>Cheers, >>Nick. > > > Looks like a good start, much better than just expressing opinions. :-) > > > How about making it a class? That's what sys.stdout is for :) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From tjreedy at udel.edu Sat Sep 3 13:21:19 2005 From: tjreedy at udel.edu (Terry Reedy) Date: Sat, 3 Sep 2005 07:21:19 -0400 Subject: [Python-Dev] New Wiki page - PrintAsFunction References: <431900A6.6000406@iinet.net.au> <43195EB1.3090406@ronadam.com> Message-ID: "Ron Adam" wrote in message news:43195EB1.3090406 at ronadam.com... > # standard printing > write.ln(1, 2, 3) > # print without trailing newline > write(1, 2, 3) This violates this design principle: When there are two options and one is overwhelmingly more common in use (in this case, with newline added, at least 95%) the common case should be easier, not harder. Terry J. Reedy From ncoghlan at gmail.com Sat Sep 3 14:26:25 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 03 Sep 2005 22:26:25 +1000 Subject: [Python-Dev] New Wiki page - PrintAsFunction In-Reply-To: References: <431900A6.6000406@iinet.net.au> <43195EB1.3090406@ronadam.com> Message-ID: <43199671.3000404@gmail.com> Terry Reedy wrote: > "Ron Adam" wrote in message > news:43195EB1.3090406 at ronadam.com... > >># standard printing >>write.ln(1, 2, 3) > > >># print without trailing newline >>write(1, 2, 3) > > > This violates this design principle: > When there are two options and one is overwhelmingly more common in use (in > this case, with newline added, at least 95%) the common case should be > easier, not harder. Having write/writeln as builtins has that problem too (with writeln being more common, but having the less obvious and longer name), but that pair of functions is still what is currently recorded in PEP 3000 as the candidate replacement for the print statement. Unfortunately, giving "write" the behaviour of "writeln" would result in a confusing difference between "sys.stdout.write('Hello world!')" and "write('Hello world!')", where the latter appends a trailing newline, but the former doesn't. I figure the naming of any replacement function for the print statement is going to end up squarely in Guido's court, particularly given that the need for a workable transition strategy makes it difficult to use the most obvious name (i.e., print). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From p.f.moore at gmail.com Sat Sep 3 15:15:23 2005 From: p.f.moore at gmail.com (Paul Moore) Date: Sat, 3 Sep 2005 14:15:23 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> Message-ID: <79990c6b050903061575d01712@mail.gmail.com> On 9/3/05, Guido van Rossum wrote: > Wow. > > With so many people expressing a gut response and not saying what in > the proposal they don't like, it's hard to even start a response. Fair point. > Is it... > > - Going from statement to function? I thought this was a major issue, but Nick Coghlan's output() function has persuaded me otherwise. Now, I'd say I was more concerned about going from *one* statement to *six* functions (the number you explicitly referred to in your posting - 3 methods and 3 builtins - but I'd be willing to concede that the exact number needed is vague, not least because the write method already exists...) > - Losing the automatically inserted space? This was important to me. > - Having to write more to get a newline appended? Not so much "more" as "ugly" - the function name writeln reminds me of Pascal (not a good thing!), and an explicit "\n" obscures the main intent of the code. > - Losing the name 'print'? Not a big deal, but it seems gratuitous. > Some responses seemed to have missed (or perhaps for stronger > rhetorical effect intentionally neglected) that I was proposing > builtins in addition to the stream methods, The opposite - to me, that just increases the number of proposed functions, which is one of my objections :-) > I'd like to be flexible on all points *except* the syntax -- I really > want to get rid of print as a *statement*. OK, how about a *single* builtin, named "print", which works something like Nick Coghlan's proposal (I'm happy to fiddle with the details, but the basic principle is that it can do all the variations the print statement can currently do - plus extra, in the case of Nick's code). It should rely solely on a stream having a "write" method (so there's no change to the "file-like object" interface, and existing objects don't need to be changed to support the new proposal). If you really want a stream.print method, I can cope, as long as it's clear that it's an *optional* part of the file-like interface - after all, it's a convenience method only. A mixin providing it might work, but I've no idea how you'd do a mixin which file-like objects implemented in C could use... A name other than "print" for the new builtin has the benefit that it could be introduced now, with Python 3.0 merely removing the print statement in its favour. But there isn't really a name I like as much as "print", and at least you *know* that no-one is using variable names that would hide a print builtin :-) > Consider this: if Python *didn't* have a print statement, but it had a > built-in function with the same functionality (including, say, keyword > parameters to suppress the trailing newline or the space between > items); would anyone support a proposal to make it a statement > instead? No - and if that builtin was what you had proposed, you may not have got such a negative reaction :-) Paul. From p.f.moore at gmail.com Sat Sep 3 15:35:58 2005 From: p.f.moore at gmail.com (Paul Moore) Date: Sat, 3 Sep 2005 14:35:58 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <4318F633.6050501@gmail.com> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> Message-ID: <79990c6b05090306355891f450@mail.gmail.com> On 9/3/05, Nick Coghlan wrote: [...] > Playing well with generator expressions comes for free, too: > > print " ".join(str(x*x) for x in range(10)) > => output(*(x*x for x in range(10))) Hmm... This prompts a coding question - is it possible to recognise which arguments to a function are generators, so that you could write output(1, 2, [3,4], (c for c in 'abc'), 'def', (5, 6)) and get 1 2 [3, 4] a b c def (5, 6) ? At the simplest level, an explicit check for types.GeneratorType would work, but I'm not sure if there's a more general check that might might work - for example, iter((1,2,3)) may be a candidate for looping over, where (1,2,3) clearly (? :-)) isn't. Maybe "iter(arg) is arg" is the right check... Of course, there's a completely separate question as to whether magic this subtle is *advisable*... Paul. From martin.blais at gmail.com Sat Sep 3 15:55:07 2005 From: martin.blais at gmail.com (Martin Blais) Date: Sat, 3 Sep 2005 09:55:07 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <43198914.8000005@tiscali.it> References: <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <5.1.1.6.0.20050902212100.0339ccf0@mail.telecommunity.com> <8393fff0509021845427b0ba3@mail.gmail.com> <43198914.8000005@tiscali.it> Message-ID: <8393fff050903065579e2543b@mail.gmail.com> On 9/3/05, Paolino wrote: > Martin Blais wrote: > > Then how about:: > > > > output(*(x*x for x in range(10)), iter=1) > > > Illegal in python2.4.(Wrongly ?) And makes the star solution half unuseful. > > >>> def f(*args,**kwargs): > ... pass > ... > >>> f(*(1,2,3),iter=True) > File " ", line 1 > f(*(1,2,3),iter=True) > > Leaving out what I just asserted in the previous thread :( I suppose you > meant output((x*x for x in range(10)), iter=1) > > f(1,[2,3],(_ for _ in (4,5)),iter=True) Yes, that's right, my bad, I indeed meant your corrected version above (forgot to remove the star) From ncoghlan at gmail.com Sat Sep 3 16:09:02 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 04 Sep 2005 00:09:02 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <79990c6b05090306355891f450@mail.gmail.com> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> Message-ID: <4319AE7E.8020803@gmail.com> Paul Moore wrote: > Hmm... This prompts a coding question - is it possible to recognise > which arguments to a function are generators, so that you could write > > output(1, 2, [3,4], (c for c in 'abc'), 'def', (5, 6)) > > and get > > 1 2 [3, 4] a b c def (5, 6) > > ? > > At the simplest level, an explicit check for types.GeneratorType would > work, but I'm not sure if there's a more general check that might > might work - for example, iter((1,2,3)) may be a candidate for looping > over, where (1,2,3) clearly (? :-)) isn't. Maybe "iter(arg) is arg" is > the right check... > > Of course, there's a completely separate question as to whether magic > this subtle is *advisable*... If an iterator wants to behave like that, the iterator should define the appropriate __str__ method. Otherwise, just break it up into multiple lines: write(1, 2, [3,4]) write(*(c for c in 'abc')) writeln('def', (5, 6)) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From paolo_veronelli at libero.it Sat Sep 3 17:27:41 2005 From: paolo_veronelli at libero.it (Paolino) Date: Sat, 03 Sep 2005 17:27:41 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <4319AE7E.8020803@gmail.com> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> Message-ID: <4319C0ED.4060608@libero.it> Nick Coghlan wrote: > If an iterator wants to behave like that, the iterator should define the > appropriate __str__ method. Otherwise, just break it up into multiple lines: > > write(1, 2, [3,4]) > write(*(c for c in 'abc')) This cannot accept keyword args(I wonder if this is a bug), which makes it a non compatible solution with the rest of yours. > writeln('def', (5, 6)) > Regards Paolino From ncoghlan at gmail.com Sat Sep 3 16:50:35 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 04 Sep 2005 00:50:35 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <4319C0ED.4060608@libero.it> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> Message-ID: <4319B83B.8050206@gmail.com> Paolino wrote: > Nick Coghlan wrote: > >> If an iterator wants to behave like that, the iterator should define >> the appropriate __str__ method. Otherwise, just break it up into >> multiple lines: >> >> write(1, 2, [3,4]) >> write(*(c for c in 'abc')) > > This cannot accept keyword args(I wonder if this is a bug), which makes > it a non compatible solution with the rest of yours. Actually, it's an ordering quirk in the parser - the extended call syntax stuff has to come last in the function call, which means we need to put the keyword arguments at the front: Py> writeln(sep=', ', *(x*x for x in range(10))) 0, 1, 4, 9, 16, 25, 36, 49, 64, 81 I personally believe keyword arguments should be allowed between *args and **kwds at the call site, and keyword-only arguments after * in the function definition, but the current behaviour has never bothered me enough for me to look into what would be required to change it. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From barry at python.org Sat Sep 3 17:15:12 2005 From: barry at python.org (Barry Warsaw) Date: Sat, 03 Sep 2005 11:15:12 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> Message-ID: <1125760512.19993.60.camel@presto.wooz.org> On Fri, 2005-09-02 at 21:42, Guido van Rossum wrote: > With so many people expressing a gut response and not saying what in > the proposal they don't like, it's hard to even start a response. Is > it... > - Going from statement to function? So I ignored my visceral reaction against the proposal and actually converted some code in our commercial app (if I have time I might look at some code in Mailman) to try to understand why I disliked the proposal so much. I do hate having to write two parentheses -- it's more than the extra keystrokes. It's that I have to use two shifted characters and I have to be sure to close the construct, which can be a PITA when the start of the function call is separated from the end by many lines. What I found is that while this can be a real annoyance for some code, there are some beneficial trade-offs that make this palatable. I haven't read all the concrete proposals for the print() function, but assuming it's something like the logger, it's very nice not to have to do the %-operation explicitly. A very common case in my code is to have a format string followed by a bunch of arguments, and including an output stream. IWBNI could do something like this: printf("""\ ERROR: Failed to import handler %s for function %s in file %s. Improperly formed foobar string.""", handler, function, file, to=sys.stderr) The other use case is where I don't have a format string and there, a straight translation to print('WAUUGH! object:', obj, 'refcounts:', sys.getrefcount(obj)) print(' Finishing frobnication...', nl=False) isn't horrible, although this looks kind of goofy to get a blank line: print() So for permanent code, I think it's a decent trade-off. We lose something but we gain something. I'll mourn the syntax highlighting loss (or end up hacking python-mode) but oh well. I still suspect that a print function will be less friendly to newbies and casual programmers. I can't test that, but I would love it if one of the educators on this list could conduct some usability studies on the issue. I also suspect that having to use a function will every-so-slightly impede the debug-to-console use of print. I haven't played with that idea much, but I'll try it next time I'm doing something like that. > - Losing the automatically inserted space? Yes, definitely for the non-format-string variety. The two things I hate most about Java's way is having to concatenate a string and having to include the space myself. It's highly error-prone and ugly. Above all else, /please/ avoid the forehead-welt-tool which is System.out.println(). > - Having to write more to get a newline appended? Yes, definitely. In everything I've converted, it's much more common to want the newline than not. I want an easy way to suppress the newline, but I'm willing to write "nl=False" to get that. > - Losing the name 'print'? I'm mixed on this. OT1H, I like print() better than write() but OTOH, I can imagine that a decade of muscle memory will be hard to overcome. > Some responses seemed to have missed (or perhaps for stronger > rhetorical effect intentionally neglected) that I was proposing > builtins in addition to the stream methods, so that all those debug > prints would be just as easy to add as before. And I don't think I > ever said print was only for newbies! I know we'll have built-ins, but I disagree that debug prints will be just as easy. Clearly they won't be, the question is to what degree they will be harder to write and what benefit you will get in trade. If those answers are "only a little bit" and "a lot", it will probably be acceptable. > I'd like to be flexible on all points *except* the syntax -- I really > want to get rid of print as a *statement*. > > Consider this: if Python *didn't* have a print statement, but it had a > built-in function with the same functionality (including, say, keyword > parameters to suppress the trailing newline or the space between > items); would anyone support a proposal to make it a statement > instead? Probably not, but such an alternative universe is hard to imagine, so I'm not sure it would have dawned on anyone to suggest it. I think the right approach is to design and add the replacement for Python 2.x, encourage people to use it, and then see if it still warrants removal of the print statement for Python 3.0. fwiw-ly y'rs, -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050903/0e744156/attachment.pgp From guido at python.org Sat Sep 3 17:17:20 2005 From: guido at python.org (Guido van Rossum) Date: Sat, 3 Sep 2005 08:17:20 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <4319C0ED.4060608@libero.it> References: <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> Message-ID: So, another round. * Gratuitous breakage: IMO it's not gratuitous. The *extensions* to the print statement (trailing comma, >>stream) are ugly, and because it's all syntax, other extensions are hard to make. Had it been a function from the start it would have been much easier to add keyword args, for example. * Neal brought up the possibility of a conversion tool and wondered how perfect it could be. Because it's currently syntax, this is a rare case where the conversion can easily be made perfect (assuming the tool has a complete Python parser). The only thing that wouldn't necessarily be translated perfectly would be code manipulating softspace directly. * The possibility of future-proofing: I don't believe that this was a major reason for the negative responses; after all, once we settle on the function names & functionality, we can add the functions to 2.5 and people can start using them at their leisure. (This was actually why I proposed different names.) * Don't break it just because it's too much like Basic or ABC? I never meant it that way. In ABC, WRITE was actually *more* like a procedure call, because procedure calls in ABC don't use parentheses. I think the old Basic wasn't such a bad language as its reputation would like to have it; it was fully interactive, and quite good for teaching. The problem that the ABC authors were mainly fighting was arbitrary limitations and lack of structured programming support -- for example, old Basic implementations often had 1- or 2-char variable names only, heavily relied on GOTO, and there were no locals. The ABC authors also made a slogan of getting rid of PEEK and POKE, but ABC never did provide a replacement for interacting with the environment or graphics. Basic's WRITE statement (in the version that I remember) had to be a statement because there were no predefined procedures -- only a few functions, all other predefined functionality was statements. Since WRITE was the only game in town, it used some syntax hacks: separating items with commas caused them to be written as 20-character-wide columns; using semicolons instead caused single spaces to appear between (it would have made more sense the other way around, but I guess they were constrained by backward compatibility, too :-). I guess Python's print statement's trailing comma reminds me of the latter feature. * Alas, writing the arguments to the print statement in parentheses is not enough to future-proof your code, even if we had a print() function that behaved right; print ('a', 'b') prints something completely diferent than print 'a', 'b'. (The former prints a tuple with quoted string literals.) The only thing that could mean the same would be print(expr) with a single expression argument. * A lot of discussion has actually focused on getting the semantics of the replacement function right, and I like a lot of what was written. Here's my own version: print() could become a built-in function that behaves roughly like the current print statement without a trailing comma; it inserts spaces between items and ends with a newline. A keyword argument (file= or to=?) can specify an alternate file to write to (default sys.stdout); all that is used is the file's write() method with one string argument. The softspace misfeature goes away. I see two different ways to support the two most-called-for additional requirements: (a) an option to avoid the trailing newline, (b) an option to avoid the space between items. One way would be to give the print() call additional keyword arguments. For example, sep="//" would print double slashes between the items, and sep="" would concatenate the items directly. And end="\r\n" could be used to change the newline delimiter to CRLF, while end="" would mean to suppress the newline altogther. But to me that API becomes rather klunky; I'd rather have a separate function (printbare() or write()?) that just writes its arguments as strings to sys.stdout (or to the file given with a keyword argument) without intervening spaces or trailing newline. If for example you want the intervening spaces but not the trailing newline, sorry, you're going to have to write the spaces yourself, which is no big deal IMO. The new API is still much easier to use than what you have to do currently for more control (sys.stdout.write()). If there's demand, we could also introduce printf(), which would work just like C's printf() except it takes a keyword argument to redirect the output. It would be easier from a future-proofing standpoint if the main function *wasn't* called print; but people seem to react intuitively to the name change, and there are other approaches available (like a conversion program or running P2 programs in the P3 VM using a backwards compatible parser+translator). Maybe someone can work this into the Wiki? (http://wiki.python.org/moin/PrintAsFunction) As I said, I'm flexible on all the details but I really want to get rid of the statement syntax for this functionality. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From barry at python.org Sat Sep 3 17:32:09 2005 From: barry at python.org (Barry Warsaw) Date: Sat, 03 Sep 2005 11:32:09 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <79990c6b050903061575d01712@mail.gmail.com> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <79990c6b050903061575d01712@mail.gmail.com> Message-ID: <1125761529.19992.71.camel@presto.wooz.org> On Sat, 2005-09-03 at 09:15, Paul Moore wrote: > OK, how about a *single* builtin, named "print", which works something > like Nick Coghlan's proposal (I'm happy to fiddle with the details, So I've now read Nick's wiki page and here are my comments: First, while I think you'll need two builtins, they won't be distinguished by their end-of-line behavior. That is easily handled by a keyword argument. More important IMO will be the need to distinguish whether you want a format string or not. The two use cases I came up with (and posted about previously) are: print 'obj:', obj, 'refcounts', sys.getrefcount(obj) print 'obj: %s, refcounts: %s' % (obj, sys.getrefcount(obj)) Despite that these look superficially equivalent, they really aren't. The problem is that if you used one function, you'd have to make the format string selectable by keyword argument. But that's really ugly because the focus of the operation /is/ the format string, and you really want that to come first, not last, in the function call order. So the alternative is to do some magical interpretation of the first argument to decide whether it's a format string or not. Ick! So I think it's best to have two builtins: print(*args, **kws) printf(fmt, *args, **kws) I would also /require/ that any behavior changing keyword arguments /not/ be magically inferred from the positional arguments. So you'd have to explicitly spell 'nl=False' or "stream=fp" if that's what you wanted. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050903/38d25d17/attachment.pgp From raymond.hettinger at verizon.net Sat Sep 3 17:32:15 2005 From: raymond.hettinger at verizon.net (Raymond Hettinger) Date: Sat, 03 Sep 2005 11:32:15 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1125761529.19992.71.camel@presto.wooz.org> Message-ID: <002001c5b09c$aaf50be0$8c0ba044@oemcomputer> [Barry Warsaw] > I think it's best to have two builtins: > > print(*args, **kws) > printf(fmt, *args, **kws) > > I would also /require/ that any behavior changing keyword arguments > /not/ be magically inferred from the positional arguments. So you'd > have to explicitly spell 'nl=False' or "stream=fp" if that's what you > wanted. Good improvements. Raymond From barry at python.org Sat Sep 3 18:50:41 2005 From: barry at python.org (Barry Warsaw) Date: Sat, 03 Sep 2005 12:50:41 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> Message-ID: <1125766241.19994.82.camel@presto.wooz.org> On Sat, 2005-09-03 at 11:17, Guido van Rossum wrote: > I see two different ways to support the two most-called-for additional > requirements: (a) an option to avoid the trailing newline, (b) an > option to avoid the space between items. See a (very quick and very dirty ;) strawman that I just posted to the wiki. I think this has some interesting semantics, including the ability to control the separator inline in a C++-like fashion. The writef() version also accepts string.Templates or %s-strings as its first argument. I'm not sure I like reserving 'to' and 'nl' keyword arguments, and not having the ability to print Separator instances directly, but OTOH maybe those aren't big deals. Anyway, this is close to what (I think) I'd like to see in the proposed built-ins. I'm out of time for now, so I'll check back later for all the derision and mocking. :) -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050903/42f6a56d/attachment.pgp From foom at fuhm.net Sat Sep 3 18:51:54 2005 From: foom at fuhm.net (James Y Knight) Date: Sat, 3 Sep 2005 12:51:54 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1125761529.19992.71.camel@presto.wooz.org> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <79990c6b050903061575d01712@mail.gmail.com> <1125761529.19992.71.camel@presto.wooz.org> Message-ID: <09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net> On Sep 3, 2005, at 11:32 AM, Barry Warsaw wrote: > So I think it's best to have two builtins: > > print(*args, **kws) > printf(fmt, *args, **kws) It seems pretty bogus to me to add a second builtin just to apply the % operator for you. I've always really liked that Python doesn't have separate xyzf functions, because formatting is an operation you can do directly on the string and pass that to any function you like. It's much cleaner... James From steven.bethard at gmail.com Sat Sep 3 19:06:48 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Sat, 3 Sep 2005 11:06:48 -0600 Subject: [Python-Dev] iterators and extended function call syntax (WAS: Replacement for print in Python 3.0) In-Reply-To: <4319019C.6010207@gmail.com> References: <4317FCCD.80702@pfdubois.com> <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <5.1.1.6.0.20050902212100.0339ccf0@mail.telecommunity.com> <4319019C.6010207@gmail.com> Message-ID: Nick Coghlan wrote: > I actually hope that extended function call syntax in Py3k will > use iterators rather than tuples so that this problem goes away. I suggested this a while back on the Python list: http://mail.python.org/pipermail/python-list/2004-December/257282.html Raymond Hettinger brought up a few pretty valid complaints, the biggest of which is that a lot of code now expects *args to be sequences, not iterators. For example, the code you posted on the Wiki[1] would break: def write(*args, **kwds): ... # may break if args iterator does not have a __len__ if not args: return ... # will break unless "args = tuple(args)" precedes it stream.write(str(args[0])) for arg in args[1:]: stream.write(sep) stream.write(str(arg)) This code would have to be rewritten to use the iterator's .next() method and try/excepts for StopIterations. It's not particularly hard, but people would have to do some relearning about *args. [1] http://wiki.python.org/moin/PrintAsFunction STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From martin.blais at gmail.com Sat Sep 3 19:12:02 2005 From: martin.blais at gmail.com (Martin Blais) Date: Sat, 3 Sep 2005 13:12:02 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1125760512.19993.60.camel@presto.wooz.org> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <1125760512.19993.60.camel@presto.wooz.org> Message-ID: <8393fff0509031012c3d83db@mail.gmail.com> On 9/3/05, Barry Warsaw wrote: > On Fri, 2005-09-02 at 21:42, Guido van Rossum wrote: > > I do hate having to write two parentheses -- it's more than the extra > keystrokes. It's that I have to use two shifted characters and I have > to be sure to close the construct, which can be a PITA when the start of > the function call is separated from the end by many lines. (defun python-abbrev-print () "Help me change old habits." (insert "print()") (backward-char 1) t) (put 'python-abbrev-print 'no-self-insert t) (define-abbrev python-mode-abbrev-table "print" "" 'python-abbrev-print) From steven.bethard at gmail.com Sat Sep 3 19:12:27 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Sat, 3 Sep 2005 11:12:27 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> Message-ID: Fredrik Lundh wrote: > Steven Bethard wrote: > > >> - Error and help messages, often with print >>sys.stderr > > > > Use the print() method of sys.stderr: > > > > sys.stderr.print('error or help message') > > so who's going to add print methods to all file-like objects? The same people that added __iter__(), next(), readline(), readlines() and writelines() to their file-like objects when technically these are all derivable from read() and write(). This is why I suggested providing a FileMixin class. In retrospect, I'm surprised we don't already have one... STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From steven.bethard at gmail.com Sat Sep 3 19:12:46 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Sat, 3 Sep 2005 11:12:46 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> Message-ID: Guido van Rossum wrote: > If there's demand, we could also introduce printf(), which would work > just like C's printf() except it takes a keyword argument to redirect > the output. I think this is probably unnecessary if string formatting becomes a function instead of the % operator (as has been suggested). I don't think that: write("""\ ERROR: Failed to import handler %s for function %s in file %s. Improperly formed foobar string.""".substitute(handler, function, file), to=sys.stderr) is really any worse than: printf("""\ ERROR: Failed to import handler %s for function %s in file %s. Improperly formed foobar string.""", handler, function, file, to=sys.stderr) STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From guido at python.org Sat Sep 3 19:32:07 2005 From: guido at python.org (Guido van Rossum) Date: Sat, 3 Sep 2005 10:32:07 -0700 Subject: [Python-Dev] iterators and extended function call syntax (WAS: Replacement for print in Python 3.0) In-Reply-To: References: <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <5.1.1.6.0.20050902212100.0339ccf0@mail.telecommunity.com> <4319019C.6010207@gmail.com> Message-ID: On 9/3/05, Steven Bethard wrote: > Nick Coghlan wrote: > > I actually hope that extended function call syntax in Py3k will > > use iterators rather than tuples so that this problem goes away. > > I suggested this a while back on the Python list: > http://mail.python.org/pipermail/python-list/2004-December/257282.html > > Raymond Hettinger brought up a few pretty valid complaints, [...] What he said. There's no way this is going to happen. If you want to have a function that takes an iterator and you want to pass it an iterator, just do that -- don't use the *args notation. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Sat Sep 3 19:37:57 2005 From: guido at python.org (Guido van Rossum) Date: Sat, 3 Sep 2005 10:37:57 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <4319B83B.8050206@gmail.com> References: <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <4319B83B.8050206@gmail.com> Message-ID: On 9/3/05, Nick Coghlan wrote: > Actually, it's an ordering quirk in the parser - the extended call syntax > stuff has to come last in the function call, which means we need to put the > keyword arguments at the front: > > Py> writeln(sep=', ', *(x*x for x in range(10))) > 0, 1, 4, 9, 16, 25, 36, 49, 64, 81 > > I personally believe keyword arguments should be allowed between *args and > **kwds at the call site, and keyword-only arguments after * in the function > definition, but the current behaviour has never bothered me enough for me to > look into what would be required to change it. Same here. If anyone wants to give it a try, please go ahead! -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Sat Sep 3 19:49:09 2005 From: guido at python.org (Guido van Rossum) Date: Sat, 3 Sep 2005 10:49:09 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> Message-ID: OK. Now that we've got the emotions under control somewhat, maybe a few folks can go off and write up a PEP for a print-replacement. I nominate Barry and Nick since they seem to be motivated; anyone who thinks their view is important and won't be taken into account enough by those two ought to speak up now and volunteer as a co-author. I suggest the wiki as a place for working out drafts. I'm pulling out of the discussion until I see a draft PEP. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From pje at telecommunity.com Sat Sep 3 20:09:09 2005 From: pje at telecommunity.com (Phillip J. Eby) Date: Sat, 03 Sep 2005 14:09:09 -0400 Subject: [Python-Dev] Asynchronous use of Traceback objects In-Reply-To: <60ed19d40509030124730b8f5b@mail.gmail.com> Message-ID: <5.1.1.6.0.20050903140429.01b6ba58@mail.telecommunity.com> At 06:24 PM 9/3/2005 +1000, Christopher Armstrong wrote: >For example, perhaps a better idea would be to >change the traceback-printing functions to use Python attribute lookup >instead of internal structure lookup, and then change raise to accept >arbitrary Python objects as its third argument, as long as it matches >the traceback interface. Given that traceback printing isn't a performance-critical activity, there probably isn't a reason any more for requiring a particular C layout. On the other hand, being able to create frame or traceback instances or subclasses would probably also solve your problem, without having to do too much hacking on the C code that expects a particular layout. From p.f.moore at gmail.com Sat Sep 3 20:42:57 2005 From: p.f.moore at gmail.com (Paul Moore) Date: Sat, 3 Sep 2005 19:42:57 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <79990c6b050903061575d01712@mail.gmail.com> <1125761529.19992.71.camel@presto.wooz.org> <09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net> Message-ID: <79990c6b050903114260d2e4af@mail.gmail.com> On 9/3/05, James Y Knight wrote: > > On Sep 3, 2005, at 11:32 AM, Barry Warsaw wrote: > > > So I think it's best to have two builtins: > > > > print(*args, **kws) > > printf(fmt, *args, **kws) > > It seems pretty bogus to me to add a second builtin just to apply the > % operator for you. I've always really liked that Python doesn't have > separate xyzf functions, because formatting is an operation you can > do directly on the string and pass that to any function you like. > It's much cleaner... I have to agree. While I accept that Barry has genuine use cases for the printf form, I don't quite see why %-formatting isn't enough. Is the print-plus-% form so much less readable and/or maintainable? Paul. From gjc at inescporto.pt Sat Sep 3 22:01:24 2005 From: gjc at inescporto.pt (Gustavo J. A. M. Carneiro) Date: Sat, 03 Sep 2005 21:01:24 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <79990c6b050903114260d2e4af@mail.gmail.com> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <79990c6b050903061575d01712@mail.gmail.com> <1125761529.19992.71.camel@presto.wooz.org> <09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net> <79990c6b050903114260d2e4af@mail.gmail.com> Message-ID: <1125777684.7886.34.camel@localhost.localdomain> On Sat, 2005-09-03 at 19:42 +0100, Paul Moore wrote: > On 9/3/05, James Y Knight wrote: > > > > On Sep 3, 2005, at 11:32 AM, Barry Warsaw wrote: > > > > > So I think it's best to have two builtins: > > > > > > print(*args, **kws) > > > printf(fmt, *args, **kws) > > > > It seems pretty bogus to me to add a second builtin just to apply the > > % operator for you. I've always really liked that Python doesn't have > > separate xyzf functions, because formatting is an operation you can > > do directly on the string and pass that to any function you like. > > It's much cleaner... > > I have to agree. While I accept that Barry has genuine use cases for > the printf form, I don't quite see why %-formatting isn't enough. Is > the print-plus-% form so much less readable and/or maintainable? printf does avoid one extra set of () in many cases, making the code look and indent nicer. I take this chance to state my humble opinion. Please keep the print function print(), not writeln()! "printing stuff" is everyone's favorite anachronistic expression, even though the output doesn't go to a printer anymore. We all love it! I know Guido wanted a different name so that print() could be introduced in python 2 to allow a smooth transition to python 3, but the disadvantages in lost readability and familiarity by far outweigh the transition concerns imho. Regards. -- Gustavo J. A. M. Carneiro The universe is always one step beyond logic From kalinda at acc.umu.se Sat Sep 3 22:36:07 2005 From: kalinda at acc.umu.se (Jonny Reichwald) Date: Sat, 3 Sep 2005 22:36:07 +0200 Subject: [Python-Dev] str.strip() enhancement Message-ID: <68575462-D2A3-4046-B204-22C570E52F95@acc.umu.se> Hi, I would like to suggest a small enhancement to str.strip(). By expanding its current form, where it only takes a char list, to taking any list containing either char lists or string lists, it is possible to remove entire words from the stripped string. To clarify what I mean, here are some examples, first argument string to be stripped, second argument a list of things to strip: #A char list gives the same result as the standard strip >>> my_strip("abcdeed", "de") 'abc' #A list of strings instead >>> my_strip("abcdeed", ("ed",)) 'abcde' #The char order in the strings to be stripped are of importance >>> my_strip("abcdeed", ("ad", "eb")) 'abcdeed' Functions used in the above examples: def my_lstrip(str, list): ret_str = str[max([k == True and len(v) for (k,v) in zip ([str.startswith(e) for e in list], list)]):] if ret_str != str: return my_lstrip(ret_str, list) return str def my_rstrip(str, list): ret_str = str[:len(str)-max([k == True and len(v) for (k,v) in zip([str.endswith(e) for e in list], list)])] if ret_str != str and ret_str != False: return my_rstrip(ret_str, list) return str def my_strip(str, list): return my_lstrip(my_rstrip(str, list), list) Would this be useful for anyone else besides me? -- Jonny Reichwald From tjreedy at udel.edu Sat Sep 3 22:59:15 2005 From: tjreedy at udel.edu (Terry Reedy) Date: Sat, 3 Sep 2005 16:59:15 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <20050902142044.GA18622@discworld.dyndns.org><17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com><4318F8B1.9090701@gmail.com> <79990c6b050903061575d01712@mail.gmail.com><1125761529.19992.71.camel@presto.wooz.org><09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net><79990c6b050903114260d2e4af@mail.gmail.com> <1125777684.7886.34.camel@localhost.localdomain> Message-ID: "Gustavo J. A. M. Carneiro" wrote in message news:1125777684.7886.34.camel at localhost.localdomain... > I take this chance to state my humble opinion. Please keep the print > function print(), not writeln()! "printing stuff" is everyone's > favorite anachronistic expression, even though the output doesn't go to > a printer anymore. We all love it! I know Guido wanted a different > name so that print() could be introduced in python 2 to allow a smooth > transition to python 3, but the disadvantages in lost readability and > familiarity by far outweigh the transition concerns imho. 'prnt(' (or any other temp name) could easily be searched/replaced by 'print(' when the time comes. Terry J. Reedy From skip at pobox.com Sat Sep 3 23:02:19 2005 From: skip at pobox.com (skip@pobox.com) Date: Sat, 3 Sep 2005 16:02:19 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <17177.1963.69703.689791@montanaro.dyndns.org> Message-ID: <17178.3931.674287.314561@montanaro.dyndns.org> >> Nope, but there is a large body of code out there that does use print >> statements already. Again, I know you're prepared for breakage, but >> that doesn't necessarily mean a completely blank sheet of paper. Neal> Ideally I very much prefer that print become a function. However, Neal> the major backlash has swayed me some, if for no other reason that Neal> people are so strongly against changing it. I think from Guido's perspective the print statement is a wart. From my perspective I see it as a case of if it ain't broke, don't fix it. I'll adapt to a print function easily enough. The breakage just seems unnecessary. Neal> What if a tool existed that did the conversion? I realize that Neal> the tool is unlikely to be perfect, but what if it could do 99.9% Neal> of the job? I'm not thinking about just fixing print, but also Neal> converting iterkeys/itervalues/iteritems, xrange -> range, Neal> raw_input -> input, warning about use of input(), etc. That's a different subject altogether, especially if you are talking about more than just converting print. It should probably have its own subject and thread. I don't know what's in the "etc" part, but I've never used iter-this-n-that (their names have always seemed ugly enough that I've simply avoided them) or raw_input, I rarely use xrange, and the conversion is trivial, so the only potential benefit for me would be print, which I can probably get 90% of the way there with a couple Emacs macros. Skip From raymond.hettinger at verizon.net Sat Sep 3 23:57:09 2005 From: raymond.hettinger at verizon.net (Raymond Hettinger) Date: Sat, 03 Sep 2005 17:57:09 -0400 Subject: [Python-Dev] str.strip() enhancement In-Reply-To: <68575462-D2A3-4046-B204-22C570E52F95@acc.umu.se> Message-ID: <000501c5b0d2$6f7b4ee0$8c0ba044@oemcomputer> [Jonny Reichwald] > I would like to suggest a small enhancement to str.strip(). > By expanding its current form, where it only takes a char list, to > taking any list containing either char lists or string lists, it is > possible to remove entire words from the stripped string. . . . > Would this be useful for anyone else besides me? Probably not. Have you seen any other requests for something similar? Are there precedents in any other language? Can you point to examples of existing code other than your own that would benefit? Even if an example or two is found, it is worth complicating the API. Keep in mind the difficulties that plague str.split() -- that is what happens when a function grows beyond a single, clear, unified, cohesive concept. Raymond From janssen at parc.com Sun Sep 4 00:17:23 2005 From: janssen at parc.com (Bill Janssen) Date: Sat, 3 Sep 2005 15:17:23 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Fri, 02 Sep 2005 20:33:03 PDT." <4319196F.1060405@gmail.com> Message-ID: <05Sep3.151726pdt."58617"@synergy1.parc.xerox.com> > To me, the main objection seems to revolve around the fact that people would > like to be able to "future-proof" Python 2.x code so that it will also run on > Py3k. Nick, You seem to be dreaming. People like the "print" statement for many and varied reasons, it seems. Skip's point about gratuitous breakage is one good argument for retaining it, but by no means the main argument. Bill From jcarlson at uci.edu Sun Sep 4 00:20:52 2005 From: jcarlson at uci.edu (Josiah Carlson) Date: Sat, 03 Sep 2005 15:20:52 -0700 Subject: [Python-Dev] str.strip() enhancement In-Reply-To: <000501c5b0d2$6f7b4ee0$8c0ba044@oemcomputer> References: <68575462-D2A3-4046-B204-22C570E52F95@acc.umu.se> <000501c5b0d2$6f7b4ee0$8c0ba044@oemcomputer> Message-ID: <20050903151716.8B39.JCARLSON@uci.edu> "Raymond Hettinger" wrote: > > [Jonny Reichwald] > > I would like to suggest a small enhancement to str.strip(). > > By expanding its current form, where it only takes a char list, to > > taking any list containing either char lists or string lists, it is > > possible to remove entire words from the stripped string. > . . . > > > Would this be useful for anyone else besides me? > > Probably not. There is also the point that this functionality is a 4-line function... def trim_endings(strng, endings): for ending in endings: if ending and string.endswith(ending): return strng[:-len(ending)] - Josiah From janssen at parc.com Sun Sep 4 00:32:38 2005 From: janssen at parc.com (Bill Janssen) Date: Sat, 3 Sep 2005 15:32:38 PDT Subject: [Python-Dev] Hacking print (was: Replacement for print in Python 3.0) In-Reply-To: Your message of "Fri, 02 Sep 2005 18:42:10 PDT." Message-ID: <05Sep3.153241pdt."58617"@synergy1.parc.xerox.com> Just to add a bit more perspective (though I continue to believe that "print" should be retained as-is): In my UpLib code, I no longer use print. Instead, I typically use a variant of logging called "note" instead of print: note ([LEVEL, ] FORMAT-STRING [, *ARGS]) It works just like C printf, but uses the Python string formatting to merge the ARGS into the FORMAT-STRING. Having the printf-style formatting seems to me to outweigh the irritation of having to surround my args with parentheses (why are parentheses shifted characters?!), though having both would be great. If an integer LEVEL is provided, it is compared to the current output-level setting, and if LEVEL is *higher* than the current setting, the output is suppressed. The default LEVEL is 1. Normally, "note" writes to sys.stderr, but there are functions to set both the note-level and the note-sink. Adding the "\n" to the end of the format string seems to be just as easy as writing "noteln", and much clearer, so I've never even considered adding a "-ln" variant of this function. I think the "-ln" variants made familiar by Pascal and Java were a bad idea, on a par with the notion of a split between "text" and "binary" file opens. I might even be in favor of retiring "print" if it were replaced with a different statement, say "printf", which had the capabilities of "note", but didn't require parentheses around its arguments. Bill From janssen at parc.com Sun Sep 4 00:40:19 2005 From: janssen at parc.com (Bill Janssen) Date: Sat, 3 Sep 2005 15:40:19 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Sat, 03 Sep 2005 08:15:12 PDT." <1125760512.19993.60.camel@presto.wooz.org> Message-ID: <05Sep3.154026pdt."58617"@synergy1.parc.xerox.com> > I do hate having to write two parentheses -- it's more than the extra > keystrokes. It's that I have to use two shifted characters and I have > to be sure to close the construct, which can be a PITA when the start of > the function call is separated from the end by many lines. > What I found is that while this can be a real annoyance for some code, > there are some beneficial trade-offs that make this palatable... > So for permanent code, I think it's a decent trade-off. We lose > something but we gain something. I'll mourn the syntax highlighting > loss (or end up hacking python-mode) but oh well. Wouldn't it make sense then to replace the "print" statement with a "printf" statement? Then you'd get the formatting, and wouldn't have to type the parentheses. I don't see an argument for moving to a function; indeed, there's an argument against. What you want is a fancier print. Bill From janssen at parc.com Sun Sep 4 00:49:36 2005 From: janssen at parc.com (Bill Janssen) Date: Sat, 3 Sep 2005 15:49:36 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Sat, 03 Sep 2005 08:17:20 PDT." Message-ID: <05Sep3.154941pdt."58617"@synergy1.parc.xerox.com> Guido writes: > * Gratuitous breakage: IMO it's not gratuitous. The *extensions* to > the print statement (trailing comma, >>stream) are ugly, and because > it's all syntax, other extensions are hard to make. Had it been a > function from the start it would have been much easier to add keyword > args, for example. So here's the summary of the arguments against: two style points (trailing comma and >>stream) (from the man who approved the current decorator syntax!), and it's hard to extend. (By the way, I agree that the ">>" syntax is ugly, and IMO a bad idea in general. Shame the "@" wasn't used instead. :-) Seems pretty weak to me. Are there other args against? What baffles me is that when I read through the rest of PEP 3000, I agree with the other changes. But removing "print" sticks in my craw, and there's no real justification for it. I just don't get it. If someone said, "print" doesn't support a format argument as C printf does, I'd say that's a strong argument. But an argument for extending "print" once again, not junking it. Unless it was perhaps replaced with: >>> printf @sys.stderr %"Must output %s at once!" "important message" Bill From janssen at parc.com Sun Sep 4 01:12:16 2005 From: janssen at parc.com (Bill Janssen) Date: Sat, 3 Sep 2005 16:12:16 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Sat, 03 Sep 2005 15:49:36 PDT." <05Sep3.154941pdt."58617"@synergy1.parc.xerox.com> Message-ID: <05Sep3.161217pdt."58617"@synergy1.parc.xerox.com> Or perhaps: >>> print [with FORMAT-STRING] [>> STREAM] *ARGS as an alternative to >>> printf [@ STREAM] FORMAT-STRING *ARGS Bill From steven.bethard at gmail.com Sun Sep 4 02:03:46 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Sat, 3 Sep 2005 18:03:46 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <6161664430564208585@unknownmsgid> References: <6161664430564208585@unknownmsgid> Message-ID: Bill Janssen wrote: > So here's the summary of the arguments against: two style points > (trailing comma and >>stream) (from the man who approved the current > decorator syntax!), and it's hard to extend. (By the way, I agree that > the ">>" syntax is ugly, and IMO a bad idea in general. Shame the "@" > wasn't used instead. :-) > > Seems pretty weak to me. Are there other args against? Did you see Nick Coghlan's post? http://mail.python.org/pipermail/python-dev/2005-September/056076.html I found his arguments to be reasonably compelling. BTW, the implementation he refers to in this post is at: http://mail.python.org/pipermail/python-dev/2005-September/056075.html and the updated version is at: http://wiki.python.org/moin/PrintAsFunction STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From kalinda at acc.umu.se Sun Sep 4 02:19:13 2005 From: kalinda at acc.umu.se (Jonny Reichwald) Date: Sun, 4 Sep 2005 02:19:13 +0200 Subject: [Python-Dev] str.strip() enhancement In-Reply-To: <000501c5b0d2$6f7b4ee0$8c0ba044@oemcomputer> References: <000501c5b0d2$6f7b4ee0$8c0ba044@oemcomputer> Message-ID: <08669098-44B8-47F0-9CC9-1A62EAC8CA26@acc.umu.se> Raymond Hettinger wrote: > [Jonny Reichwald] >> Would this be useful for anyone else besides me? > > Probably not. ok > Even if an example or two is found, it is worth complicating the API. > Keep in mind the difficulties that plague str.split() -- that is what > happens when a function grows beyond a single, clear, unified, > cohesive > concept. I am not aware of these difficulties, any pointers? From an API pow, I do not think it neccessarily complicates it, but rather generalizes it in a way that may not be very usable :) I can understand that it would probably not be worth the effort though... -- Jonny Reichwald From raymond.hettinger at verizon.net Sun Sep 4 02:40:27 2005 From: raymond.hettinger at verizon.net (Raymond Hettinger) Date: Sat, 03 Sep 2005 20:40:27 -0400 Subject: [Python-Dev] str.strip() enhancement In-Reply-To: <08669098-44B8-47F0-9CC9-1A62EAC8CA26@acc.umu.se> Message-ID: <000401c5b0e9$3f0cd000$8c0ba044@oemcomputer> > > Even if an example or two is found, it is worth complicating the API. > > Keep in mind the difficulties that plague str.split() -- that is what > > happens when a function grows beyond a single, clear, unified, > > cohesive > > concept. > > I am not aware of these difficulties, any pointers? Yes. From memory, write-down what you think str.split() does. Then look at the docs and see how much you got wrong and how much you missed. A thorough answer would cover empty string behaviors, the return type, whether None is allowed, whether a keyword argument is acceptable, and the effects of using a unicode or UserString argument. For extra credit, write down the length invariant and determine whether a string.join() invariant would always hold. The str.split() API has led to countless doc revisions, invalid error reports, newsgroup discussions, and questions on the tutor list. We ought to keep it unchanged for Py3.0 just to serve as a warning to future generations ;-) > From an API pow, I do not think it neccessarily complicates it Please stop smoking crack before posting to python-dev ;-) Try updating the doc string, library reference entry, and the test suite. Be sure to specify that the proposed arguments are non-commutative and whether general iterables are allowed. Then report back that there was no change in complexity. >, but > rather generalizes it in a way that may not be very usable :) > I can understand that it would probably not be worth the effort > though... Hmm, that suggests another design principle, "If a proposer lacks faith in his or her own proposal, it is doomed." Raymond From ncoghlan at gmail.com Sun Sep 4 03:27:36 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 04 Sep 2005 11:27:36 +1000 Subject: [Python-Dev] iterators and extended function call syntax (WAS: Replacement for print in Python 3.0) In-Reply-To: References: <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <5.1.1.6.0.20050902212100.0339ccf0@mail.telecommunity.com> <4319019C.6010207@gmail.com> Message-ID: <431A4D88.6050808@gmail.com> Guido van Rossum wrote: > On 9/3/05, Steven Bethard wrote: > >>Nick Coghlan wrote: >> >>>I actually hope that extended function call syntax in Py3k will >>>use iterators rather than tuples so that this problem goes away. >> >>I suggested this a while back on the Python list: >> http://mail.python.org/pipermail/python-list/2004-December/257282.html >> >>Raymond Hettinger brought up a few pretty valid complaints, > > [...] > > What he said. There's no way this is going to happen. If you want to > have a function that takes an iterator and you want to pass it an > iterator, just do that -- don't use the *args notation. I guess that answers that, then. . . so noted on the Python 3.0 Suggestions wiki page. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From radeex at gmail.com Sun Sep 4 03:38:40 2005 From: radeex at gmail.com (Christopher Armstrong) Date: Sun, 4 Sep 2005 11:38:40 +1000 Subject: [Python-Dev] Asynchronous use of Traceback objects In-Reply-To: <5.1.1.6.0.20050903140429.01b6ba58@mail.telecommunity.com> References: <60ed19d40509030124730b8f5b@mail.gmail.com> <5.1.1.6.0.20050903140429.01b6ba58@mail.telecommunity.com> Message-ID: <60ed19d405090318386aa093bd@mail.gmail.com> On 9/4/05, Phillip J. Eby wrote: > At 06:24 PM 9/3/2005 +1000, Christopher Armstrong wrote: > >For example, perhaps a better idea would be to > >change the traceback-printing functions to use Python attribute lookup > >instead of internal structure lookup, and then change raise to accept > >arbitrary Python objects as its third argument, as long as it matches > >the traceback interface. > > Given that traceback printing isn't a performance-critical activity, there > probably isn't a reason any more for requiring a particular C layout. On > the other hand, being able to create frame or traceback instances or > subclasses would probably also solve your problem, without having to do too > much hacking on the C code that expects a particular layout. I guess the biggest difference in these two strategies, to me, is that one can be implemented in an external module while the other *requires* changes to CPython to work. So I'll do the former, i.e., writing C functions that construct traceback objects, accessible from Python. Maybe after I do that I could write a PEP (if that's necessary) on changing the traceback stuff on a more fundamental level, to allow for Python objects. putting-on-his-C-gloves-ly, -- Twisted | Christopher Armstrong: International Man of Twistery Radix | -- http://radix.twistedmatrix.com | Release Manager, Twisted Project \\\V/// | -- http://twistedmatrix.com |o O| | w----v----w-+ From ncoghlan at gmail.com Sun Sep 4 04:08:37 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 04 Sep 2005 12:08:37 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1125766241.19994.82.camel@presto.wooz.org> References: <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <1125766241.19994.82.camel@presto.wooz.org> Message-ID: <431A5725.6070500@gmail.com> Barry Warsaw wrote: > On Sat, 2005-09-03 at 11:17, Guido van Rossum wrote: > > >>I see two different ways to support the two most-called-for additional >>requirements: (a) an option to avoid the trailing newline, (b) an >>option to avoid the space between items. > > > See a (very quick and very dirty ;) strawman that I just posted to the > wiki. I think this has some interesting semantics, including the > ability to control the separator inline in a C++-like fashion. The > writef() version also accepts string.Templates or %s-strings as its > first argument. I'm not sure I like reserving 'to' and 'nl' keyword > arguments, and not having the ability to print Separator instances > directly, but OTOH maybe those aren't big deals. The latter problem is easily solved by calling str() at the point of the call so that write() never sees the actual Separator object. However, this 'inline' behaviour modification has always annoyed me in C++ - if you want this kind of control over the formatting, a format string is significantly clearer. I think your own examples from the Wiki page show this: write('obj:', obj, 'refs:', refs) write(Separator(': '), 'obj', obj, Separator(', '), 'refs', Separator(': '), refs, nl=False) write() writef('obj: %s, refs: %s', obj, refs) writef(Template('obj: $obj, refs: $refs, obj: $obj'), obj=obj, refs=refs, to=sys.stderr, nl=False) That said, looking at 'writef' suggests a different solution to me - a builtin called 'format'. The latter two examples would become: write(format('obj: %s, refs: %s', obj, refs)) write(format(Template('obj: $obj, refs: $refs, obj: $obj'), obj=obj, refs=refs), to=sys.stderr, nl=False) Separating the formatting out into a separate functions like this addresses your concern with the namespace conflict for 'to' and 'nl', and also makes the 'format' builtin more generally useful, as it can be used for cases other than direct output to a stream. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From ncoghlan at gmail.com Sun Sep 4 04:16:05 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 04 Sep 2005 12:16:05 +1000 Subject: [Python-Dev] Mixin classes in the standard library In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> Message-ID: <431A58E5.4080404@gmail.com> Steven Bethard wrote: > The same people that added __iter__(), next(), readline(), readlines() > and writelines() to their file-like objects when technically these are > all derivable from read() and write(). This is why I suggested > providing a FileMixin class. In retrospect, I'm surprised we don't > already have one... Where would we put it though? I sometimes wonder if there should be a 'mixins' module to provide a one-stop shop for finding things like DictMixin and ListMixin. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From janssen at parc.com Sun Sep 4 04:22:14 2005 From: janssen at parc.com (Bill Janssen) Date: Sat, 3 Sep 2005 19:22:14 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Sat, 03 Sep 2005 17:03:46 PDT." Message-ID: <05Sep3.192222pdt."58617"@synergy1.parc.xerox.com> Steven, > Did you see Nick Coghlan's post? > http://mail.python.org/pipermail/python-dev/2005-September/056076.html > I found his arguments to be reasonably compelling. You were already convinced on Friday, so with you, he was preaching to the choir. I'm not surprised you found those "arguments" compelling. I did not. I thought it was rather weak. The "points" he makes seem either irrelevant or style judgements, and many seem mischaracterized by the words used. Point by point: > "Print as statement" => printing sequences nicely is a pain > "Print as function" => extended call syntax deals with sequences nicely True, but I see it as a weakness in the Python string formatting language, instead of a weakness with "print". I think that print should be extended with a printf-like format argument (or replaced with a "printf" statement), and that the formatting available in this format argument should handle this complaint. > "Print as statement" => can't easily change the separator > "Print as function" => keyword argument handles the separator nicely So what? To begin with, "print" users have been "changing the separator" for years by doing string concatentation where it matters. And there's always file.write() for those who need it. > "Print as statement" => trailing comma suppresses newline by magic > "Print as function" => keyword argument handles the line terminator nicely This is a somewhat argumentative way of writing this. It would be better put as "newline emission control is performed syntactically", which I see as neutral. Style judgement. > "Print as statement" => redirection is via a magic symbol > "Print as function" => keyword argument handles redirection nicely This is a somewhat argumentative way of writing this. It would be better put as "output redirection is indicated syntactically", which I see as neutral. Style judgement. I might write this point as "Print as statement" => redirection is via a cool magic symbol "Print as function" => redirection done with a boring wordy keyword arg See what I mean? > "Print as statement" => can't easily save 'settings' for re-use > "Print as function" => can use functional.partial to create custom version So what? Who in the world thought up this as a reasonable feature for "print"? Oh, well: file a feature request and see what happens. I think what Nick really is asking for is a better print statement -- and there's no particularly good reason to move to a function to attain that end. Let's add a good format specifier to "print", instead. Bill From ncoghlan at gmail.com Sun Sep 4 04:43:48 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 04 Sep 2005 12:43:48 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <05Sep3.192222pdt."58617"@synergy1.parc.xerox.com> References: <05Sep3.192222pdt."58617"@synergy1.parc.xerox.com> Message-ID: <431A5F64.2040907@gmail.com> Bill Janssen wrote: > Steven, > > >>Did you see Nick Coghlan's post? >> http://mail.python.org/pipermail/python-dev/2005-September/056076.html >>I found his arguments to be reasonably compelling. > > > You were already convinced on Friday, so with you, he was preaching to > the choir. I'm not surprised you found those "arguments" compelling. > > I did not. > > I thought it was rather weak. The "points" he makes seem either > irrelevant or style judgements, and many seem mischaracterized by the > words used. > > Point by point: > > >>"Print as statement" => printing sequences nicely is a pain >>"Print as function" => extended call syntax deals with sequences nicely > > > True, but I see it as a weakness in the Python string formatting > language, instead of a weakness with "print". I think that print > should be extended with a printf-like format argument (or replaced > with a "printf" statement), and that the formatting available in this > format argument should handle this complaint. I agree with this point actually. There should be an "iterable" formatting code that looks something like "%[sep]i" Then "%i" % (my_seq,) would be the equivalent of "".join(my_seq), only allowing it to be easily embedded inside a larger format string. Some other examples: ("% i" % my_seq) => " ".join(my_seq) ("%, i" % my_seq) => ", ".join(my_seq) I see this as being similar to the way that "%.2f" controls the way that a floating point value is displayed. > I think what Nick really is asking for is a better print statement -- > and there's no particularly good reason to move to a function to > attain that end. Let's add a good format specifier to "print", > instead. The real driver is that Guido wants to change it, but I'm actually starting to think I like having the print statement, and what I really want is a 'format' builtin to get around the tuple-related quirks of the string mod operator, and an enhancement to the string mod operator to deal better with iterables. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From guido at python.org Sun Sep 4 05:10:31 2005 From: guido at python.org (Guido van Rossum) Date: Sat, 3 Sep 2005 20:10:31 -0700 Subject: [Python-Dev] Revising RE docs In-Reply-To: <200509021140.36101.gmccaughan@synaptics-uk.com> References: <20050830143542.niq7a9s8bsrkc8ok@login.werra.lunarpages.com> <87hdd5o5y1.fsf@tleepslib.sk.tsukuba.ac.jp> <200509021140.36101.gmccaughan@synaptics-uk.com> Message-ID: On 9/2/05, Gareth McCaughan wrote: > On Thursday 2005-09-01 18:09, Guido van Rossum wrote: > > > They *are* cached and there is no cost to using the functions instead > > of the methods unless you have so many regexps in your program that > > the cache is cleared (the limit is 100). > > Sure there is; the cost of looking them up in the cache. > > >>> import re,timeit > > >>> timeit.re=re > >>> timeit.Timer("""re.search(r"(\d*).*(\d*)", "abc123def456")""").timeit(1000000) > 7.6042091846466064 > > >>> timeit.r = re.compile(r"(\d*).*(\d*)") > >>> timeit.Timer("""r.search("abc123def456")""").timeit(1000000) > 2.6358869075775146 > > >>> timeit.Timer().timeit(1000000) > 0.091850996017456055 > > So in this (highly artificial toy) application it's about 7.5/2.5 = 3 times > faster to use the methods instead of the functions. Yeah, but the cost is a constant -- it is not related to the cost of compiling the re. (You should've shown how much it cost if you included the compilation in each search.) I haven't looked into this, but I bet the overhead you're measuring is actually the extra Python function call, not the cache lookup itself. I also notice that _compile() is needlessly written as a varargs function -- all its uses pass it exactly two arguments. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From ncoghlan at gmail.com Sun Sep 4 05:10:28 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 04 Sep 2005 13:10:28 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <431A5F64.2040907@gmail.com> References: <05Sep3.192222pdt."58617"@synergy1.parc.xerox.com> <431A5F64.2040907@gmail.com> Message-ID: <431A65A4.10400@gmail.com> Nick Coghlan wrote: > I agree with this point actually. There should be an "iterable" formatting > code that looks something like "%[sep]i" > > Then "%i" % (my_seq,) would be the equivalent of "".join(my_seq), only > allowing it to be easily embedded inside a larger format string. > > Some other examples: > ("% i" % my_seq) => " ".join(my_seq) > ("%, i" % my_seq) => ", ".join(my_seq) > > I see this as being similar to the way that "%.2f" controls the way that a > floating point value is displayed. A correction to this - such a formatting operator would need to automatically invoke str on the items in the iterable: ("%i" % (my_seq,)) => "".join(map(str, my_seq)) ("% i" % (my_seq,)) => " ".join(map(str, my_seq)) ("%, i" % (my_seq,)) => ", ".join(map(str, my_seq)) ("%(seq), i" % dict(seq=my_seq)) => ", ".join(map(str, my_seq)) Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From ncoghlan at gmail.com Sun Sep 4 05:54:15 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 04 Sep 2005 13:54:15 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <431A65A4.10400@gmail.com> References: <05Sep3.192222pdt."58617"@synergy1.parc.xerox.com> <431A5F64.2040907@gmail.com> <431A65A4.10400@gmail.com> Message-ID: <431A6FE7.3010206@gmail.com> Nick Coghlan wrote: > Nick Coghlan wrote: > >>I agree with this point actually. There should be an "iterable" formatting >>code that looks something like "%[sep]i" >> >>Then "%i" % (my_seq,) would be the equivalent of "".join(my_seq), only >>allowing it to be easily embedded inside a larger format string. >> >>Some other examples: >>("% i" % my_seq) => " ".join(my_seq) >>("%, i" % my_seq) => ", ".join(my_seq) >> >>I see this as being similar to the way that "%.2f" controls the way that a >>floating point value is displayed. > > > A correction to this - such a formatting operator would need to automatically > invoke str on the items in the iterable: > > ("%i" % (my_seq,)) => "".join(map(str, my_seq)) > ("% i" % (my_seq,)) => " ".join(map(str, my_seq)) > ("%, i" % (my_seq,)) => ", ".join(map(str, my_seq)) > ("%(seq), i" % dict(seq=my_seq)) => ", ".join(map(str, my_seq)) Hmm, 'i' is already taken. I think I'll use 'j for join' while working on a patch. The full specification of the number formatting operations is impressive, though (this is the first time I've actually read the full description of the string formatting behaviour). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From steven.bethard at gmail.com Sun Sep 4 07:54:57 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Sat, 3 Sep 2005 23:54:57 -0600 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <176641374289806004@unknownmsgid> References: <176641374289806004@unknownmsgid> Message-ID: Bill Janssen wrote: > I think what Nick really is asking for is a better print statement -- > and there's no particularly good reason to move to a function to > attain that end. Well one reason (you can judge for yourself whether it's "good" or not) is that adding more syntax to the print statement will make Python's parser more complex, while converting the print statement to a function should make Python's parser simpler. STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From t-meyer at ihug.co.nz Sun Sep 4 08:13:40 2005 From: t-meyer at ihug.co.nz (Tony Meyer) Date: Sun, 4 Sep 2005 18:13:40 +1200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Message-ID: [Nick Coghlan] > "Print as statement" => printing sequences nicely is a pain What's wrong with this? >>> print range(10) [0, 1, 2, 3, 4, 5, 6, 7, 8, 9] >>> print tuple("string") ('s', 't', 'r', 'i', 'n', 'g') This is a serious question - that's how I would expect a print function to work anyway. > "Print as statement" => can't easily change the separator [etc] To me, the point of the builtin print is that it's simple. If you want to control what separator is used, or if there is a newline at the end, or print to something that isn't sys.stdout, or some other magic, then use sys.stdout.write(). If you want to get the contents of __unicode__/__str__ of an object to stdout, which there has been overwhelming evidence is a very common task, then print is a fantastically simple and straightforward way to do that. [Terry Reedy] > For quickly adding debug prints, two extra ()s are a small burden, > but if the function were called 'out', then there would still be just five > keystrokes. But seven keypresses (assuming one is using a keyboard where you use shift to get '(' and ')'). It sounds trivial, but a print statement (i.e. no ()) looks clean and concise. I like this: while True: pass More than: while (true) {} For the same reason. This is a big plus of Python vs. C. [Guido] > Consider this: if Python *didn't* have a print statement, but > it had a built-in function with the same functionality > (including, say, keyword parameters to suppress the trailing > newline or the space between items); would anyone support a > proposal to make it a statement instead? Yes. If it didn't have the redirect stuff; I would like it more if it also didn't have the trailing comma magic. "print" is a fundamental; it deserves to be a statement :) =Tony.Meyer From t-meyer at ihug.co.nz Sun Sep 4 08:17:14 2005 From: t-meyer at ihug.co.nz (Tony Meyer) Date: Sun, 4 Sep 2005 18:17:14 +1200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Message-ID: [...] > maybe a few folks can go off and write up a PEP for a > print-replacement. [...] > I'm pulling out of the > discussion until I see a draft PEP. If there are two competing proposals, then the two groups write a PEP and counter-PEP and the PEPs duke it out. Is this still the case if proposal B is very nearly the status quo? IOW, would writing a "Future of the print statement in Python 3.0" counter PEP that kept print as a statement be appropriate? If not, other than python-dev posting (tiring out the poor summary guys <0.5 wink>), what is the thing to do? =Tony.Meyer From skink at evhr.net Sun Sep 4 10:58:47 2005 From: skink at evhr.net (Fabien Schwob) Date: Sun, 04 Sep 2005 10:58:47 +0200 Subject: [Python-Dev] bug in urlparse Message-ID: <431AB747.7050500@evhr.net> Hello, I'm using the module urlparse and I think I've found a bug in the urlparse module. When you merge an url and a link like"../../../page.html" with urljoin, the new url created keep some "../" in it. Here is an example : >>> import urlparse >>> begin = "http://www.example.com/folder/page.html" >>> end = "../../../otherpage.html" >>> urlparse.urljoin(begin, end) 'http://www.example.com/../../otherpage.html' I would more expect the following url : http://www.example.com/otherpage.html It's what is done in most web browser. So I would like to know if it's a bug or not. If it is, I would try to code and to submit a patch. -- Fabien SCHWOB From ncoghlan at gmail.com Sun Sep 4 12:07:11 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 04 Sep 2005 20:07:11 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: Message-ID: <431AC74F.2010102@gmail.com> Tony Meyer wrote: > [...] > >>maybe a few folks can go off and write up a PEP for a >>print-replacement. > > [...] > >>I'm pulling out of the >>discussion until I see a draft PEP. > > > If there are two competing proposals, then the two groups write a PEP and > counter-PEP and the PEPs duke it out. Is this still the case if proposal B > is very nearly the status quo? > > IOW, would writing a "Future of the print statement in Python 3.0" counter > PEP that kept print as a statement be appropriate? If not, other than > python-dev posting (tiring out the poor summary guys <0.5 wink>), what is > the thing to do? Keeping print as a statement is certainly one of the options I'm considering, so I don't think a counter-PEP is warranted just yet. There isn't even a PEP to be a counter to - it's all still on the Wiki at the moment. The more I play with it, the more I believe the part I have a problem with is a weakness in the string formatting for iterables. The point about not needing parentheses for conditionals where a lot of other languages require them is a good one - I'm sure I write print statements nearly as often as I write conditionals. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From ncoghlan at gmail.com Sun Sep 4 12:19:46 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 04 Sep 2005 20:19:46 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: Message-ID: <431ACA42.1020708@gmail.com> Tony Meyer wrote: > [Nick Coghlan] > >>"Print as statement" => printing sequences nicely is a pain > > > What's wrong with this? > > >>>>print range(10) > > [0, 1, 2, 3, 4, 5, 6, 7, 8, 9] > >>>>print tuple("string") > > ('s', 't', 'r', 'i', 'n', 'g') > > This is a serious question - that's how I would expect a print function to > work anyway. Py> print (x*x for x in range(10)) Oh, wait, this is what I actually meant: Py> print " ".join(map(str, (x*x for x in range(10)))) 0 1 4 9 16 25 36 49 64 81 Printing the contents of an arbitrary iterable is harder than it should be. Many iterables (including the builtin ones) have a reasonable default display, but a non-default display (e.g. linefeed separated instead of comma separated) isn't the most obvious thing to express. I thought making print a function solved that problem, but it doesn't really. So I'm currently exploring a different approach involving string formatting. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From mwh at python.net Sun Sep 4 12:30:11 2005 From: mwh at python.net (Michael Hudson) Date: Sun, 04 Sep 2005 11:30:11 +0100 Subject: [Python-Dev] Weekly Python Patch/Bug Summary In-Reply-To: <200509011712.j81HCXHG007703@bayview.thirdcreek.com> (Kurt B. Kaiser's message of "Thu, 1 Sep 2005 13:12:33 -0400 (EDT)") References: <200509011712.j81HCXHG007703@bayview.thirdcreek.com> Message-ID: <2mu0h1nn0s.fsf@starship.python.net> "Kurt B. Kaiser" writes: > Patch / Bug Summary > ___________________ > > Patches : 903 open (+551) / 5222 closed (+2324) / 6125 total (+2875) Err ... ? Cheers, mwh -- LaTeX, pah. Don't be silly. I'm using a homebrew markup system that I wrote in Common Lisp. ;-) -- Peter Seibel, comp.lang.lisp, talking about his book "Practical Lisp" From nyamatongwe at gmail.com Sun Sep 4 12:34:01 2005 From: nyamatongwe at gmail.com (Neil Hodgson) Date: Sun, 4 Sep 2005 20:34:01 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <200509021552.21149.gmccaughan@synaptics-uk.com> References: <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <200509021552.21149.gmccaughan@synaptics-uk.com> Message-ID: <50862ebd0509040334520f1430@mail.gmail.com> Gareth McCaughan: > > Interactive use is its own mode and works differently to the base > > language. To print the value of something, just type an expression. > > Doesn't do the same thing. In interactive mode, you are normally interested in the values of things, not their formatting so it does the right thing. If you need particular formatting or interpretation, you can always achieve this. > Do you have any suggestion that's as practically usable > as "print"? The print function proposal is already as usable as the print statement. When I write a print statement, I'd like to be able to redirect that to a log or GUI easily. If print is a function then its interface can be reimplemented but users can't add new statements to Python. Creation of strings containing values could be simplified as that would be applicable in many cases. I actually like being able to append to strings in Java with the second operand being stringified. Perhaps a stringify and catenate operator could be included in Python. Like this: MessageBox("a=" ? a ? "pos=" ? x?","?y) Neil From mwh at python.net Sun Sep 4 12:39:38 2005 From: mwh at python.net (Michael Hudson) Date: Sun, 04 Sep 2005 11:39:38 +0100 Subject: [Python-Dev] Asynchronous use of Traceback objects In-Reply-To: <60ed19d40509030124730b8f5b@mail.gmail.com> (Christopher Armstrong's message of "Sat, 3 Sep 2005 18:24:44 +1000") References: <60ed19d40509030124730b8f5b@mail.gmail.com> Message-ID: <2mpsrpnml1.fsf@starship.python.net> Christopher Armstrong writes: > I had the idea to create a fake Traceback object in Python that > doesn't hold references to any frame objects, but is still able to be > passed to 'raise' and formatted as tracebacks are, etc. Unfortunately, > raise does a type check on its third argument and, besides, it seems > the traceback formatting functions are very reliant on the internal > structure of traceback objects, so that didn't work. An option you may not have considered is to ditch the C code that formats tracebacks and always use traceback.py (this has a few obvious problems -- what do you do if traceback.py fails to import, what if formatting the traceback raises an error -- but nothing too horrendous, I think). Less duplication and less C code are always good things (IMHO, at least). > It does seem that I would be able to construct a passable fake > Traceback object from C code -- one that had references to fake > frames. These fake objects would only remember the original line > numbers, filenames and so forth so that traceback printing could still > work. I can try implementing this soon, but I'd just like to make sure > I'm on the right track. For example, perhaps a better idea would be to > change the traceback-printing functions to use Python attribute lookup > instead of internal structure lookup, My suggestion above would obviously acheive this bit :) > and then change raise to accept arbitrary Python objects as its > third argument, as long as it matches the traceback interface. That > would probably mean much more work, though. > > One concern is that I really don't like requiring C modules to use > Twisted; all of the ones currently in there are optional. Well, presumably this is optional too -- you only need it if you want informative tracebacks... > What's the likelihood of such a traceback-constructor getting its > way into CPython if I do implement it? I'd support more flexibility in this area. I'm not sure what the best approach is, though. Cheers, mwh -- I have *both* hands clapping, but I'm still not sure it's a sound. When I tried deciding if it were a sound while clapping only one hand, I fell off my chair. -- Peter Hansen, Zen master, comp.lang.python From guido at python.org Sun Sep 4 16:28:05 2005 From: guido at python.org (Guido van Rossum) Date: Sun, 4 Sep 2005 07:28:05 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: Message-ID: On 9/3/05, Tony Meyer wrote: > If there are two competing proposals, then the two groups write a PEP and > counter-PEP and the PEPs duke it out. Is this still the case if proposal B > is very nearly the status quo? No. The primary argument is between keeping the print statement and doing something else; only when "doing something else" is rejected we should concentrate on smaller improvements to the statement. The possibility of improving the statement isn't going to sway me. > IOW, would writing a "Future of the print statement in Python 3.0" counter > PEP that kept print as a statement be appropriate? If not, other than > python-dev posting (tiring out the poor summary guys <0.5 wink>), what is > the thing to do? In the end the process is not democratic. I don't think there's anything that can change my mind about dropping the statement. I have my preferences about the replacement too, but that's where I need others to weigh in so we make sure all the important use cases are covered. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Sun Sep 4 16:48:28 2005 From: guido at python.org (Guido van Rossum) Date: Sun, 4 Sep 2005 07:48:28 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <431AC74F.2010102@gmail.com> References: <431AC74F.2010102@gmail.com> Message-ID: On 9/4/05, Nick Coghlan wrote: > Keeping print as a statement is certainly one of the options I'm considering, > so I don't think a counter-PEP is warranted just yet. There isn't even a PEP > to be a counter to - it's all still on the Wiki at the moment. I am so far a bit disappointed by the wiki contents; I'm hoping on more of a summary of the argumentation and use cases; instead, I found wild proposals that have zero chance of making it. > The more I play with it, the more I believe the part I have a problem with is > a weakness in the string formatting for iterables. I've noticed. I think you should cool down a bit about this. Automatically consuming iterables can have serious side effects (like reading a file to the end!), which you generally want to avoid. Putting complex syntax in %xyz format strings for iterators seems like a poor choice of tool -- it is already complex and brittle. All *my* sequence printing needs are generally met by a simple for loop or ",".join(...). If that's still too much typing for you, and you really think that the use case of printing all items in an iterable is common enough to warrant standard library support, I'd suggest something along these lines: def printseq(seq, sep=" ", to=None): if to is None: to = sys.stdout # dynamic default firsttime = True for item in seq: if firsttime: firsttime = False else: printbare(sep, to=to) printbare(item, to=to) # printbare() is just a suggestion; I'm not too happy with the name. > The point about not needing parentheses for conditionals where a lot of other > languages require them is a good one - I'm sure I write print statements > nearly as often as I write conditionals. I'm sad to see that all the good software engineering habits are dropped the moment people have to type a pair of extra parentheses. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Sun Sep 4 16:55:59 2005 From: guido at python.org (Guido van Rossum) Date: Sun, 4 Sep 2005 07:55:59 -0700 Subject: [Python-Dev] bug in urlparse In-Reply-To: <431AB747.7050500@evhr.net> References: <431AB747.7050500@evhr.net> Message-ID: On 9/4/05, Fabien Schwob wrote: > Hello, > > I'm using the module urlparse and I think I've found a bug in the > urlparse module. When you merge an url and a link > like"../../../page.html" with urljoin, the new url created keep some > "../" in it. Here is an example : > > >>> import urlparse > >>> begin = "http://www.example.com/folder/page.html" > >>> end = "../../../otherpage.html" > >>> urlparse.urljoin(begin, end) > 'http://www.example.com/../../otherpage.html' You seem to be typing this from memory; the example actually gives a single set of "../", not two. > I would more expect the following url : > http://www.example.com/otherpage.html > > It's what is done in most web browser. > > So I would like to know if it's a bug or not. If it is, I would try to > code and to submit a patch. You shouldn't be giving more "../" sequences than are possible. I find the current behavior acceptable. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From skink at evhr.net Sun Sep 4 17:17:12 2005 From: skink at evhr.net (Fabien Schwob) Date: Sun, 04 Sep 2005 17:17:12 +0200 Subject: [Python-Dev] bug in urlparse In-Reply-To: References: <431AB747.7050500@evhr.net> Message-ID: <431B0FF8.7010903@evhr.net> >> >>> import urlparse >> >>> begin = "http://www.example.com/folder/page.html" >> >>> end = "../../../otherpage.html" >> >>> urlparse.urljoin(begin, end) >>'http://www.example.com/../../otherpage.html' > You seem to be typing this from memory; the example actually gives a > single set of "../", not two. No, it's a copy of an interactive session using Python 2.4.1. >>I would more expect the following url : >>http://www.example.com/otherpage.html >> >>It's what is done in most web browser. >> >>So I would like to know if it's a bug or not. If it is, I would try to >>code and to submit a patch. > You shouldn't be giving more "../" sequences than are possible. I find > the current behavior acceptable. Ok, so I would try do dev my own fonction. Mainly because on some web pages that I manipulate (for example [1]) there are more "../" than possible. [1] http://linuxfr.org/~pterjan/19252.html -- Fabien SCHWOB From guido at python.org Sun Sep 4 17:59:02 2005 From: guido at python.org (Guido van Rossum) Date: Sun, 4 Sep 2005 08:59:02 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <8328163536998793998@unknownmsgid> References: <8328163536998793998@unknownmsgid> Message-ID: On 9/3/05, Bill Janssen wrote: > So here's the summary of the arguments against: two style points > (trailing comma and >>stream) (from the man who approved the current > decorator syntax!), and it's hard to extend. (By the way, I agree that > the ">>" syntax is ugly, and IMO a bad idea in general. Shame the "@" > wasn't used instead. :-) > > Seems pretty weak to me. Are there other args against? Sure. I made the mistake of thinking that everybody knew them. But let me first summarize the arguments I've heard for keeping print as a statement: 1. It's always been there 2. We don't want to type parentheses 3. We use it a lot 4. We don't want to change our code I agree that those are strong arguments, so please hear me out. There is a theoretical argument: print is the only application-level functionality that has a statement dedicated to it. Within Python's world, syntax is generally used as a last resort, when something *can't* be done without help from the compiler. Print doesn't qualify for such an exception (quite the opposite actually). But more important to me are my own experiences exploring the boundaries of print. - I quite often come to a point in the evolution of a program where I need to change all print statements into logging calls, or calls into some other I/O or UI library. If print were a function, this would be a straightforward string replacement; as it is, finding where to add the parentheses is often a pain (the end isn't always on the same line as the start). It's even worse if there are already ">>stream" options present. Trailing commas also make this more complicated than it needs to be. - Having special syntax puts up a much larger barrier for evolution of a feature. For examle, adding printf (or changing print to printf) is a much bigger deal now that print is a statement than if it had been a built-in function: trial implementations are much more work, there are only a few people who know how to modify Python's bytecode compiler, etc. (Having printf() be a function and print remain a statement is of course a possibility, but only adds more confusion and makes printf() a second-class citizen, thereby proving my point.) - There is a distinct non-linearity in print's ease of use once you decide that you don't want to have spaces between items; you either have to switch to using sys.stdout.write(), or you have to collect all items in a string. This is not a simple transformation, consider what it takes to get rid of the spaces before the commas in this simple example: print "x =", x, ", y =", y, ", z =", z If it was a built-in function, having a built-in companion function that did a similar thing without inserting spaces and adding a newline would be the logical thing to do (or adding keyword parameters to control that behavior; but I prefer a second function); but with only print as it currently stands, you'd have to switch to something like print "x = " + str(x) + ", y = " + str(x) + ", z = " + str(z) or print "x = %s, y = %s, z = %s" % (x, y, z) neither of which is very attractive. (And don't tell me that the spaces are no big deal -- they aren't in *this* example, but they are in other situations.) - If it were a function, it would be much easier to replace it within one module (just def print(*args):...) or even throughout a program (e.g. by putting a different function in __builtin__.print). As it is, you can do this by writing a class with a write( ) method and assigning that to sys.stdout -- that's not bad, but definitely a much larger conceptual leap, and it works at a different level than print. Summarizing, my main problems with print as a statement are the transformations -- when print doesn't cut it, you have to switch to something entirely different. If it were a function the switch would feel much smoother. I find that important: things that are conceptually related should be syntactically related (within the realm of common sense, as always). -- --Guido van Rossum (home page: http://www.python.org/~guido/) From barry at python.org Sun Sep 4 18:51:02 2005 From: barry at python.org (Barry Warsaw) Date: Sun, 04 Sep 2005 12:51:02 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <79990c6b050903061575d01712@mail.gmail.com> <1125761529.19992.71.camel@presto.wooz.org> <09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net> Message-ID: <1125852662.10947.5.camel@geddy.wooz.org> On Sat, 2005-09-03 at 12:51, James Y Knight wrote: > On Sep 3, 2005, at 11:32 AM, Barry Warsaw wrote: > > > So I think it's best to have two builtins: > > > > print(*args, **kws) > > printf(fmt, *args, **kws) > > It seems pretty bogus to me to add a second builtin just to apply the > % operator for you. I've always really liked that Python doesn't have > separate xyzf functions, because formatting is an operation you can > do directly on the string and pass that to any function you like. > It's much cleaner... Actually, we probably only /need/ printf(), and certainly for C programmers (are there any of us left? ;), I think that would be a small conceptual leap. The motivation for keeping a non-formatting version is for simple cases, and beginners -- both of which use cases should not be dismissed. The reason I proposed two versions is because I'd really dislike putting the format string in any position other than the first positional argument, and I can't think of a way to definitively distinguish between whether a first arg string is or is not a format string. One possible way out is to define a string literal that creates Template strings, and then make the Template string syntax rich enough to cover today's %-substitutions. Then if the first argument is a Template, you do printf()-like output otherwise you do print()-output. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050904/e506a9c9/attachment.pgp From barry at python.org Sun Sep 4 18:52:28 2005 From: barry at python.org (Barry Warsaw) Date: Sun, 04 Sep 2005 12:52:28 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <79990c6b050903114260d2e4af@mail.gmail.com> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <79990c6b050903061575d01712@mail.gmail.com> <1125761529.19992.71.camel@presto.wooz.org> <09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net> <79990c6b050903114260d2e4af@mail.gmail.com> Message-ID: <1125852748.10955.8.camel@geddy.wooz.org> On Sat, 2005-09-03 at 14:42, Paul Moore wrote: > I have to agree. While I accept that Barry has genuine use cases for > the printf form, I don't quite see why %-formatting isn't enough. Is > the print-plus-% form so much less readable and/or maintainable? IMO, yes. I can't tell you how many times I've typo'd logger messages by switching commands and percents. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050904/36db46d7/attachment.pgp From barry at python.org Sun Sep 4 18:53:49 2005 From: barry at python.org (Barry Warsaw) Date: Sun, 04 Sep 2005 12:53:49 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <8393fff0509031012c3d83db@mail.gmail.com> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <1125760512.19993.60.camel@presto.wooz.org> <8393fff0509031012c3d83db@mail.gmail.com> Message-ID: <1125852829.10947.10.camel@geddy.wooz.org> On Sat, 2005-09-03 at 13:12, Martin Blais wrote: > (defun python-abbrev-print () > "Help me change old habits." > (insert "print()") (backward-char 1) t) > (put 'python-abbrev-print 'no-self-insert t) > (define-abbrev python-mode-abbrev-table "print" "" 'python-abbrev-print) LOL! That's a great solution for the 5 of us dinosaurs still using the One True Editor. :) -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050904/1ecd5fc2/attachment.pgp From raymond.hettinger at verizon.net Sun Sep 4 19:34:26 2005 From: raymond.hettinger at verizon.net (Raymond Hettinger) Date: Sun, 04 Sep 2005 13:34:26 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1125852662.10947.5.camel@geddy.wooz.org> Message-ID: <001a01c5b176$e5c0c400$2f12c797@oemcomputer> [Barry] > Actually, we probably only /need/ printf(), and certainly for C > programmers (are there any of us left? ;), I think that would be a small > conceptual leap. The motivation for keeping a non-formatting version is > for simple cases, and beginners -- both of which use cases should not be > dismissed. +1 on Barry's proposal for two functions, one formatted and one plain. However, I take issue with the premise that beginners do not need formatting. Almost anyone, beginner or not, needs formatting when they are working on a real application. My experience is that finance people immediately try to format their output (habits from Excel). Most are astonished at how non-trivial it is to add commas, dollar signs, brackets, and a fixed number of decimal places. So, I think beginners should be considered a key constituent for output formatting and that their needs should be accommodated as simply and broadly as possible. Raymond Finance Guy From martin.blais at gmail.com Sun Sep 4 19:59:13 2005 From: martin.blais at gmail.com (Martin Blais) Date: Sun, 4 Sep 2005 13:59:13 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: Message-ID: <8393fff050904105940ab7030@mail.gmail.com> On 9/4/05, Tony Meyer wrote: > > Yes. If it didn't have the redirect stuff; I would like it more if it also > didn't have the trailing comma magic. "print" is a fundamental; it deserves > to be a statement :) I don't know exactly what you mean by "fundamental", in opposition to your statement, I just see it as oft-used application-level code that should not live in "the language" (the set of statements that defines control flow and basic data structures) per-se, but in a library. From pinard at iro.umontreal.ca Sun Sep 4 21:22:23 2005 From: pinard at iro.umontreal.ca (=?iso-8859-1?Q?Fran=E7ois?= Pinard) Date: Sun, 4 Sep 2005 15:22:23 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <8328163536998793998@unknownmsgid> Message-ID: <20050904192223.GA10998@phenix.progiciels-bpi.ca> [Guido van Rossum] > [...] print is the only application-level functionality that has a > statement dedicated to it. Within Python's world, syntax is generally > used as a last resort, when something *can't* be done without help > from the compiler. Print doesn't qualify for such an exception (quite > the opposite actually). As I much liked Pascal in its time, `write()' and `writeln()' are nothing awkward to me, yet in Pascal, neither was a "regular" function, and the Pascal compiler had special code for parsing these two. Python functions are designed in such a way that `write()' and `writeln()' in Python could be just functions, with no special compiler stunt, and consequently, they fit even better for Python than they did for Pascal. Let's consider that `print' (or whatever) is a Python function, not negotiable. It should likely be. If people resent the parentheses that a new `print' would impose, then it might mean they would like that there is to be some way so Python functions could be be callable without parentheses in a more general way. It would represent quite a change in the syntax, and pull with it its own flurry of problems; but nevertheless, a seek for such a change might be presented as the only way for introducing `print' in Python 3K without a need for parentheses. Perl, going from version 4 to version 5, was subject to a cleanup between operators and functions which could be seen as similarly encompassing. Logo and a few others also have parentheses-less function calls, yet they may be week at handling functions as first-class objects. (And besides, I'm far from overly liking them! :-). -- Fran?ois Pinard http://pinard.progiciels-bpi.ca From rrr at ronadam.com Sun Sep 4 21:52:04 2005 From: rrr at ronadam.com (Ron Adam) Date: Sun, 04 Sep 2005 15:52:04 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <001a01c5b176$e5c0c400$2f12c797@oemcomputer> References: <001a01c5b176$e5c0c400$2f12c797@oemcomputer> Message-ID: <431B5064.3000601@ronadam.com> Raymond Hettinger wrote: > [Barry] > >>Actually, we probably only /need/ printf(), and certainly for C >>programmers (are there any of us left? ;), I think that would be a > > small > >>conceptual leap. The motivation for keeping a non-formatting version > > is > >>for simple cases, and beginners -- both of which use cases should not > > be > >>dismissed. > > > +1 on Barry's proposal for two functions, one formatted and one plain. +1 There is ... >>> '%r+%r = %r'.__mod__((1,2,3)) '1+2 = 3' Ok, not exactly what he proposed. ;-) Is there a better named method that str.__mod__() calls? > However, I take issue with the premise that beginners do not need > formatting. Almost anyone, beginner or not, needs formatting when they > are working on a real application. My experience is that finance people > immediately try to format their output (habits from Excel). Most are > astonished at how non-trivial it is to add commas, dollar signs, > brackets, and a fixed number of decimal places. So, I think beginners > should be considered a key constituent for output formatting and that > their needs should be accommodated as simply and broadly as possible. I agree, and the next thing programmers with previous experience look for is formatted input. Ok, not the very next thing. :-) Cheers, Ron From pinard at iro.umontreal.ca Sun Sep 4 22:52:23 2005 From: pinard at iro.umontreal.ca (=?iso-8859-1?Q?Fran=E7ois?= Pinard) Date: Sun, 4 Sep 2005 16:52:23 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <20050904192223.GA10998@phenix.progiciels-bpi.ca> References: <8328163536998793998@unknownmsgid> <20050904192223.GA10998@phenix.progiciels-bpi.ca> Message-ID: <20050904205223.GA14244@phenix.progiciels-bpi.ca> Let me correct two typos (I had to leave in a rush). [Fran?ois Pinard] > [...] Let's consider that `print' (or whatever) is a Python function, > not negotiable. It should likely be. The "It" refers to `print' being a Python function, not the negotiability. > Logo and a few others also have parentheses-less function > calls, yet they may be week at handling functions as first-class > objects. s/week/weak/ -- Fran?ois Pinard http://pinard.progiciels-bpi.ca From steven.bethard at gmail.com Mon Sep 5 00:08:18 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Sun, 4 Sep 2005 16:08:18 -0600 Subject: [Python-Dev] string formatting options and removing basestring.__mod__ (WAS: Replacement for print in Python 3.0) In-Reply-To: <000401c5af21$dc603000$4320c797@oemcomputer> References: <000401c5af21$dc603000$4320c797@oemcomputer> Message-ID: [Raymond Hettinger] > Actually, formatting needs to become a function. The overloading of the > arithmetic mod operator has proven to be unfortunate (if only because of > precedence issues). [Guido van Rossum] > For me, it's not so much the precedence, but the fact that "%s" % x > doesn't work as expected if x is a tuple; you'd have to write "%s" % > (x,) which is tedious. [Raymond Hettinger] > Also, the format coding scheme itself needs to be revisited. There is > no shortage of people who have taken issue with the trailing s in > %(myvar)s. [snip[] > string.Template is a bit too simplified. But perhaps it can be adapted. > We still want some way to express %r, %6.2f, etc. Since string > formatting has been around since Tim was in diapers, we should probably > start by looking at the solutions used by other languages. I was curious about what kind of options there were, so I googled around a bit. Here's what I found: Java[1] uses syntax like: %[argument_index$][flags][width][.precision]conversion which is basically the same as that of C, with positional argument specifiers. Some examples: String.format("Duke's Birthday: %1$tm %1$te,%1$tY", c); System.out.format("Local time: %tT", Calendar.getInstance()); formatter.format("%4$2s %3$2s %2$2s %1$2s", "a", "b", "c", "d") Classes can customize formatting for the 's' specifier by implementing the Formattable interface, which provides a method: formatTo(Formatter fmt, int f, int width, int precision) You can get formatted objects by calling: * The format() methods on Formatter objects * The format() methods on Strings * The format() methods on System.out, System.err, etc. .Net[2] uses syntax like: {index[,alignment][:formatString]} with examples like: String.Format("{0:dddd MMMM}", DateTime.Now) Console.WriteLine("{0:C}", MyInt) String.Format("Name = {0}, hours = {1:hh}, minutes = {1:mm}", myName, DateTime.Now) Classes can customize formatting for any specifier by implementing the ICustomFormatter interface: Format(string format, object arg, IFormatProvider formatProvider); or the IFormattable interface: ToString(string format, IFormatProvider formatProvider); You can get formatted objects by calling: * The ToString method of an IFormattable instance * The Format on Strings * The Write and WriteLine methods of Console, TextWriter, StreamWriter, etc. objects I also briefly looked at Tcl/Tk[3], Common Dylan[4], OCaml[5] and Ruby[6][7], which all appear to use C-style (or similar) formatting. I believe that Ruby, in addition to having printf and sprintf, also uses the % operator like Python does. This was a pretty small sample of languages (just the first few that showed up in google), and I didn't really look at any of them other than Java and .Net in much depth, so I've may have misunderstood some of it. That said, I think it's probably pretty reasonable to conclude that C-style formatting is the choice of a lot of other languages. (Not to imply that it therefore needs to be the choice of Python.) I understand one of the complaints about string formatting in Python is having to write the "s" on things like "%(key)s". I was hoping to get some ideas for alternatives here, but I wasn't able to find any dict-style insertion like in Python. There were a few languages (Java, Tcl) with the N$ positional-style insertion, but I don't think that helps us much. People have also been discussing a builtin format() function to replace the current % operator. Translating into Python the location of the formatting operations in the languages above suggests the following possibilities: * Have all __str__() methods take additonal formatting arguments * Add a format() builtin * Add format() methods on str and unicode objects * Add format() methods on all Python streams (files, sys.stdin, etc.) Of course, these possibilities aren't mutually exclusive, but TOOWTDI suggests that we probably shouldn't have too many of them. If people know of other languages that have a different approach to string formatting, it might be useful to see them. BTW, I intentionally didn't go into Perl's string formatting because I don't know it that well, and figured there are people on this list much more qualified than myself to present it. [1]http://java.sun.com/j2se/1.5.0/docs/api/java/util/Formatter.html [2]http://msdn.microsoft.com/library/default.asp?url=/library/en-us/cpguide/html/cpconFormattingOverview.asp [3]http://www.tcl.tk/man/tcl8.0/TclCmd/format.htm [4]http://gauss.gwydiondylan.org/books/drm/drm_57.html [5]http://caml.inria.fr/pub/docs/manual-ocaml/libref/Printf.html [6]http://www.rubycentral.com/book/ref_c_string.html#String._pc [7]http://www.rubycentral.com/book/ref_m_kernel.html#Kernel.sprintf Steve -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From sdeibel at wingware.com Mon Sep 5 00:38:52 2005 From: sdeibel at wingware.com (Stephan Deibel) Date: Sun, 4 Sep 2005 18:38:52 -0400 (EDT) Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <8328163536998793998@unknownmsgid> Message-ID: On Sun, 4 Sep 2005, Guido van Rossum wrote: > But more important to me are my own experiences exploring the > boundaries of print. > > - I quite often come to a point in the evolution of a program where I > need to change all print statements into logging calls, or calls into > some other I/O or UI library. [...] FWIW, this almost always happens to me. Although I've learned to avoid print in most cases, it was a somewhat painful lesson that seems quite at odds with how the rest of Python is designed -- usually, stuff just works and you aren't led into such design traps. - Stephan From hawk78_it at yahoo.it Mon Sep 5 00:38:23 2005 From: hawk78_it at yahoo.it (Vincenzo Di Massa) Date: Mon, 5 Sep 2005 00:38:23 +0200 Subject: [Python-Dev] New Wiki page - PrintAsFunction In-Reply-To: <43195EB1.3090406@ronadam.com> References: <431900A6.6000406@iinet.net.au> <43195EB1.3090406@ronadam.com> Message-ID: <200509050038.23812.hawk78_it@yahoo.it> Hello, This is my first post here. I like python a lot: great job people! Thank you! Alle 10:28, sabato 03 settembre 2005, Ron Adam ha scritto: > Nick Coghlan wrote: > > All, > > > > I put up a Wiki page for the idea of replacing the print statement with > > an easier to use builtin: > > > > http://wiki.python.org/moin/PrintAsFunction > > > > Cheers, > > Nick. > > Looks like a good start, much better than just expressing opinions. :-) > > > How about making it a class? I like the object idea, really a lot! > > There are several advantages such as persistent separators and being > able to have several different instances active at once. > > Cheers, > Ron > > I think savesep is unusefull. import sys class Print(object): newline = '\n' sep = ' ' def __init__(self, out=sys.stdout, println=""): self.out = out self._print=self.printNOln def __call__(self, *args, **kwds): self._print(*args, **kwds) def printNOln(self, *args, **kwds): try: sep = kwds['sep'] except KeyError: sep = self.sep for arg in args[:1]: self.out.write(str(arg)) for arg in args[1:]: self.out.write(sep) self.out.write(str(arg)) def println(self, *args, **kwds): self.printNOln(*args, **kwds) self.out.write(self.newline) > # default "builtin" instance > write = Print() # could be print in place of write in python 3k. > > write._print=write.println > > # standard printing write(1, 2, 3) > > # print without spaces write(1, 2, 3, sep='') > > # print comma separated write(1, 2, 3, sep=', ') > > # or > write.sep = ', ' # remain until changed > write(1, 2, 3) > write(4, 5, 6) > write.sep = ' ' > > # print without trailing newline write._print=write.printNOln > write(1, 2, 3) > > # print to a different stream > printerr = Print(sys.stderr) printerr._print=write.println printerr(1, 2, 3) > > # print a simple sequence write._print=write.println write(*range(10)) > > # Print a generator expression write(*(x*x for x in range(10))) > > # print to file > f = open('printout.txt','w') > fileprint = Print(f) > fileprint("hello world\n") > f.close() Does this look good? Ciao Vincenzo > > > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > http://mail.python.org/mailman/options/python-dev/hawk78_it%40yahoo.it ___________________________________ Yahoo! Mail: gratis 1GB per i messaggi e allegati da 10MB http://mail.yahoo.it From tjreedy at udel.edu Mon Sep 5 01:25:23 2005 From: tjreedy at udel.edu (Terry Reedy) Date: Sun, 4 Sep 2005 19:25:23 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <8328163536998793998@unknownmsgid> Message-ID: "Guido van Rossum" wrote in message news:ca471dc20509040859441cc1dc at mail.gmail.com... > Summarizing, my main problems with print as a statement are the > transformations -- when print doesn't cut it, you have to switch to > something entirely different. If it were a function the switch would > feel much smoother. I find that important: things that are > conceptually related should be syntactically related (within the realm > of common sense, as always). Letting go of my attachment to the status quo, I see a couple of reasons to make print syntactically a function that I had not noticed before. 1. In C, for instance, *all* I/O is done with functions. In Python, *almost all* I/O constructs are functions, but with one exception. This makes the language slightly harder to learn. Many newbies expect uniformity and many have posted code treating print as a function by adding the currently unneeded parentheses. They have to be taught the exception. 2. I/O constructs carry with them assumptions about the environment or peripherals of the computatonal entity. Print, in particular, assumes the presence of a special default character display device (ok, a stdout char stream). Making print a syntax contruct builds that assumption into the syntax. That violates separation of concern principles and makes Python slightly harder to port to systems for which that assumption is not true and for which 'print' might even be meaningless. So I disagree that printing lines of text is fundamental to computation as such. It is certainly no more fundamental than input. And I notice that no one has suggested that (raw)input should be turned into a statement ;-). Terry J. Reedy From jepler at unpythonic.net Mon Sep 5 01:38:08 2005 From: jepler at unpythonic.net (jepler@unpythonic.net) Date: Sun, 4 Sep 2005 18:38:08 -0500 Subject: [Python-Dev] bug in urlparse In-Reply-To: <431AB747.7050500@evhr.net> References: <431AB747.7050500@evhr.net> Message-ID: <20050904233804.GA2731@unpythonic.net> According to RFC 2396[1] section 5.2: g) If the resulting buffer string still begins with one or more complete path segments of "..", then the reference is considered to be in error. Implementations may handle this error by retaining these components in the resolved path (i.e., treating them as part of the final URI), by removing them from the resolved path (i.e., discarding relative levels above the root), or by avoiding traversal of the reference. If I read this right, it explicitly allows the urlparse.urljoin behavior ("handle this error by retaining these components in the resolved path"). Also see C.2. Abnormal Examples. In practice, some implementations strip leading relative symbolic elements (".", "..") after applying a relative URI calculation, based on the theory that compensating for obvious author errors is better than allowing the request to fail. Thus, the above two references will be interpreted as "http://a/g" by some implementations. Jeff [1] http://www.faqs.org/rfcs/rfc2396.html -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 189 bytes Desc: not available Url : http://mail.python.org/pipermail/python-dev/attachments/20050904/08f66318/attachment.pgp From radeex at gmail.com Mon Sep 5 02:34:00 2005 From: radeex at gmail.com (Christopher Armstrong) Date: Mon, 5 Sep 2005 10:34:00 +1000 Subject: [Python-Dev] Asynchronous use of Traceback objects In-Reply-To: <2mpsrpnml1.fsf@starship.python.net> References: <60ed19d40509030124730b8f5b@mail.gmail.com> <2mpsrpnml1.fsf@starship.python.net> Message-ID: <60ed19d4050904173437457d1b@mail.gmail.com> On 9/4/05, Michael Hudson wrote: > Christopher Armstrong writes: > > > I had the idea to create a fake Traceback object in Python that > > doesn't hold references to any frame objects, but is still able to be > > passed to 'raise' and formatted as tracebacks are, etc. Unfortunately, > > raise does a type check on its third argument and, besides, it seems > > the traceback formatting functions are very reliant on the internal > > structure of traceback objects, so that didn't work. > > An option you may not have considered is to ditch the C code that > formats tracebacks and always use traceback.py (this has a few obvious > problems -- what do you do if traceback.py fails to import, what if > formatting the traceback raises an error -- but nothing too > horrendous, I think). > > Less duplication and less C code are always good things (IMHO, at > least). The problem is, I can't tell Python to use traceback.py to format specifically these tracebacks. Or are you suggesting replacing all of Python's internal traceback printing stuff with traceback.py? I think that's a great idea, and it's what I assumed happened before I found these C-coded printing routines. On the other hand, that has the same problem that the "change to python attribute access" has, specifically that it *requires* a change to CPython itself, and can't be done in an extension module. But that's a purely selfish concern. :) I'm pretty close to getting the extension module that constructs frames, but I'm dealing with segfaults now. Man, PyFrame_New does some weird stuff. :) I may try for another day to get the extension module working, then perhaps give up and try on one of the hacking-CPython strategy. > > One concern is that I really don't like requiring C modules to use > > Twisted; all of the ones currently in there are optional. > > Well, presumably this is optional too -- you only need it if you want > informative tracebacks... Yes, that's true. -- Twisted | Christopher Armstrong: International Man of Twistery Radix | -- http://radix.twistedmatrix.com | Release Manager, Twisted Project \\\V/// | -- http://twistedmatrix.com |o O| | w----v----w-+ From greg.ewing at canterbury.ac.nz Mon Sep 5 02:39:15 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 05 Sep 2005 12:39:15 +1200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: Message-ID: <431B93B3.9050406@canterbury.ac.nz> Meyer, Tony wrote: > "print" is the best example I can think of for "practicality beats purity". > > Writing to stdout is as common in the code I write as loops - it's worth > keeping such basic functionality as elegant, simple, easy to understand, > and easy to use as possible. If writing to stdout easily were the only goal, it could be achieved by making stdout a builtin and using stdout.write(...). -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From greg.ewing at canterbury.ac.nz Mon Sep 5 02:42:59 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 05 Sep 2005 12:42:59 +1200 Subject: [Python-Dev] String views In-Reply-To: References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> <43167BDB.6010002@canterbury.ac.nz> Message-ID: <431B9493.20204@canterbury.ac.nz> Steve Holden wrote: > Since Python strings *can* contain embedded NULs, doesn't that rather > poo on the idea of passing pointers to their data to C functions as > things stand? If a Python function is clearly wrapping a C function, one doesn't expect to be able to pass strings with embedded NULs to it. Just because a Python string can contain embedded NULs doesn't mean it makes sense to use such strings in all circumstances. -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From greg.ewing at canterbury.ac.nz Mon Sep 5 03:53:01 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Mon, 05 Sep 2005 13:53:01 +1200 Subject: [Python-Dev] Pascaloid print substitute (Replacement for print in Python 3.0) In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> Message-ID: <431BA4FD.2090506@canterbury.ac.nz> Here's a non-statement print substitute that provides space insertion and newline suppression, and as a bonus it allows Pascal-style numeric formatting. Usage examples: Print["The answer is", 42] Print["Tons of spam:", n:6] Print[x:5:2, "squared is", x*x:10:4] Print["One", "Two", ...] Print["Buckle my shoe"] #---------------------------------------------------- import sys class PasFormat(object): def __init__(self, f): self.f = f def __getitem__(self, arg): #print "PasFormat.__getitem__:", arg if isinstance(arg, tuple): space = "" for item in arg: self.f.write(space) if item is Ellipsis: break self._do(item) space = " " else: self.f.write("\n") else: self._do(arg) self.f.write("\n") def _do(self, item): if isinstance(item, slice): value = item.start width = item.stop or 0 decimals = item.step else: value = item width = 0 decimals = None if decimals is not None: chars = "%*.*f" % (width, decimals, value) else: chars = "%*s" % (width, value) self.f.write(chars) Print = PasFormat(sys.stdout) if __name__ == "__main__": n = 666 x = 3.1415 Print["The answer is", 42] Print["Tons of spam:", n:6] Print[x:5:2, "squared is", x*x:10:4] Print["One", "Two", ...] Print["Buckle my shoe"] #---------------------------------------------------- -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From foom at fuhm.net Mon Sep 5 04:06:54 2005 From: foom at fuhm.net (James Y Knight) Date: Sun, 4 Sep 2005 22:06:54 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1125852662.10947.5.camel@geddy.wooz.org> References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <79990c6b050903061575d01712@mail.gmail.com> <1125761529.19992.71.camel@presto.wooz.org> <09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net> <1125852662.10947.5.camel@geddy.wooz.org> Message-ID: On Sep 4, 2005, at 12:51 PM, Barry Warsaw wrote: > On Sat, 2005-09-03 at 12:51, James Y Knight wrote: > >> On Sep 3, 2005, at 11:32 AM, Barry Warsaw wrote: >> >> >>> So I think it's best to have two builtins: >>> >>> print(*args, **kws) >>> printf(fmt, *args, **kws) >>> >> >> It seems pretty bogus to me to add a second builtin just to apply the >> % operator for you. I've always really liked that Python doesn't have >> separate xyzf functions, because formatting is an operation you can >> do directly on the string and pass that to any function you like. >> It's much cleaner... >> > > Actually, we probably only /need/ printf(), and certainly for C > programmers (are there any of us left? ;), I think that would be a > small > conceptual leap. The motivation for keeping a non-formatting > version is > for simple cases, and beginners -- both of which use cases should > not be > dismissed. No, we certainly don't /need/ printf(), as is well proven by its current absence. Having the operation of printing and the operation of string formatting be separated is good, because it means you can easily do either one without the other. I don't understand why you want to combine these two operations. If it's % you object to, then propose a fix for the actual problem: e.g. a "fmt" function for formatting strings. (Which I would also object to, because I don't believe % is a problem). But proposing "printf" just adds complication for no purpose. It leaves % as a "problem" and adds a new builtin which duplicates existing functionality. James From barry at python.org Mon Sep 5 04:07:18 2005 From: barry at python.org (Barry Warsaw) Date: Sun, 04 Sep 2005 22:07:18 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <431A5725.6070500@gmail.com> References: <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <1125766241.19994.82.camel@presto.wooz.org> <431A5725.6070500@gmail.com> Message-ID: <1125886038.10949.19.camel@geddy.wooz.org> On Sat, 2005-09-03 at 22:08, Nick Coghlan wrote: > > See a (very quick and very dirty ;) strawman that I just posted to the > > wiki. I think this has some interesting semantics, including the > > ability to control the separator inline in a C++-like fashion. The > > writef() version also accepts string.Templates or %s-strings as its > > first argument. I'm not sure I like reserving 'to' and 'nl' keyword > > arguments, and not having the ability to print Separator instances > > directly, but OTOH maybe those aren't big deals. > > The latter problem is easily solved by calling str() at the point of the call > so that write() never sees the actual Separator object. Good point. > However, this 'inline' > behaviour modification has always annoyed me in C++ - if you want this kind of > control over the formatting, a format string is significantly clearer. You're probably right about that. > Separating the formatting out into a separate functions like this addresses > your concern with the namespace conflict for 'to' and 'nl', and also makes the > 'format' builtin more generally useful, as it can be used for cases other than > direct output to a stream. The downside being that you have to type more to get the behavior you want. It does have the advantage of solving the namespace problem. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050904/b9eac741/attachment.pgp From barry at python.org Mon Sep 5 04:12:35 2005 From: barry at python.org (Barry Warsaw) Date: Sun, 04 Sep 2005 22:12:35 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <8328163536998793998@unknownmsgid> Message-ID: <1125886355.10949.22.camel@geddy.wooz.org> On Sun, 2005-09-04 at 11:59, Guido van Rossum wrote: > I agree that those are strong arguments, so please hear me out. Thanks Guido, I think your arguments are powerful too. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050904/0008f72d/attachment.pgp From barry at python.org Mon Sep 5 04:17:25 2005 From: barry at python.org (Barry Warsaw) Date: Sun, 04 Sep 2005 22:17:25 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <79990c6b050903061575d01712@mail.gmail.com> <1125761529.19992.71.camel@presto.wooz.org> <09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net> <1125852662.10947.5.camel@geddy.wooz.org> Message-ID: <1125886645.10950.27.camel@geddy.wooz.org> On Sun, 2005-09-04 at 22:06, James Y Knight wrote: > No, we certainly don't /need/ printf(), as is well proven by its > current absence. Having the operation of printing and the operation > of string formatting be separated is good, because it means you can > easily do either one without the other. I don't understand why you > want to combine these two operations. If it's % you object to, then > propose a fix for the actual problem: e.g. a "fmt" function for > formatting strings. (Which I would also object to, because I don't > believe % is a problem). But proposing "printf" just adds > complication for no purpose. It leaves % as a "problem" and adds a > new builtin which duplicates existing functionality. You can definitely argue about keeping formatting and print separate, but I think Guido and others have explained the problems with %. Also, we already have precedence in format+print in the logging package. I actually think the logging provides a nice, fairly to use interface that print-ng can be modeled on. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050904/87571f50/attachment.pgp From guido at python.org Mon Sep 5 04:32:45 2005 From: guido at python.org (Guido van Rossum) Date: Sun, 4 Sep 2005 19:32:45 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1125886645.10950.27.camel@geddy.wooz.org> References: <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <79990c6b050903061575d01712@mail.gmail.com> <1125761529.19992.71.camel@presto.wooz.org> <09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net> <1125852662.10947.5.camel@geddy.wooz.org> <1125886645.10950.27.camel@geddy.wooz.org> Message-ID: On 9/4/05, Barry Warsaw wrote: > You can definitely argue about keeping formatting and print separate, > but I think Guido and others have explained the problems with %. To reiterate, "%s" % x is unsafe if you aren't sure that x can't be a tuple -- you'd have to write "%s" % (x,) if it can be one. Also, print("%s %s" % (a, b)) looks a bit ugly with the irregular punctuation. While I'm not going so far as to want a statement dedicated to printing, I'm not against having some redundancy for such an important piece of functionality. > Also, > we already have precedence in format+print in the logging package. I > actually think the logging provides a nice, fairly to use interface that > print-ng can be modeled on. Right. I just have one additional suggestion for the logging package (not sure if it should apply to printf as well): if there's a problem with the format operator, fall back to printing the format string followed by the argument values (if any) without any formatting -- when logging, that's a much better thing to do than dying with an exception. As I said, not sure if printf() should have the same behavior; it's wort a try though. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From raymond.hettinger at verizon.net Mon Sep 5 05:14:35 2005 From: raymond.hettinger at verizon.net (Raymond Hettinger) Date: Sun, 04 Sep 2005 23:14:35 -0400 Subject: [Python-Dev] Pascaloid print substitute (Replacement for print inPython 3.0) In-Reply-To: <431BA4FD.2090506@canterbury.ac.nz> Message-ID: <002201c5b1c7$f19e0340$232dc797@oemcomputer> > Print["One", "Two", ...] > Print["Buckle my shoe"] The ellipsis was a nice touch. Raymond From nnorwitz at gmail.com Mon Sep 5 05:30:43 2005 From: nnorwitz at gmail.com (Neal Norwitz) Date: Sun, 4 Sep 2005 20:30:43 -0700 Subject: [Python-Dev] gdbinit problem Message-ID: break in gdbinit is apparently does not break a loop, but rather sets a break point. I don't know how to hit the break within lineno with a simple test case. Debugging pychecker with a C extension (matplotlib) triggers it. The only way I could see to fix it was by setting a continue flag and testing it. Does anyone know a better way to fix this problem? A patch is attached which fixes the problem for me, but I would rather check in a better solution if one exists. Any ideas? Or should I just check in the attached patch. n -- more gory details. after the patch, i get the expected results: (gdb) pystack ./pychecker/checker.py (573): _initModule ./pychecker/checker.py (537): load ./pychecker/checker.py (517): addModule ./pychecker/checker.py (574): _initModule ./pychecker/checker.py (537): load ./pychecker/checker.py (517): addModule ./pychecker/checker.py (574): _initModule ./pychecker/checker.py (540): load ./pychecker/checker.py (668): processFiles ./pychecker/checker.py (721): main ./pychecker/checker.py (741): ? before the patch, i get this: (gdb) pystack ./pychecker/checker.py (Breakpoint 1 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 2 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 3 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 4 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 5 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 6 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 7 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 8 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 9 at 0x455372: file Objects/typeobject.c, line 2012. 584): _initModule ./pychecker/checker.py (Breakpoint 10 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 11 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 12 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 13 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 14 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 15 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 16 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 17 at 0x455372: file Objects/typeobject.c, line 2012. 545): load ./pychecker/checker.py (Breakpoint 18 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 19 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 20 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 21 at 0x455372: file Objects/typeobject.c, line 2012. 521): addModule ./pychecker/checker.py (Breakpoint 22 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 23 at 0x455372: file Objects/typeobject.c, line 2012. ---Type to continue, or q to quit--- Breakpoint 24 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 25 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 26 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 27 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 28 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 29 at 0x455372: file Objects/typeobject.c, line 2012. 584): _initModule ./pychecker/checker.py (Breakpoint 30 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 31 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 32 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 33 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 34 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 35 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 36 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 37 at 0x455372: file Objects/typeobject.c, line 2012. 545): load ./pychecker/checker.py (Breakpoint 38 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 39 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 40 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 41 at 0x455372: file Objects/typeobject.c, line 2012. 521): addModule ./pychecker/checker.py (Breakpoint 42 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 43 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 44 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 45 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 46 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 47 at 0x455372: file Objects/typeobject.c, line 2012. ---Type to continue, or q to quit--- Breakpoint 48 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 49 at 0x455372: file Objects/typeobject.c, line 2012. 584): _initModule ./pychecker/checker.py (Breakpoint 50 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 51 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 52 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 53 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 54 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 55 at 0x455372: file Objects/typeobject.c, line 2012. 545): load ./pychecker/checker.py (Breakpoint 56 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 57 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 58 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 59 at 0x455372: file Objects/typeobject.c, line 2012. 671): processFiles ./pychecker/checker.py (Breakpoint 60 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 61 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 62 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 63 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 64 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 65 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 66 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 67 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 68 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 69 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 70 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 71 at 0x455372: file Objects/typeobject.c, line 2012. ---Type to continue, or q to quit--- Breakpoint 72 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 73 at 0x455372: file Objects/typeobject.c, line 2012. 735): main ./pychecker/checker.py (Breakpoint 74 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 75 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 76 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 77 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 78 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 79 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 80 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 81 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 82 at 0x455372: file Objects/typeobject.c, line 2012. Breakpoint 83 at 0x455372: file Objects/typeobject.c, line 2012. 796): ? -------------- next part -------------- A non-text attachment was scrubbed... Name: gdbinit-patch Type: application/octet-stream Size: 986 bytes Desc: not available Url : http://mail.python.org/pipermail/python-dev/attachments/20050904/6a36f28c/gdbinit-patch-0001.obj From barry at python.org Mon Sep 5 05:53:34 2005 From: barry at python.org (Barry Warsaw) Date: Sun, 04 Sep 2005 23:53:34 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <79990c6b050903061575d01712@mail.gmail.com> <1125761529.19992.71.camel@presto.wooz.org> <09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net> <1125852662.10947.5.camel@geddy.wooz.org> <1125886645.10950.27.camel@geddy.wooz.org> Message-ID: <1125892414.10955.29.camel@geddy.wooz.org> On Sun, 2005-09-04 at 22:32, Guido van Rossum wrote: > Right. I just have one additional suggestion for the logging package > (not sure if it should apply to printf as well): if there's a problem > with the format operator, fall back to printing the format string > followed by the argument values (if any) without any formatting -- > when logging, that's a much better thing to do than dying with an > exception. As I said, not sure if printf() should have the same > behavior; it's wort a try though. Cool idea, definitely worth a try. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050904/94addb68/attachment.pgp From p.f.moore at gmail.com Mon Sep 5 10:47:51 2005 From: p.f.moore at gmail.com (Paul Moore) Date: Mon, 5 Sep 2005 09:47:51 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <8328163536998793998@unknownmsgid> Message-ID: <79990c6b0509050147224130d2@mail.gmail.com> On 9/4/05, Guido van Rossum wrote: > On 9/3/05, Bill Janssen wrote: > > Seems pretty weak to me. Are there other args against? > > Sure. I made the mistake of thinking that everybody knew them. Looks like I certainly didn't. These are good points, many of which I had missed. I withdraw my objections to print-as-function. These points should be added to the wiki. If no-one else gets to it, I'll do so this evening. Paul. From fredrik at pythonware.com Mon Sep 5 12:14:52 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Mon, 5 Sep 2005 12:14:52 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <20050902142044.GA18622@discworld.dyndns.org><17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> Message-ID: Steven Bethard wrote: > >> > Use the print() method of sys.stderr: >> > >> > sys.stderr.print('error or help message') >> >> so who's going to add print methods to all file-like objects? > > The same people that added __iter__(), next(), readline(), readlines() > and writelines() to their file-like objects who did that? (you completely missed the point -- today's print mechanism works on *any* object that implements a "write" method, no just file objects. saying that "oh, all you need is to add a method" or "here's a nice mixin" doesn't give you a print replacement) From fredrik at pythonware.com Mon Sep 5 13:55:20 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Mon, 5 Sep 2005 13:55:20 +0200 Subject: [Python-Dev] Revising RE docs References: <20050830143542.niq7a9s8bsrkc8ok@login.werra.lunarpages.com><87hdd5o5y1.fsf@tleepslib.sk.tsukuba.ac.jp> <200509021140.36101.gmccaughan@synaptics-uk.com> Message-ID: Guido van Rossum wrote: > I also notice that _compile() is needlessly written as a varargs > function -- all its uses pass it exactly two arguments. that's because the function uses [1] the argument tuple as the cache key, and I wanted to make the "cache hit" path as fast as possible. (but that was back in the 1.6 days; things have changed a lot since then, so maybe someone should benchmark some alternative ways to do this under 2.4...) 1) well, it used to use it. the code was modified slightly in 2.3 to prepend the type of the pattern string; not sure why, since 8-bit and unicode patterns should be equivalent. From gmccaughan at synaptics-uk.com Mon Sep 5 14:57:34 2005 From: gmccaughan at synaptics-uk.com (Gareth McCaughan) Date: Mon, 5 Sep 2005 13:57:34 +0100 Subject: [Python-Dev] Revising RE docs In-Reply-To: References: <20050830143542.niq7a9s8bsrkc8ok@login.werra.lunarpages.com> <200509021140.36101.gmccaughan@synaptics-uk.com> Message-ID: <200509051357.35694.gmccaughan@synaptics-uk.com> Guido wrote: > > > They *are* cached and there is no cost to using the functions instead > > > of the methods unless you have so many regexps in your program that > > > the cache is cleared (the limit is 100). > > > > Sure there is; the cost of looking them up in the cache. ... > > So in this (highly artificial toy) application it's about 7.5/2.5 = 3 times > > faster to use the methods instead of the functions. > > Yeah, but the cost is a constant -- it is not related to the cost of > compiling the re. True. > (You should've shown how much it cost if you > included the compilation in each search.) Why should I have? I don't dispute that the caching helps -- I bet it helps a *lot*. I was just observing that it's not true that there's "no cost to using the functions instead of the methods". > I haven't looked into this, but I bet the overhead you're measuring is > actually the extra Python function call, not the cache lookup itself. Hmm, that's possible. But what matters in practice is how big the cost of using re.search("...","...") rather than compiling once and using the RE object's search method is, not where it comes from. -- g From fredrik at pythonware.com Mon Sep 5 15:40:57 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Mon, 5 Sep 2005 15:40:57 +0200 Subject: [Python-Dev] Revising RE docs References: <20050830143542.niq7a9s8bsrkc8ok@login.werra.lunarpages.com><87hdd5o5y1.fsf@tleepslib.sk.tsukuba.ac.jp> <200509021140.36101.gmccaughan@synaptics-uk.com> Message-ID: Am I the only who are getting mails from "iextream at naver.com" whenever I post to python-dev, btw? My Korean (?) isn't that good, so I'm not sure what they want... From tim.peters at gmail.com Mon Sep 5 16:50:56 2005 From: tim.peters at gmail.com (Tim Peters) Date: Mon, 5 Sep 2005 10:50:56 -0400 Subject: [Python-Dev] Revising RE docs In-Reply-To: References: <20050830143542.niq7a9s8bsrkc8ok@login.werra.lunarpages.com> <87hdd5o5y1.fsf@tleepslib.sk.tsukuba.ac.jp> <200509021140.36101.gmccaughan@synaptics-uk.com> Message-ID: <1f7befae05090507502b42549f@mail.gmail.com> [Fredrik Lundh] > Am I the only who are getting mails from "iextream at naver.com" > whenever I post to python-dev, btw? > > My Korean (?) isn't that good, so I'm not sure what they want... Only thing I've seen from them is one post in the archives, on June 13: http://mail.python.org/pipermail/python-dev/2005-June/054204.html Must be a secret admirer. From guido at python.org Mon Sep 5 16:53:13 2005 From: guido at python.org (Guido van Rossum) Date: Mon, 5 Sep 2005 07:53:13 -0700 Subject: [Python-Dev] gdbinit problem In-Reply-To: References: Message-ID: On 9/4/05, Neal Norwitz wrote: > break in gdbinit is apparently does not break a loop, but rather sets > a break point. I don't know how to hit the break within lineno with a > simple test case. Debugging pychecker with a C extension (matplotlib) > triggers it. > > The only way I could see to fix it was by setting a continue flag and > testing it. Does anyone know a better way to fix this problem? A > patch is attached which fixes the problem for me, but I would rather > check in a better solution if one exists. > > Any ideas? Or should I just check in the attached patch. You're probably one of the two users. :-) So don't hesitate. If the other user disagrees you two can fight it out in CVS. :) -- --Guido van Rossum (home page: http://www.python.org/~guido/) From martin.blais at gmail.com Mon Sep 5 17:16:40 2005 From: martin.blais at gmail.com (Martin Blais) Date: Mon, 5 Sep 2005 11:16:40 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <8328163536998793998@unknownmsgid> Message-ID: <8393fff0509050816b392c12@mail.gmail.com> On 9/4/05, Stephan Deibel wrote: > On Sun, 4 Sep 2005, Guido van Rossum wrote: > > But more important to me are my own experiences exploring the > > boundaries of print. > > > > - I quite often come to a point in the evolution of a program where I > > need to change all print statements into logging calls, or calls into > > some other I/O or UI library. [...] > > FWIW, this almost always happens to me. Although I've learned to > avoid print in most cases, it was a somewhat painful lesson that seems > quite at odds with how the rest of Python is designed -- usually, > stuff just works and you aren't led into such design traps. Happened to me too. However, there is an easy way out: hijack sys.stdout to forward to your logger system. I've got a web application framework that's setup like that right now, it works great (if you will not need the original print-to-stdout anymore in your program, that is). I print, it goes to the logfile. You just have to be careful where--in time-- you replace sys.stdout. cheers, From gmccaughan at synaptics-uk.com Mon Sep 5 17:48:13 2005 From: gmccaughan at synaptics-uk.com (Gareth McCaughan) Date: Mon, 5 Sep 2005 16:48:13 +0100 Subject: [Python-Dev] string formatting options and removing basestring.__mod__ (WAS: Replacement for print in Python 3.0) In-Reply-To: References: <000401c5af21$dc603000$4320c797@oemcomputer> Message-ID: <200509051648.13855.gmccaughan@synaptics-uk.com> > If people know of other languages that have a different approach to > string formatting, it might be useful to see them. Common Lisp has something broadly C-like but bigger and hairier. It includes powerful-but-confusing looping and conditional features, letting you say things like (format t "~{~^, ~A~}" 1 2 3 "wombat") which produces 1, 2, 3, wombat or -- you may wish to be sitting down before reading further -- (format t "~#[nothing~;~S~;~S and ~S~:;~@{~#[~; and~] ~S~^ ,~}~]." 1 2 3 "wombat") which produces 1, 2, 3, and wombat and also does the Right Thing with 0, 1 or 2 items. (The first argument to FORMAT, in case you were wondering, determines where the output should go. Feeding in T, as here, sends it to stdout. You can also give it an arbitrary stream, or NIL to return the formatted result as a string.) For the impressive and horrifying full story, see http://www.lisp.org/HyperSpec/Body/sec_22-3.html Most of the features of CL's formatted output are probably, shall we say, inappropriate for Python. It might still be worth a look, to see if there's anything under the rococo exterior that would fit. * Some languages have "picture" formats, where the structure of the format string more closely mimics that of the desired output. (This is true, e.g., of some Basics and of one variety of Perl output.) The trouble with this is that it limits how much information you can provide about *how* each value is to be formatted within the available space. * C++'s << operator represents another way to do formatted output. I regard it as an object lesson in bad design. -- g From gmccaughan at synaptics-uk.com Mon Sep 5 17:52:42 2005 From: gmccaughan at synaptics-uk.com (Gareth McCaughan) Date: Mon, 5 Sep 2005 16:52:42 +0100 Subject: [Python-Dev] string formatting options and removing basestring.__mod__ (WAS: Replacement for print in Python 3.0) In-Reply-To: <200509051648.13855.gmccaughan@synaptics-uk.com> References: <200509051648.13855.gmccaughan@synaptics-uk.com> Message-ID: <200509051652.42693.gmccaughan@synaptics-uk.com> I wrote: > C++'s << operator represents another way to do formatted > output. I regard it as an object lesson in bad design. ... and should add: Of course it's usually seen as being about output more than about formatting, but in fact if you want to do what Python does with "%", C with "sprintf" and Common Lisp with (format nil ...) then the Right Thing in C++ (in so far as that exists) is usually to use << with a string stream. -- g From edcjones at comcast.net Mon Sep 5 18:04:33 2005 From: edcjones at comcast.net (Edward C. Jones) Date: Mon, 05 Sep 2005 12:04:33 -0400 Subject: [Python-Dev] Example for "property" violates "Python is not a one pass compiler" Message-ID: <431C6C91.6020101@comcast.net> Here is an example from the "Python Library Reference", Section 2.1 "Built-in Functions": class C(object): def getx(self): return self.__x def setx(self, value): self.__x = value def delx(self): del self.__x x = property(getx, setx, delx, "I'm the 'x' property.") It works. But if I put the property statement first: class C(object): x = property(getx, setx, delx, "I'm the 'x' property.") def getx(self): return self.__x def setx(self, value): self.__x = value def delx(self): del self.__x I get the error: NameError: name 'getx' is not defined Does this violate the principle "Python is not a one pass compiler"? Normally I can use any method of a class anywhere in the definition of the class. From solipsis at pitrou.net Mon Sep 5 18:07:21 2005 From: solipsis at pitrou.net (Antoine Pitrou) Date: Mon, 05 Sep 2005 18:07:21 +0200 Subject: [Python-Dev] string formatting and i18n In-Reply-To: <200509051652.42693.gmccaughan@synaptics-uk.com> References: <200509051648.13855.gmccaughan@synaptics-uk.com> <200509051652.42693.gmccaughan@synaptics-uk.com> Message-ID: <1125936441.23617.4.camel@p-dvsi-418-1.rd.francetelecom.fr> Le lundi 05 septembre 2005 ? 16:52 +0100, Gareth McCaughan a ?crit : > ... and should add: Of course it's usually seen as being about > output more than about formatting, but in fact if you want > to do what Python does with "%", C with "sprintf" and > Common Lisp with (format nil ...) then the Right Thing in C++ > (in so far as that exists) is usually to use << with a string > stream. Uh, what about internationalization (i18n) ? In i18n you can't avoid the need for parameterized strings. For example I want to write : _("The file '%s' is read only") % filename not : _("The file") + " '" + filename + "' " + _("is read only") because the splitting in the second form will not translate correctly into other languages. You *have* to supply the whole non-splitted sentence to the translators. The bottom line, IMHO, is that there are frequent uses that mandate a nice and easy to use formatting operator. Python has it, let's not remove it. Regards Antoine. From pje at telecommunity.com Mon Sep 5 18:12:33 2005 From: pje at telecommunity.com (Phillip J. Eby) Date: Mon, 05 Sep 2005 12:12:33 -0400 Subject: [Python-Dev] Example for "property" violates "Python is not a one pass compiler" In-Reply-To: <431C6C91.6020101@comcast.net> Message-ID: <5.1.1.6.0.20050905120907.01b74500@mail.telecommunity.com> At 12:04 PM 9/5/2005 -0400, Edward C. Jones wrote: >Normally I can use any method of a class anywhere in the definition of >the class. Not true. You can certainly use any method of a class in any *functions* or methods defined in the body of the class. But you can't use them in the body of the class before they're defined, any more than you can subclass a class that doesn't exist yet. I'm not sure where you got the "Python is not a one pass compiler" idea; I don't recall having seen this meme anywhere before, and I don't see how it's meaningful anyway. From fredrik at pythonware.com Mon Sep 5 18:14:37 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Mon, 5 Sep 2005 18:14:37 +0200 Subject: [Python-Dev] Example for "property" violates "Python is not a onepass compiler" References: <431C6C91.6020101@comcast.net> Message-ID: Edward C. Jones write: > Here is an example from the "Python Library Reference", Section 2.1 > "Built-in Functions": > > class C(object): > def getx(self): return self.__x > def setx(self, value): self.__x = value > def delx(self): del self.__x > x = property(getx, setx, delx, "I'm the 'x' property.") > > It works. But if I put the property statement first: > > class C(object): > x = property(getx, setx, delx, "I'm the 'x' property.") > def getx(self): return self.__x > def setx(self, value): self.__x = value > def delx(self): del self.__x > > I get the error: > NameError: name 'getx' is not defined > > Does this violate the principle "Python is not a one pass compiler"? this has nothing to do with compilation; class objects are created by executing the code inside the class block. that code is only executed once (when the class statement itself is executed). > Normally I can use any method of a class anywhere in the definition of > the class. nope. you can use a method name inside a method that will be *executed* at a *later* time, but you cannot refer to names that hasn't been defined yet. (if you don't intuitively understand this, you need to read up on how namespaces work and how they are populated) From barry at python.org Mon Sep 5 18:47:04 2005 From: barry at python.org (Barry Warsaw) Date: Mon, 05 Sep 2005 12:47:04 -0400 Subject: [Python-Dev] string formatting and i18n In-Reply-To: <1125936441.23617.4.camel@p-dvsi-418-1.rd.francetelecom.fr> References: <200509051648.13855.gmccaughan@synaptics-uk.com> <200509051652.42693.gmccaughan@synaptics-uk.com> <1125936441.23617.4.camel@p-dvsi-418-1.rd.francetelecom.fr> Message-ID: <1125938824.10955.39.camel@geddy.wooz.org> On Mon, 2005-09-05 at 12:07, Antoine Pitrou wrote: > Uh, what about internationalization (i18n) ? > In i18n you can't avoid the need for parameterized strings. > For example I want to write : > _("The file '%s' is read only") % filename > not : > _("The file") + " '" + filename + "' " + _("is read only") > > because the splitting in the second form will not translate correctly > into other languages. You *have* to supply the whole non-splitted > sentence to the translators. Actually, this was part of the motivation behind PEP 292 and Template strings, because what you really want is named parameters, not positional parameters: 'The file $filename in directory $dir is read only' There are a few techniques for getting full i18n for Template strings. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050905/059efe4b/attachment.pgp From sdeibel at wingware.com Mon Sep 5 18:56:59 2005 From: sdeibel at wingware.com (Stephan Deibel) Date: Mon, 5 Sep 2005 12:56:59 -0400 (EDT) Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <8393fff0509050816b392c12@mail.gmail.com> References: <8328163536998793998@unknownmsgid> <8393fff0509050816b392c12@mail.gmail.com> Message-ID: On Mon, 5 Sep 2005, Martin Blais wrote: > However, there is an easy way out: hijack sys.stdout to forward to > your logger system. > I've got a web application framework that's setup like that right now, > it works great (if you will not need the original print-to-stdout > anymore in your program, that is). I print, it goes to the logfile. > You just have to be careful where--in time-- you replace sys.stdout. Sure, and indeed I've done that often enough but it's kind of ugly and doesn't help if you merge bodies of code where some stuff should go to a log, some to stdout, some elsewhere. Hmm, maybe I'd end up avoiding the builtin print() as well, or at least need to pass around the stream where I want output. The general problem of not tying code to a particular output stream is what I'm reacting to. - Stephan From gmccaughan at synaptics-uk.com Mon Sep 5 18:57:53 2005 From: gmccaughan at synaptics-uk.com (Gareth McCaughan) Date: Mon, 5 Sep 2005 17:57:53 +0100 Subject: [Python-Dev] string formatting and i18n In-Reply-To: <1125936441.23617.4.camel@p-dvsi-418-1.rd.francetelecom.fr> References: <200509051652.42693.gmccaughan@synaptics-uk.com> <1125936441.23617.4.camel@p-dvsi-418-1.rd.francetelecom.fr> Message-ID: <200509051757.53745.gmccaughan@synaptics-uk.com> On Monday 2005-09-05 17:07, Antoine Pitrou wrote: > Le lundi 05 septembre 2005 ? 16:52 +0100, Gareth McCaughan a ?crit : > > ... and should add: Of course it's usually seen as being about > > output more than about formatting, but in fact if you want > > to do what Python does with "%", C with "sprintf" and > > Common Lisp with (format nil ...) then the Right Thing in C++ > > (in so far as that exists) is usually to use << with a string > > stream. > > Uh, what about internationalization (i18n) ? > In i18n you can't avoid the need for parameterized strings. > For example I want to write : > _("The file '%s' is read only") % filename > not : > _("The file") + " '" + filename + "' " + _("is read only") > > because the splitting in the second form will not translate correctly > into other languages. You *have* to supply the whole non-splitted > sentence to the translators. Yes. If you think I was arguing the opposite, then I failed to communicate clearly and I apologize. > The bottom line, IMHO, is that there are frequent uses that mandate a > nice and easy to use formatting operator. Python has it, let's not > remove it. It's clear (I think) that a good way of formatting strings is necessary. It's less clear what the best way is. Python's % is pretty good; perhaps it's possible to do even better. For instance, take your I18N example. Not all languages have the same word order, as you've observed. When there's more than one parameter, Python's %-interpolation isn't enough in general; you'd need something that can reorder the parameters. I don't know whether this is worth complicating string formatting for, but it's not obvious that it isn't. -- g From s.percivall at chello.se Mon Sep 5 20:04:07 2005 From: s.percivall at chello.se (Simon Percivall) Date: Mon, 5 Sep 2005 20:04:07 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <8328163536998793998@unknownmsgid> <8393fff0509050816b392c12@mail.gmail.com> Message-ID: <5700D8AC-BEE0-47C3-9BD9-35F1D1797168@chello.se> On 5 sep 2005, at 18.56, Stephan Deibel wrote: > On Mon, 5 Sep 2005, Martin Blais wrote: > >> However, there is an easy way out: hijack sys.stdout to forward to >> your logger system. >> I've got a web application framework that's setup like that right >> now, >> it works great (if you will not need the original print-to-stdout >> anymore in your program, that is). I print, it goes to the logfile. >> You just have to be careful where--in time-- you replace sys.stdout. >> > > Sure, and indeed I've done that often enough but it's kind of ugly and > doesn't help if you merge bodies of code where some stuff should go to > a log, some to stdout, some elsewhere. > > Hmm, maybe I'd end up avoiding the builtin print() as well, or at > least need to pass around the stream where I want output. The general > problem of not tying code to a particular output stream is what I'm > reacting to. Easy, just always print to a file-like object when you think you might have to switch destination later, and control the output from there: class Out: def write(self, text): # switch to logging here, or something sys.stdout.write(text) out = Out() print >>out, "I won't have to change this statement at all!" Print being a statement or a function doesn't matter in this case. Search- replacing is a bitch either way. //Simon From pinard at iro.umontreal.ca Mon Sep 5 20:10:18 2005 From: pinard at iro.umontreal.ca (=?iso-8859-1?Q?Fran=E7ois?= Pinard) Date: Mon, 5 Sep 2005 14:10:18 -0400 Subject: [Python-Dev] string formatting and i18n In-Reply-To: <1125938824.10955.39.camel@geddy.wooz.org> References: <200509051648.13855.gmccaughan@synaptics-uk.com> <200509051652.42693.gmccaughan@synaptics-uk.com> <1125936441.23617.4.camel@p-dvsi-418-1.rd.francetelecom.fr> <1125938824.10955.39.camel@geddy.wooz.org> Message-ID: <20050905181018.GA12726@phenix.progiciels-bpi.ca> [Barry Warsaw] > Actually, this was part of the motivation behind PEP 292 and Template > strings, because what you really want is named parameters, not > positional parameters: > 'The file $filename in directory $dir is read only' > There are a few techniques for getting full i18n for Template strings. Yet, "The file %(filename)s in directory %(dir)s is read only" % vars() is already usable. The need being already filled without Template strings, it could hardly be presented as a motivation for them. :-) -- Fran?ois Pinard http://pinard.progiciels-bpi.ca From barry at python.org Mon Sep 5 20:27:25 2005 From: barry at python.org (Barry Warsaw) Date: Mon, 05 Sep 2005 14:27:25 -0400 Subject: [Python-Dev] string formatting and i18n In-Reply-To: <20050905181018.GA12726@phenix.progiciels-bpi.ca> References: <200509051648.13855.gmccaughan@synaptics-uk.com> <200509051652.42693.gmccaughan@synaptics-uk.com> <1125936441.23617.4.camel@p-dvsi-418-1.rd.francetelecom.fr> <1125938824.10955.39.camel@geddy.wooz.org> <20050905181018.GA12726@phenix.progiciels-bpi.ca> Message-ID: <1125944845.10950.51.camel@geddy.wooz.org> On Mon, 2005-09-05 at 14:10, Fran?ois Pinard wrote: > "The file %(filename)s in directory %(dir)s is read only" % vars() > > is already usable. The need being already filled without Template > strings, it could hardly be presented as a motivation for them. :-) Except that IME, %(var)s is an error-prone construct for translators. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050905/a08d049c/attachment.pgp From antoine at pitrou.net Mon Sep 5 21:20:27 2005 From: antoine at pitrou.net (Antoine) Date: Mon, 05 Sep 2005 21:20:27 +0200 Subject: [Python-Dev] Re: string formatting and i18n In-Reply-To: 1125936441.23617.4.camel@p-dvsi-418-1.rd.francetelecom.fr Message-ID: <1125948027.6074.15.camel@fsol> > Yes. If you think I was arguing the opposite, then I failed to > communicate clearly and I apologize. Actually, I didn't interpret your message like that, but as I had already seen that proposal (to suppress string formatting), I thought it would be the right time to react ;) > For instance, take your I18N example. Not all languages have the > same word order, as you've observed. When there's more than one > parameter, Python's %-interpolation isn't enough in general; > you'd need something that can reorder the parameters. I don't > know whether this is worth complicating string formatting for, > but it's not obvious that it isn't. Well, I totally agree. I think it could be nice to both: - introduce positional formatting : "%1", "%2"... - make type specification optional, since Python can figure out the type by itself and use the right method; you would only specify the type when you want to have a different formatting (for example, for floats, you could use "%g2" instead of "%2" which would be equivalent to "%f2") Regards Antoine. -- ? On dit que p?trir c'est modeler, Moi je dis que p?ter c'est d?molir. ? Stup?flip From kay.schluehr at gmx.net Mon Sep 5 22:09:00 2005 From: kay.schluehr at gmx.net (Kay Schluehr) Date: Mon, 05 Sep 2005 22:09:00 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <50862ebd050902050650c0a83@mail.gmail.com> <17176.20371.368005.307905@montanaro.dyndns.org> <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> Message-ID: Guido van Rossum wrote: > I see two different ways to support the two most-called-for additional > requirements: (a) an option to avoid the trailing newline, (b) an > option to avoid the space between items. > > One way would be to give the print() call additional keyword > arguments. For example, sep="//" would print double slashes between > the items, and sep="" would concatenate the items directly. And > end="\r\n" could be used to change the newline delimiter to CRLF, > while end="" would mean to suppress the newline altogther. > > But to me that API becomes rather klunky; I'd rather have a separate > function (printbare() or write()?) that just writes its arguments as > strings to sys.stdout (or to the file given with a keyword argument) > without intervening spaces or trailing newline. I guess there are three options: a) keyword arguments b) distributing similar functionality over several functions c) using an object for configuration In case a) I miss some visual clue. That's mostly because an arbitrary string is passed to print(). For this reason I like the current print statement in it's simplicity. b) maybe the least extendable solution but can be mixed with a) if necessary. c) is the most heavyweight solution, but can encapsulate options and is reusable: >>> Writer(sep="//").print("some","text") some//text or writer = Writer(sep="//", file=sys.stderr) writer.print("some","error-text") writer.print("another","error text") A bare print() can be considered as a call to some default_writer. Substituting the default_writer by some custom Writer object may replace the default configuration, which should be easily resetable: >>> Writer.default_writer = Writer(sep="//") >>> print("some","error-text") some//error_text >>> Writer.reset() >>> print("some","error-text") some error-text I think that reduces the weight of the object solution and enables all kind of configurations as user defined default. A lightweight print() is still possible: The print() function would be implemented like this: def print(*args): Writer.default_writer.print(*args) I appreciate very much functions that are just shortcuts for certain methods. For consistency reasons the function write() may be a better name choice then print(), but also a different name for Writer() would be an option in case of c). Kay From guido at python.org Tue Sep 6 02:52:01 2005 From: guido at python.org (Guido van Rossum) Date: Mon, 5 Sep 2005 17:52:01 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> Message-ID: On 9/5/05, Kay Schluehr wrote: > [...] is the most heavyweight solution, but can encapsulate options and is > reusable: > > >>> Writer(sep="//").print("some","text") > some//text > > or > > writer = Writer(sep="//", file=sys.stderr) > writer.print("some","error-text") > writer.print("another","error text") I am disappointed to see several proposals plunge into this type of generality (no matter how cool it is in its application of OO design patterns) without asking whether there is a need. Look at the example -- it is completely useless. I only made it up so that I could present the simpler version; I didn't have a use case myself for arbitrary delimiters. My hypothesis is that there are actually only two use cases that matter enough to be supported directly: (a) quickly print a bunch of items with spaces in between them and a trailing newline (b) print one or more items with precise control over each character If there are other use cases they can obviously be coded by using (b) and a modest amount of custom code. (I know there's the use case of printing sequences; I'll try to give an analysis of this use case in another post if one of its proponents doesn't do so soon.) An additional use case that I am willing to entertain because there is a lot of prior art (like Python's logging package, Bill Janssen's note(), and of course many other languages) is format-directed printing. This can of course be reduced to use case (b) using the str.% operator, but it is common enough to at least *consider* providing a direct solution which avoids the pitfalls of the % operator. Call this use case (c). Interesting, use case (b) can also easily be reduced to use case (c)! In a different thread I mentioned a design principle for which I have no catchy name, but which has often helped me design better APIs. One way to state it is to say that instead of a single "swiss-army-knife" function with various options that choose different behavior variants, it's better to have different dedicated functions for each of the major functionality types. So let's call it the "Swiss Army Knife (...Not)" API design pattern. There are a number of reasons why this API design is often better. These aren't quite the same reasons why a real life Swiss Army knife is often inferior to individual tools, if you have them available, so the analogy isn't perfect. (So sue me. :-) * It reduces the number of parameters, which reduces the cognitive overhead for the human reader. (It also reduces function call overhead some; but that's not the main reason.) * It puts the hint about the specific variant functionality at the front rather than at the end, so it is less likely overlooked. * If one variant is much more common than others, it is easier to learn just that behavior. * In the (common) case where the options are Booleans, it's often confusing whether True or False switches a particular behavior on or off (especially if they are allowed to be specified as positional parameters). * A good test to discover that you should have used this pattern is when you find that the argument specifying a particular option is a constant at every call site (perhaps excluding API wrappers). This is a hint that the different variants of the functionality are catering to different use cases; often you'll find that substituting a different variant behavior just wouldn't work because the use that is made of the returned value expects a specific variant. Some examples of the design pattern in action are str.strip(), str.lstrip() and str.rstrip(), or str.find() and str.rfind(). A much stronger subcase of this pattern (with fewer exceptions) is that the return type of a function shouldn't depend on the value of an argument. I have a feeling that if we were to extend the notion of type to include invariants, you'd find that the basic pattern is actually the same -- often the variant behaviors change the key invariant relationships between input and output. OK, still with me? This, together with the observation that the only use cases for the delimiter are space and no space, suggests that we should have separate printing APIs for each of the use cases (a), (b) and (c) above, rather than trying to fold (b) into (a) using a way to parameterize the separator (and the trailing newline, to which the same argument applies). For example: (a) print(...) (b) printraw(...) or printbare(...) (c) printf(fmt, ...) Each can take a keyword parameter to specify a different stream than sys.stdout; but no other options are needed. The names for (a) and (c) are pretty much fixed by convention (and by the clamoring when I proposed write() :-). I'm not so sure about the best name for (b), but I think picking the right name is important. We could decide not to provide (b) directly, since it is easily reduced to (c) using an appropriate format string ("%s" times the number of arguments). But I expect that use case (b) is pretty important, and not everyone likes having to use format strings. This could be reduced to a special case of the Swiss Army Knife (...Not) rule. BTW we could use "from __future__ import printing" to disable the recognition of 'print' as a keyword in a particular module -- this would provide adequate future-proofing. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From barry at python.org Tue Sep 6 04:24:09 2005 From: barry at python.org (Barry Warsaw) Date: Mon, 05 Sep 2005 22:24:09 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> Message-ID: <1125973449.24500.6.camel@geddy.wooz.org> On Mon, 2005-09-05 at 20:52, Guido van Rossum wrote: > We could decide not to provide (b) directly, since it is easily > reduced to (c) using an appropriate format string ("%s" times the > number of arguments). But I expect that use case (b) is pretty > important, and not everyone likes having to use format strings. This > could be reduced to a special case of the Swiss Army Knife (...Not) > rule. I'm not sure. I do agree with your design principles (though I might call it "Sometime's a Spoon's Just a Spoon" ;) but thinking about my own uses of print, I think we could easily get away with just (a) and (c). I think someone else felt the same way in an earlier response to my strawman, pointing out that the inline Separator instances wasn't really any more usable than just degenerating to the format string version. There's no doubt that the format string approach gives you direct control over every character. Eliminating the newline argument from print() would reduce the number of reserved keyword arguments in my strawman by half. Maybe we could even rename 'to' to '__to__' (!) to eliminate the other namespace wart. Is this really too horrible: print('$user forgot to frobnicate the $file!\n', user=username, file=file.name, __to__=sys.stderr) > BTW we could use "from __future__ import printing" to disable the > recognition of 'print' as a keyword in a particular module -- this > would provide adequate future-proofing. +1 -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050905/f95cc558/attachment.pgp From skip at pobox.com Tue Sep 6 04:40:10 2005 From: skip at pobox.com (skip@pobox.com) Date: Mon, 5 Sep 2005 21:40:10 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <50862ebd0509040334520f1430@mail.gmail.com> References: <200509021214.12531.gmccaughan@synaptics-uk.com> <50862ebd050902050650c0a83@mail.gmail.com> <200509021552.21149.gmccaughan@synaptics-uk.com> <50862ebd0509040334520f1430@mail.gmail.com> Message-ID: <17181.394.810996.111850@montanaro.dyndns.org> Neil> In interactive mode, you are normally interested in the values of Neil> things, not their formatting so it does the right thing. >>> class Dumb: ... def __init__(self, val): ... self.val = val ... def __str__(self): ... return " " % self.val ... >>> d = Dumb(5) >>> d <__main__.Dumb instance at 0x11042d8> >>> print d It's just repr() vs. str(), but the difference can be significant in many circumstances. Skip From skip at pobox.com Tue Sep 6 04:46:22 2005 From: skip at pobox.com (skip@pobox.com) Date: Mon, 5 Sep 2005 21:46:22 -0500 Subject: [Python-Dev] String views In-Reply-To: <431B9493.20204@canterbury.ac.nz> References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> <43167BDB.6010002@canterbury.ac.nz> <431B9493.20204@canterbury.ac.nz> Message-ID: <17181.766.64934.211439@montanaro.dyndns.org> Greg> If a Python function is clearly wrapping a C function, one doesn't Greg> expect to be able to pass strings with embedded NULs to it. Isn't that just floating an implementation detail up to the programmer (who may well not be POSIX- or Unix-aware)? From skip at pobox.com Tue Sep 6 04:49:37 2005 From: skip at pobox.com (skip@pobox.com) Date: Mon, 5 Sep 2005 21:49:37 -0500 Subject: [Python-Dev] gdbinit problem In-Reply-To: References: Message-ID: <17181.961.655922.441900@montanaro.dyndns.org> Neal> The only way I could see to fix it was by setting a continue flag Neal> and testing it. Does anyone know a better way to fix this Neal> problem? Certainly looks reasonable until we figure out how (if at all) GDB's command language implements a break-like statement. Skip From steve at holdenweb.com Tue Sep 6 05:34:38 2005 From: steve at holdenweb.com (Steve Holden) Date: Mon, 05 Sep 2005 23:34:38 -0400 Subject: [Python-Dev] String views In-Reply-To: <17181.766.64934.211439@montanaro.dyndns.org> References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> <43167BDB.6010002@canterbury.ac.nz> <431B9493.20204@canterbury.ac.nz> <17181.766.64934.211439@montanaro.dyndns.org> Message-ID: skip at pobox.com wrote: > Greg> If a Python function is clearly wrapping a C function, one doesn't > Greg> expect to be able to pass strings with embedded NULs to it. > > Isn't that just floating an implementation detail up to the programmer (who may > well not be POSIX- or Unix-aware)? As far as I'm concerned it is, yes. Until this thread highlighted it I hadn't really considered this issue. It's a bit ugly that C extensions won't handle the full range of strings that pure python code will, but it's a typically pragmatic Python solution, so I'm not about to start a war about it. regards Steve -- Steve Holden +44 150 684 7255 +1 800 494 3119 Holden Web LLC http://www.holdenweb.com/ From bcannon at gmail.com Tue Sep 6 06:01:20 2005 From: bcannon at gmail.com (Brett Cannon) Date: Mon, 5 Sep 2005 21:01:20 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1125973449.24500.6.camel@geddy.wooz.org> References: <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <1125973449.24500.6.camel@geddy.wooz.org> Message-ID: On 9/5/05, Barry Warsaw wrote: > On Mon, 2005-09-05 at 20:52, Guido van Rossum wrote: > > > We could decide not to provide (b) directly, since it is easily > > reduced to (c) using an appropriate format string ("%s" times the > > number of arguments). But I expect that use case (b) is pretty > > important, and not everyone likes having to use format strings. This > > could be reduced to a special case of the Swiss Army Knife (...Not) > > rule. > > I'm not sure. I do agree with your design principles (though I might > call it "Sometime's a Spoon's Just a Spoon" ;) but thinking about my own > uses of print, I think we could easily get away with just (a) and (c). > I think someone else felt the same way in an earlier response to my > strawman, pointing out that the inline Separator instances wasn't really > any more usable than just degenerating to the format string version. > There's no doubt that the format string approach gives you direct > control over every character. > > Eliminating the newline argument from print() would reduce the number of > reserved keyword arguments in my strawman by half. Maybe we could even > rename 'to' to '__to__' (!) to eliminate the other namespace wart. Is > this really too horrible: > > print('$user forgot to frobnicate the $file!\n', > user=username, file=file.name, __to__=sys.stderr) > If I something stupid, I apologize; I have been swamped with orientation stuff while this entire discussion has been going on and so I am sure I have missed some of the finer details. I like the way the above works, but ``print(username, "forgot to frobicate the", file.name)`` just seems nicer for simple output. I do agree that there is a need for simple and formatted versions of print and that controlled output of numbers is important. And I also like the $ formatting so I wished there was a way to take what Barry did above but be able to do formatting, like ``${num:0.6f}`` or something and have that be the formatting version and just have the default be a call on str() for the substitution. > > BTW we could use "from __future__ import printing" to disable the > > recognition of 'print' as a keyword in a particular module -- this > > would provide adequate future-proofing. > > +1 +1 from me as well. -Brett From ironfroggy at gmail.com Tue Sep 6 06:21:48 2005 From: ironfroggy at gmail.com (Calvin Spealman) Date: Tue, 6 Sep 2005 00:21:48 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> Message-ID: <76fd5acf050905212157306c5@mail.gmail.com> On 9/1/05, Guido van Rossum wrote: > [Charles Cazabon] > > >> Perhaps py3k could have a py2compat module. Importing it could have the > > >> effect of (for instance) putting compile, id, and intern into the global > > >> namespace, making print an alias for writeln, > > [Greg Ewing] > > > There's no way importing a module could add something that > > > works like the old print statement, unless some serious > > > magic is going on... > > [Reinhold Birkenfeld] > > You'd have to enclose print arguments in parentheses. Of course, the "trailing > > comma" form would be lost. > > And good riddance! The print statement harks back to ABC and even > (unvisual) Basic. Out with it! > > A transitional strategy could be to start designing the new API and > introduce it in Python 2.x. Here's my strawman: > > (1) Add two new methods the the stream (file) API and extend write(): > stream.write(a1, a2, ...) -- equivalent to map(stream.write, map(str, > [a1, a2, ...])) > stream.writeln(a1, a2, ...) -- equivalent to stream.write(a1, a2, ..., "\n") > stream.writef(fmt, a1, a2, ...) -- equivalent to stream.write(fmt % > (a1, a2, ...)) > > (2) Add builtin functions write(), writeln(), writef() that call the > corresponding method on sys.stdout. (Note: these should not just be > the bound methods; assignment to sys.stdout should immediately affect > those, just like for print. There's an important use case for this.) > > -- > --Guido van Rossum (home page: http://www.python.org/~guido/) There is a lot of debate over this issue, obviously. Now, I think getting rid of the print statement can lead to ugly code, because a write function would be called as an expression, so where we'd once have prints on their own lines, that wouldn't be the case anymore, and things could get ugly. But, print is a little too inflexible. What about adding a special name __print__, which the print statement would call? It should be looked up as a local first, then global. Thus, different parts of a program can define their own __print__, without changing everyone else's stdout. The Python web people would love that. From guido at python.org Tue Sep 6 06:46:04 2005 From: guido at python.org (Guido van Rossum) Date: Mon, 5 Sep 2005 21:46:04 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <76fd5acf050905212157306c5@mail.gmail.com> References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> <76fd5acf050905212157306c5@mail.gmail.com> Message-ID: On 9/5/05, Calvin Spealman wrote: > There is a lot of debate over this issue, obviously. Now, I think > getting rid of the print statement can lead to ugly code, because a > write function would be called as an expression, so where we'd once > have prints on their own lines, that wouldn't be the case anymore, and > things could get ugly. Sounds like FUD to me. Lots of functions/methods exist that *could* be embedded in expressions, and never are. Or if they are, there's actually a good reason, and then being a mere function (instead of a statement) would actually be helpful. Anyway, why would it be important that prints are on their own line where so many other important actions don't have that privilege? > But, print is a little too inflexible. > What about adding a special name __print__, which the print statement > would call? It should be looked up as a local first, then global. > Thus, different parts of a program can define their own __print__, > without changing everyone else's stdout. The Python web people would > love that. Too many underscores; __print__ screams "internal use, don't mess" at you. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Tue Sep 6 06:56:34 2005 From: guido at python.org (Guido van Rossum) Date: Mon, 5 Sep 2005 21:56:34 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1125973449.24500.6.camel@geddy.wooz.org> References: <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <1125973449.24500.6.camel@geddy.wooz.org> Message-ID: On 9/5/05, Barry Warsaw wrote: > Eliminating the newline argument from print() would reduce the number of > reserved keyword arguments in my strawman by half. Maybe we could even > rename 'to' to '__to__' (!) to eliminate the other namespace wart. Is > this really too horrible: > > print('$user forgot to frobnicate the $file!\n', > user=username, file=file.name, __to__=sys.stderr) Yes, it is too horrible. As I said in another post, __xyzzy__ screams "special internal use, don't mess with this". I don't think the namespace wart is really a problem though; it's simple enough *not* to use 'to' as a variable name in the format. Didn't you mean printf()? (Though I think if the format string doesn't roughly follow C's format string conventions the function shouldn't be called printf().) What do you think of the trick (that I wasn't aware of before) used in Java and .net of putting an optional position specifier in the format, and using positional arguments? It would be a little less verbose and with sensible defaults wouldn't quite punish everybody as much for the needs of i18n. Formats with more than 3 or 4 variables should be rare in any case (these are not the days of Fortran formatted output). -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Tue Sep 6 06:57:41 2005 From: guido at python.org (Guido van Rossum) Date: Mon, 5 Sep 2005 21:57:41 -0700 Subject: [Python-Dev] gdbinit problem In-Reply-To: <17181.961.655922.441900@montanaro.dyndns.org> References: <17181.961.655922.441900@montanaro.dyndns.org> Message-ID: On 9/5/05, skip at pobox.com wrote: > > Neal> The only way I could see to fix it was by setting a continue flag > Neal> and testing it. Does anyone know a better way to fix this > Neal> problem? > > Certainly looks reasonable until we figure out how (if at all) GDB's command > language implements a break-like statement. Ah. Now you've heard from the other user. :-) -- --Guido van Rossum (home page: http://www.python.org/~guido/) From tanzer at swing.co.at Tue Sep 6 08:33:31 2005 From: tanzer at swing.co.at (tanzer@swing.co.at) Date: Tue, 06 Sep 2005 08:33:31 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Mon, 05 Sep 2005 21:56:34 PDT." Message-ID: Guido van Rossum wrote: > What do you think of the trick (that I wasn't aware of before) > used in Java and .net of putting an optional position specifier > in the format, and using positional arguments? It would be a > little less verbose and with sensible defaults wouldn't quite > punish everybody as much for the needs of i18n. Formats with more > than 3 or 4 variables should be rare in any case (these are not > the days of Fortran formatted output). Positional arguments remove too much meaning from the template. Compare: '$user forgot to frobnicate the $file!\n' with '$1 forgot to frobnicate the $2!\n' Whenever the template definition and its use are not directly adjacent, the template is that much harder to understand (i.e., in the context of translation, one wouldn't see the arguments passed to the template). -- Christian Tanzer http://www.c-tanzer.at/ From fredrik at pythonware.com Tue Sep 6 10:04:37 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Tue, 6 Sep 2005 10:04:37 +0200 Subject: [Python-Dev] String views References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com><17174.22550.862457.829100@montanaro.dyndns.org><43167BDB.6010002@canterbury.ac.nz> <431B9493.20204@canterbury.ac.nz> <17181.766.64934.211439@montanaro.dyndns.org> Message-ID: skip at pobox.com wrote: > Greg> If a Python function is clearly wrapping a C function, one doesn't > Greg> expect to be able to pass strings with embedded NULs to it. > > Isn't that just floating an implementation detail up to the programmer (who may > well not be POSIX- or Unix-aware)? so if POSIX refuses to deal with, e.g., NUL bytes in file names, Python should somehow work around that to avoid "exposing implementation details" ? From nicksjacobson at yahoo.com Tue Sep 6 10:57:32 2005 From: nicksjacobson at yahoo.com (Nick Jacobson) Date: Tue, 6 Sep 2005 01:57:32 -0700 (PDT) Subject: [Python-Dev] reference counting in Py3K Message-ID: <20050906085732.90687.qmail@web53908.mail.yahoo.com> While we're on the subject of Python 3000, what's the chance that reference counting when calling C functions from Python will go away? To me this is one of the few annoyances I have with Python. I know that Ruby somehow gets around the need for ref. counting. __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com From ncoghlan at gmail.com Tue Sep 6 12:44:19 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 06 Sep 2005 20:44:19 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> Message-ID: <431D7303.3020704@gmail.com> Guido van Rossum wrote: > If there are other use cases they can obviously be coded by using (b) > and a modest amount of custom code. (I know there's the use case of > printing sequences; I'll try to give an analysis of this use case in > another post if one of its proponents doesn't do so soon.) I did a fair bit of tinkering with that on the weekend. Printing a sequence of strings is fine - it's the call to "map(str, seq)" that makes printing a sequence of non-strings uglier than it should be. Doing it that way also breaks the Python idiom of letting unicode remain as unicode. However, one thing I eventually remembered was a discussion about a year ago regarding the possibility of allowing str.join and unicode.join to accept non-strings [1]. That discussion ended up leaving the join methods alone, because it is damn hard to do it without slowing down the common case where the sequence is all strings. I'm currently considering proposing a "joinany" method for str and unicode which accepts a sequence of arbitrary objects (I have a patch, but it needs unit tests and docs, and I'm uncomfortable with the amount of code duplication between the join and joinany methods). [1] http://mail.python.org/pipermail/python-dev/2004-August/048516.html > Some examples of the design pattern in action are str.strip(), > str.lstrip() and str.rstrip(), or str.find() and str.rfind(). > > A much stronger subcase of this pattern (with fewer exceptions) is > that the return type of a function shouldn't depend on the value of an > argument. I have a feeling that if we were to extend the notion of > type to include invariants, you'd find that the basic pattern is > actually the same -- often the variant behaviors change the key > invariant relationships between input and output. This becomes especially clear once "sorted" and "list.sort" are given as examples where the various keyword arguments do not change the basic invariant properties of the sorting operations - you start with a sequence, and you end up with essentially the same sequence, only in a different order. The keyword arguments simply control the precise meaning of "different order". > (a) print(...) > (b) printraw(...) or printbare(...) > (c) printf(fmt, ...) Hmm, I like those names better than anything else that has come up so far. > Each can take a keyword parameter to specify a different stream than > sys.stdout; but no other options are needed. The names for (a) and (c) > are pretty much fixed by convention (and by the clamoring when I > proposed write() :-). I'm not so sure about the best name for (b), but > I think picking the right name is important. 'printraw' is good - it makes it clear it is part of the same family as 'print' and 'printf', and explains succintly how it differs from the normal print function. > We could decide not to provide (b) directly, since it is easily > reduced to (c) using an appropriate format string ("%s" times the > number of arguments). But I expect that use case (b) is pretty > important, and not everyone likes having to use format strings. This > could be reduced to a special case of the Swiss Army Knife (...Not) > rule. Additionally, doing 'printraw' with 'printf' is a little tricky - the best I've come up with is "printf('%s'*3, a, b, c)". > BTW we could use "from __future__ import printing" to disable the > recognition of 'print' as a keyword in a particular module -- this > would provide adequate future-proofing. Gah, sometimes I miss the most obvious of solutions. . . Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From greg.ewing at canterbury.ac.nz Tue Sep 6 13:13:57 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 06 Sep 2005 23:13:57 +1200 Subject: [Python-Dev] Pascaloid print substitute (Replacement for print inPython 3.0) In-Reply-To: <002201c5b1c7$f19e0340$232dc797@oemcomputer> References: <002201c5b1c7$f19e0340$232dc797@oemcomputer> Message-ID: <431D79F5.9020309@canterbury.ac.nz> Raymond Hettinger wrote: >> Print["One", "Two", ...] >> Print["Buckle my shoe"] > > The ellipsis was a nice touch. I've been wondering whether it would be worth allowing ellipses to appear in other places besides slice indices, so it could be used in a print-function and other such purposes without having to abuse the slice notation: print("One", "Two", ...) Greg From greg.ewing at canterbury.ac.nz Tue Sep 6 13:22:35 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 06 Sep 2005 23:22:35 +1200 Subject: [Python-Dev] Simplify the file-like-object interface (Replacement for print in Python 3.0) In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> Message-ID: <431D7BFB.70201@canterbury.ac.nz> Fredrik Lundh wrote: > (you completely missed the point -- today's print mechanism works on *any* object > that implements a "write" method, no just file objects. saying that "oh, all you need is > to add a method" or "here's a nice mixin" doesn't give you a print replacement) While we're on the subject, in Py3k I'd like to see readline(), readlines(), etc. removed from file objects and made builtin functions instead. It should only be necessary to implement read() and write() to get a file-like object having equal status with all others. Greg From greg.ewing at canterbury.ac.nz Tue Sep 6 13:37:59 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Tue, 06 Sep 2005 23:37:59 +1200 Subject: [Python-Dev] Example for "property" violates "Python is not a one pass compiler" In-Reply-To: <5.1.1.6.0.20050905120907.01b74500@mail.telecommunity.com> References: <5.1.1.6.0.20050905120907.01b74500@mail.telecommunity.com> Message-ID: <431D7F97.1010604@canterbury.ac.nz> Phillip J. Eby wrote: > I'm not sure where you got the "Python is not a one pass compiler" idea; I > don't recall having seen this meme anywhere before, and I don't see how > it's meaningful anyway. Indeed, Python's bytecode compiler essentially *is* a one-pass compiler (or at least it used to be -- not sure what's been done to it recently). But the behaviour seen here is more about what happens at run time than compile time. What you're trying to do is essentially the same as print x x = 42 which fails at run time because x hasn't been bound when the print statement is executed. Greg From duncan.booth at suttoncourtenay.org.uk Tue Sep 6 13:51:24 2005 From: duncan.booth at suttoncourtenay.org.uk (Duncan Booth) Date: Tue, 6 Sep 2005 12:51:24 +0100 Subject: [Python-Dev] bug in urlparse References: <431AB747.7050500@evhr.net> <20050904233804.GA2731@unpythonic.net> Message-ID: jepler at unpythonic.net wrote in news:20050904233804.GA2731 at unpythonic.net: > According to RFC 2396[1] section 5.2: > > g) If the resulting buffer string still begins with one or more > complete path segments of "..", then the reference is > considered to be in error. Implementations may handle this > error by retaining these components in the resolved path (i.e., > treating them as part of the final URI), by removing them from > the resolved path (i.e., discarding relative levels above the > root), or by avoiding traversal of the reference. > > If I read this right, it explicitly allows the urlparse.urljoin behavior > ("handle this error by retaining these components in the resolved path"). > Yes, the urljoin behaviour is explicitly allowed, however it is not the most commonly implemented permitted behaviour. Both IE and Mozilla/Firefox handle this error by stripping the spurious .. elements from the front of the path. Apache, and I hope other web servers, work by the third permitted method, i.e. rejecting requests to these invalid urls. The net effect of this is that on some sites using a Python spider (e.g. webchecker.py) will produce a large number of error messages for links which browsers will actually resolve successfully. (At least that's when I first noticed this particular problem). Depending on your reasons for spidering a site this can be either a good thing or an annoyance. From barry at python.org Tue Sep 6 14:01:07 2005 From: barry at python.org (Barry Warsaw) Date: Tue, 06 Sep 2005 08:01:07 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <1125973449.24500.6.camel@geddy.wooz.org> Message-ID: <1126008067.13174.11.camel@presto.wooz.org> On Tue, 2005-09-06 at 00:56, Guido van Rossum wrote: > On 9/5/05, Barry Warsaw wrote: > > Eliminating the newline argument from print() would reduce the number of > > reserved keyword arguments in my strawman by half. Maybe we could even > > rename 'to' to '__to__' (!) to eliminate the other namespace wart. Is > > this really too horrible: > > > > print('$user forgot to frobnicate the $file!\n', > > user=username, file=file.name, __to__=sys.stderr) > > Yes, it is too horrible. As I said in another post, __xyzzy__ screams > "special internal use, don't mess with this". Fair enough -- it looked pretty icky to me too. > I don't think the namespace wart is really a problem though; it's > simple enough *not* to use 'to' as a variable name in the format. True. > Didn't you mean printf()? (Though I think if the format string doesn't > roughly follow C's format string conventions the function shouldn't be > called printf().) Yep, I meant printf(). > What do you think of the trick (that I wasn't aware of before) used in > Java and .net of putting an optional position specifier in the format, > and using positional arguments? It would be a little less verbose and > with sensible defaults wouldn't quite punish everybody as much for the > needs of i18n. Formats with more than 3 or 4 variables should be rare > in any case (these are not the days of Fortran formatted output). It's definitely an interesting idea, and would solve the namespace thing too. The above /might/ look like (warning: pre-coffee thought follows): printf('$1 forgot to frobnicate the $2!\n', username, file.name, to=sys.stderr) While that's a little less self-descriptive for a translator to deal with (who would only see the string, not the call site), it certainly looks nicer for a non-i18n application, and could certainly work for an i18n app too. It's a neat idea worth exploring. Also, I think you posted in a separate article a syntactic proposal for including detailed formating in $-vars. ${varname:fmt} where 'varname' could be an identifier a la PEP 292 or possibly a positional argument. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050906/f385b080/attachment.pgp From fredrik at pythonware.com Tue Sep 6 14:03:01 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Tue, 6 Sep 2005 14:03:01 +0200 Subject: [Python-Dev] Simplify the file-like-object interface (Replacement for print in Python 3.0) References: <20050902142044.GA18622@discworld.dyndns.org><17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> <431D7BFB.70201@canterbury.ac.nz> Message-ID: Greg Ewing wrote: >> (you completely missed the point -- today's print mechanism works on *any* object >> that implements a "write" method, no just file objects. saying that "oh, all you need is >> to add a method" or "here's a nice mixin" doesn't give you a print replacement) > > While we're on the subject, in Py3k I'd like to see > readline(), readlines(), etc. removed from file objects > and made builtin functions instead. It should only > be necessary to implement read() and write() to get > a file-like object having equal status with all > others. maybe some variation of http://www.python.org/peps/pep-0246.html combined with "default adapters" could come in handy here ? From fredrik at pythonware.com Tue Sep 6 14:05:47 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Tue, 6 Sep 2005 14:05:47 +0200 Subject: [Python-Dev] Example for "property" violates "Python is not a one pass compiler" References: <5.1.1.6.0.20050905120907.01b74500@mail.telecommunity.com> <431D7F97.1010604@canterbury.ac.nz> Message-ID: Greg Ewing wrote: > Indeed, Python's bytecode compiler essentially *is* > a one-pass compiler for a suitable setting of the "look-ahead window size", at least. some Python constructs cannot be compiled by a truly minimalistic one-pass compiler. From arigo at tunes.org Tue Sep 6 14:30:42 2005 From: arigo at tunes.org (Armin Rigo) Date: Tue, 6 Sep 2005 14:30:42 +0200 Subject: [Python-Dev] bug in urlparse In-Reply-To: References: <431AB747.7050500@evhr.net> <20050904233804.GA2731@unpythonic.net> Message-ID: <20050906123042.GA13252@code1.codespeak.net> Hi Duncan, On Tue, Sep 06, 2005 at 12:51:24PM +0100, Duncan Booth wrote: > The net effect of this is that on some sites using a Python spider (e.g. > webchecker.py) will produce a large number of error messages for links > which browsers will actually resolve successfully. As far as I'm concerned, even if it is not theoretically a buggy behavior, a proposed patch with the above motivation would be welcome (and, of course, this patch wouldn't break the RFC either). Armin From greg.ewing at canterbury.ac.nz Tue Sep 6 14:18:42 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Wed, 07 Sep 2005 00:18:42 +1200 Subject: [Python-Dev] String views In-Reply-To: <17181.766.64934.211439@montanaro.dyndns.org> References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> <43167BDB.6010002@canterbury.ac.nz> <431B9493.20204@canterbury.ac.nz> <17181.766.64934.211439@montanaro.dyndns.org> Message-ID: <431D8922.2030508@canterbury.ac.nz> skip at pobox.com wrote: > Greg> If a Python function is clearly wrapping a C function, one doesn't > Greg> expect to be able to pass strings with embedded NULs to it. > > Isn't that just floating an implementation detail up to the programmer (who may > well not be POSIX- or Unix-aware)? Yes, but in some cases that's unavoidable. It would be impractical to provide embedded-NUL-capable replacements for all C functions that someone might want (and flat-out impossible for some, e.g. os.open()). Greg From p.f.moore at gmail.com Tue Sep 6 14:49:45 2005 From: p.f.moore at gmail.com (Paul Moore) Date: Tue, 6 Sep 2005 13:49:45 +0100 Subject: [Python-Dev] Simplify the file-like-object interface (Replacement for print in Python 3.0) In-Reply-To: References: <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> <431D7BFB.70201@canterbury.ac.nz> Message-ID: <79990c6b050906054943999631@mail.gmail.com> On 9/6/05, Fredrik Lundh wrote: > Greg Ewing wrote: > > While we're on the subject, in Py3k I'd like to see > > readline(), readlines(), etc. removed from file objects > > and made builtin functions instead. It should only > > be necessary to implement read() and write() to get > > a file-like object having equal status with all > > others. > > maybe some variation of > > http://www.python.org/peps/pep-0246.html > > combined with "default adapters" could come in handy here ? That sounds like a good idea. I'm certainly getting concerned about the proliferation of methods that people "should" add to file-like objects, where read/write are the only fundamental ones needed. I can't see mixins working, as too many file-like objects are written in C... Paul. From greg.ewing at canterbury.ac.nz Tue Sep 6 14:42:30 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Wed, 07 Sep 2005 00:42:30 +1200 Subject: [Python-Dev] Simplify the file-like-object interface (Replacement for print in Python 3.0) In-Reply-To: References: <20050902142044.GA18622@discworld.dyndns.org> <17176.26832.44077.299214@montanaro.dyndns.org> <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> <431D7BFB.70201@canterbury.ac.nz> Message-ID: <431D8EB6.1070702@canterbury.ac.nz> Fredrik Lundh wrote: > maybe some variation of > > http://www.python.org/peps/pep-0246.html > > combined with "default adapters" could come in handy here ? I really hope we can get by with something much less heavyweight than that. I'm far from convinced that something like PEP 246 proposes is necessary or desirable. Greg From greg.ewing at canterbury.ac.nz Tue Sep 6 14:44:38 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Wed, 07 Sep 2005 00:44:38 +1200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> Message-ID: <431D8F36.2080909@canterbury.ac.nz> Guido van Rossum wrote: > So let's call it the "Swiss Army Knife > (...Not)" API design pattern. Aha! Maybe this is the long-lost 20th principle from the Zen of Python? Greg From solipsis at pitrou.net Tue Sep 6 15:12:40 2005 From: solipsis at pitrou.net (Antoine Pitrou) Date: Tue, 06 Sep 2005 15:12:40 +0200 Subject: [Python-Dev] Simplify the file-like-object interface In-Reply-To: <79990c6b050906054943999631@mail.gmail.com> References: <79990c6b05090208367372f705@mail.gmail.com> <79990c6b05090212453f3b7c77@mail.gmail.com> <431D7BFB.70201@canterbury.ac.nz> <79990c6b050906054943999631@mail.gmail.com> Message-ID: <1126012360.5469.13.camel@p-dvsi-418-1.rd.francetelecom.fr> > That sounds like a good idea. I'm certainly getting concerned about > the proliferation of methods that people "should" add to file-like > objects, where read/write are the only fundamental ones needed. > > I can't see mixins working, as too many file-like objects are written in C... One could use "class decorators". For example if you want to define the method foo() in a file-like class, you could use code like: def FooExtender(cls): class wrapper(cls): pass try: # Don't do anything if "foo" already defined wrapper.foo except AttributeError: def foo(self): """ Automatically generated foo method. """ self.write("foo\n") wrapper.foo = foo return wrapper MyFileClass = FooExtender(MyCFileClass) This is for classes, but the construct can be adapted to work on objects instead. The advantage of using a decorator-like function is that you can do some complex processing in the function (you could for example automatically define __ne__ if only __eq__ is defined, and conversely). And it could probably plug, as an useful convenience or even an automatic mechanism, into a more sophisticated adaptor system. From ncoghlan at gmail.com Tue Sep 6 15:16:31 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 06 Sep 2005 23:16:31 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <431D8F36.2080909@canterbury.ac.nz> References: <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <431D8F36.2080909@canterbury.ac.nz> Message-ID: <431D96AF.7020502@gmail.com> Greg Ewing wrote: > Guido van Rossum wrote: > >>So let's call it the "Swiss Army Knife >>(...Not)" API design pattern. > > > Aha! Maybe this is the long-lost 20th principle from > the Zen of Python? It also sounds like one of the reasons why the ultimates in programming swiss army knives (that is, Lisp macros and Ruby blocks) are unlikely to make an appearance in Python in their full, unconstrained 'glory'. . . There's an interesting comparison with UI design though - having a couple of different tools in the interface with sensible default behaviour is generally easier to use than a single tool where you have to tell it which behaviour you want all the time (or pick one as the default, and have to remember to tell the application when you want the other behaviour). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From ncoghlan at gmail.com Tue Sep 6 16:07:01 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 07 Sep 2005 00:07:01 +1000 Subject: [Python-Dev] string.Template format enhancements (Re: Replacement for print in Python 3.0) In-Reply-To: References: Message-ID: <431DA285.4080700@gmail.com> tanzer at swing.co.at wrote: > Positional arguments remove too much meaning from the template. > > Compare: > > '$user forgot to frobnicate the $file!\n' > > with > > '$1 forgot to frobnicate the $2!\n' > > Whenever the template definition and its use are not directly > adjacent, the template is that much harder to understand (i.e., > in the context of translation, one wouldn't see the arguments > passed to the template). Ideas like this one are really about taking the current C-originated string formatting and replacing it with a more powerful version of the new string.Template formatting. One interesting idea to consider is to add "format" and "safe_format" string methods (or "substitute" and "safe_substitute" if matching the PEP 292 method names is considered important) which work something like: # These would be new str and unicode methods def format(*args, **kwds): return string.Template(args[0]).substitute(*args[1:], **kwds) def safe_format(*args, **kwds): return string.Template(args[0]).safe_substitute(*args[1:], **kwds) Then enhance string.Template such that: 1. Item identifiers could be numbers as well as valid Python identifiers 2. Positional arguments were added to the dictionary of items using their argument index as a string Something like: # This would be modified behaviour of the substitute method # rather than a separate method or function def pos_substitute(*args, **kwds): self = args[0] kwds.update((str(idx), arg) for idx, arg in enumerate(args)) # Avoiding including self in kwds is also an option return self.substitute(**kwds) (To try this on Py2.4, use "$p1" for the positional arguments to easily get around the restriction in the string.Template regex) With the above changes, the following would work: "$1: $2".format("Number of bees", "0.5") And produce: "Number of bees: 0.5" When pre-compiling string.Templates, the keyword method is significantly clearer, but if the syntax was accessible through a string method, then being able to use positional arguments would be very handy. At this point, the only thing missing is the ability to handle proper output formatting - that would need to be done by invoking the string mod operator directly on the positional argument via: "$1: $2".format("Number of bees: ", "%0.2f" % val) In theory (I haven't really tried this bit), it should be possible to adjust string.Template to support the following equivalent: "$1: $[0.2f]2".format("Number of bees: ", val) That way, rather than inventing a new formatting language, string.Template could leverage the existing string mod operator by substituting the result of "('%' + fmt) % val" wherever it sees "$[fmt]name" or "$[fmt]{name}" Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From guido at python.org Tue Sep 6 16:15:23 2005 From: guido at python.org (Guido van Rossum) Date: Tue, 6 Sep 2005 07:15:23 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: Message-ID: On 9/5/05, tanzer at swing.co.at wrote: > Positional arguments remove too much meaning from the template. > > Compare: > > '$user forgot to frobnicate the $file!\n' > > with > > '$1 forgot to frobnicate the $2!\n' > > Whenever the template definition and its use are not directly > adjacent, the template is that much harder to understand (i.e., > in the context of translation, one wouldn't see the arguments > passed to the template). The operative word being *whenever*. You're thinking of the i18n use case, where the format string is separated from the arguments. I'm thinking of the non-i18n use case, where the format isalmost always a string *literal* adjacent to the arguments. I'm not at all convinced that we should attempt to find a solution that handles both use cases; most Python code never needs i18n. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From tanzer at swing.co.at Tue Sep 6 16:18:42 2005 From: tanzer at swing.co.at (tanzer@swing.co.at) Date: Tue, 06 Sep 2005 16:18:42 +0200 Subject: [Python-Dev] string.Template format enhancements (Re: Replacement for print in Python 3.0) In-Reply-To: Your message of "Wed, 07 Sep 2005 00:07:01 +1000." <431DA285.4080700@gmail.com> Message-ID: Nick Coghlan wrote: > With the above changes, the following would work: > "$1: $2".format("Number of bees", "0.5") > And produce: > "Number of bees: 0.5" > > When pre-compiling string.Templates, the keyword method is > significantly clearer, but if the syntax was accessible through a > string method, then being able to use positional arguments would > be very handy. As long as named arguments don't get lost, that's fine. I often use templates stored in variables/passed around as arguments, where the positional form is not clear at all: template.format("Number of bees", "0.5") -- Christian Tanzer http://www.c-tanzer.at/ From guido at python.org Tue Sep 6 16:21:09 2005 From: guido at python.org (Guido van Rossum) Date: Tue, 6 Sep 2005 07:21:09 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <431D7303.3020704@gmail.com> References: <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <431D7303.3020704@gmail.com> Message-ID: On 9/6/05, Nick Coghlan wrote: > I did a fair bit of tinkering with that on the weekend. Printing a sequence of > strings is fine - it's the call to "map(str, seq)" that makes printing a > sequence of non-strings uglier than it should be. Doing it that way also > breaks the Python idiom of letting unicode remain as unicode. Only of you insist on doing it in a single call. With an explicit for loop all is well. > However, one thing I eventually remembered was a discussion about a year ago > regarding the possibility of allowing str.join and unicode.join to accept > non-strings [1]. > > That discussion ended up leaving the join methods alone, because it is damn > hard to do it without slowing down the common case where the sequence is all > strings. > > I'm currently considering proposing a "joinany" method for str and unicode > which accepts a sequence of arbitrary objects (I have a patch, but it needs > unit tests and docs, and I'm uncomfortable with the amount of code duplication > between the join and joinany methods). Why not take an idea that Fredrik Lundh mentioned earlier, and have a built-in *function* named join() which takes a sequence and a string? joinany() is an ugly name. But what's still missing is a use case analysis where you prove that the use case is common enough to require explicit support. > This becomes especially clear once "sorted" and "list.sort" are given as > examples where the various keyword arguments do not change the basic invariant > properties of the sorting operations - you start with a sequence, and you end > up with essentially the same sequence, only in a different order. The keyword > arguments simply control the precise meaning of "different order". Thanks -- a very good example! > 'printraw' is good - it makes it clear it is part of the same family as > 'print' and 'printf', and explains succintly how it differs from the normal > print function. (Except that 'raw' could mean anything.) > Additionally, doing 'printraw' with 'printf' is a little tricky - the best > I've come up with is "printf('%s'*3, a, b, c)". Yeah, but often the real code you need to do is already written as print("x =", x, "y =", y, "z =", z) and that becomes more readable when you transform it to printf("x = %s y = %s z = %s\n", x, y, z) -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Tue Sep 6 16:23:52 2005 From: guido at python.org (Guido van Rossum) Date: Tue, 6 Sep 2005 07:23:52 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1126008067.13174.11.camel@presto.wooz.org> References: <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <1125973449.24500.6.camel@geddy.wooz.org> <1126008067.13174.11.camel@presto.wooz.org> Message-ID: On 9/6/05, Barry Warsaw wrote: > printf('$1 forgot to frobnicate the $2!\n', username, file.name, > to=sys.stderr) > > While that's a little less self-descriptive for a translator to deal > with (who would only see the string, not the call site), it certainly > looks nicer for a non-i18n application, and could certainly work for an > i18n app too. It's a neat idea worth exploring. Is it worth doing this and completely dropping the %-based formats in Py3k? (Just asking -- it might be if we can get people to get over the shock of $ becoming first class ;-). > Also, I think you posted in a separate article a syntactic proposal for > including detailed formating in $-vars. ${varname:fmt} where 'varname' > could be an identifier a la PEP 292 or possibly a positional argument. +1 I proposed ${varname%fmt} earlier but it prevents you to extend the varname syntax to arbitrary expressions, which I think is an extension that will get lots of requests. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From ncoghlan at gmail.com Tue Sep 6 16:24:21 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 07 Sep 2005 00:24:21 +1000 Subject: [Python-Dev] Example for "property" violates "Python is not a one pass compiler" In-Reply-To: <431D7F97.1010604@canterbury.ac.nz> References: <5.1.1.6.0.20050905120907.01b74500@mail.telecommunity.com> <431D7F97.1010604@canterbury.ac.nz> Message-ID: <431DA695.7070401@gmail.com> Greg Ewing wrote: > Phillip J. Eby wrote: > >>I'm not sure where you got the "Python is not a one pass compiler" idea; I >>don't recall having seen this meme anywhere before, and I don't see how >>it's meaningful anyway. > > > Indeed, Python's bytecode compiler essentially *is* > a one-pass compiler (or at least it used to be -- not > sure what's been done to it recently). It builds the symbol table before actually trying to compile anything. This is what allows it to figure out which load commands to use for which symbols. I'm not up to speed on my compiler theory though, so I'm not sure if building the symbol table first is enough to count as two-pass compilation. > But the behaviour seen here is more about what happens > at run time than compile time. What you're trying to > do is essentially the same as > > print x > x = 42 > > which fails at run time because x hasn't been bound > when the print statement is executed. I've found this to be one of the subtler habits to try and break when coming from a static language - in something like C++, functions and classes are interpreted at compile time, and the result made part of the executable. In Python, the only thing created at compile time is the bytecode required to create the class or function - the definition isn't actually processed until runtime. It's easy to forget that difference (mainly because it doesn't matter very often). Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From tanzer at swing.co.at Tue Sep 6 16:25:17 2005 From: tanzer at swing.co.at (tanzer@swing.co.at) Date: Tue, 06 Sep 2005 16:25:17 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Tue, 06 Sep 2005 07:15:23 PDT." Message-ID: Guido van Rossum wrote : > On 9/5/05, tanzer at swing.co.at wrote: > > Whenever the template definition and its use are not directly > > adjacent, the template is that much harder to understand (i.e., This `i.e.` should have read `e.g.` :-( > > in the context of translation, one wouldn't see the arguments > > passed to the template). > > The operative word being *whenever*. You're thinking of the i18n use > case, where the format string is separated from the arguments. I'm > thinking of the non-i18n use case, where the format isalmost always a > string *literal* adjacent to the arguments. I'm not at all convinced > that we should attempt to find a solution that handles both use cases; > most Python code never needs i18n. I often put format strings into class variables (to be overriden) or pass them around as arguments, which has nothing to do with i18n. And i18n is going to be more and more important (says this german speaker who always tries to get away with English programs :-) I'm all for allowing positional arguments but would badly miss named arguments. -- Christian Tanzer http://www.c-tanzer.at/ From solipsis at pitrou.net Tue Sep 6 16:33:36 2005 From: solipsis at pitrou.net (Antoine Pitrou) Date: Tue, 06 Sep 2005 16:33:36 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <1125973449.24500.6.camel@geddy.wooz.org> <1126008067.13174.11.camel@presto.wooz.org> Message-ID: <1126017216.5469.24.camel@p-dvsi-418-1.rd.francetelecom.fr> (just my 2 cents) Le mardi 06 septembre 2005 ? 07:23 -0700, Guido van Rossum a ?crit : > On 9/6/05, Barry Warsaw wrote: > > printf('$1 forgot to frobnicate the $2!\n', username, file.name, > > to=sys.stderr) > Is it worth doing this and completely dropping the %-based formats in > Py3k? (Just asking -- it might be if we can get people to get over the > shock of $ becoming first class ;-). For me, the problem with that proposal is not the precise format syntax, but the fact that formatting is tied to a specific function which _also_ outputs stuff to screen. There are really use cases where you want formatting without using a "print" function in conjunction. Web pages, sending notification e-mails, changing labels in GUI apps... anything that talks to the user in a different way than using stdout. IMO, printing and formatting must be distinct (*). And formatting should be convenient and i18n-friendly (i18n is more and more important in today's apps). (*) they should be treated separately in the discussion, anyway Regards Antoine. From rzantow at ntelos.net Tue Sep 6 16:58:53 2005 From: rzantow at ntelos.net (rzed) Date: Tue, 06 Sep 2005 10:58:53 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: Message-ID: <431DAEAD.9090403@ntelos.net> Guido van Rossum wrote: [...] > OK, still with me? This, together with the observation that the only > use cases for the delimiter are space and no space, suggests that we > should have separate printing APIs for each of the use cases (a), (b) > and (c) above, rather than trying to fold (b) into (a) using a way to > parameterize the separator (and the trailing newline, to which the > same argument applies). For example: > > (a) print(...) > (b) printraw(...) or printbare(...) > (c) printf(fmt, ...) > > Each can take a keyword parameter to specify a different stream than > sys.stdout; but no other options are needed. The names for (a) and (c) > are pretty much fixed by convention (and by the clamoring when I > proposed write() :-). I'm not so sure about the best name for (b), but > I think picking the right name is important. Applying the same reasoning as above, why not remove the last remaining keyword parameter by adding fprint(ftobj,...) fprintraw( ftobj,...) and fprintf(ftobj,fmt,...) functions? -- rzed From guido at python.org Tue Sep 6 17:28:45 2005 From: guido at python.org (Guido van Rossum) Date: Tue, 6 Sep 2005 08:28:45 -0700 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: <20050906085732.90687.qmail@web53908.mail.yahoo.com> References: <20050906085732.90687.qmail@web53908.mail.yahoo.com> Message-ID: On 9/6/05, Nick Jacobson wrote: > While we're on the subject of Python 3000, what's the > chance that reference counting when calling C > functions from Python will go away? We'd have to completely change the implementation. We're not planning on that. > To me this is one of the few annoyances I have with > Python. I know that Ruby somehow gets around the need > for ref. counting. You could always use IronPython or Jython of course, neither of which has this. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From skip at pobox.com Tue Sep 6 17:32:22 2005 From: skip at pobox.com (skip@pobox.com) Date: Tue, 6 Sep 2005 10:32:22 -0500 Subject: [Python-Dev] String views In-Reply-To: References: <2773CAC687FD5F4689F526998C7E4E5F4DB599@au3010avexu1.global.avaya.com> <17174.22550.862457.829100@montanaro.dyndns.org> <43167BDB.6010002@canterbury.ac.nz> <431B9493.20204@canterbury.ac.nz> <17181.766.64934.211439@montanaro.dyndns.org> Message-ID: <17181.46726.741546.790533@montanaro.dyndns.org> Greg> If a Python function is clearly wrapping a C function, one doesn't Greg> expect to be able to pass strings with embedded NULs to it. Skip> Isn't that just floating an implementation detail up to the Skip> programmer (who may well not be POSIX- or Unix-aware)? Fredrik> so if POSIX refuses to deal with, e.g., NUL bytes in file Fredrik> names, Python should somehow work around that to avoid Fredrik> "exposing implementation details" ? I don't know what the correct answer is. I suspect the right thing to do will vary depending on what C function is being wrapped. I was just making sure I understood correctly that there is a potential problem. Skip From mwh at python.net Tue Sep 6 17:35:43 2005 From: mwh at python.net (Michael Hudson) Date: Tue, 06 Sep 2005 16:35:43 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: (Guido van Rossum's message of "Sun, 4 Sep 2005 08:59:02 -0700") References: <8328163536998793998@unknownmsgid> Message-ID: <2my86amcog.fsf@starship.python.net> Guido van Rossum writes: > On 9/3/05, Bill Janssen wrote: >> So here's the summary of the arguments against: two style points >> (trailing comma and >>stream) (from the man who approved the current >> decorator syntax!), and it's hard to extend. (By the way, I agree that >> the ">>" syntax is ugly, and IMO a bad idea in general. Shame the "@" >> wasn't used instead. :-) >> >> Seems pretty weak to me. Are there other args against? > > Sure. I made the mistake of thinking that everybody knew them. [...] > But more important to me are my own experiences exploring the > boundaries of print. [...] Gnnnnnyagh, couldn't you have *started* the thread with that post? :) Cheers, mwh -- Get out your salt shakers folks, this one's going to take more than one grain. -- Ator in an Ars Technica news item From guido at python.org Tue Sep 6 17:44:42 2005 From: guido at python.org (Guido van Rossum) Date: Tue, 6 Sep 2005 08:44:42 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <2my86amcog.fsf@starship.python.net> References: <8328163536998793998@unknownmsgid> <2my86amcog.fsf@starship.python.net> Message-ID: On 9/6/05, Michael Hudson wrote: > Gnnnnnyagh, couldn't you have *started* the thread with that post? :) I hadn't anticipated so many great minds rusted shut. :-) -- --Guido van Rossum (home page: http://www.python.org/~guido/) From gmccaughan at synaptics-uk.com Tue Sep 6 17:53:49 2005 From: gmccaughan at synaptics-uk.com (Gareth McCaughan) Date: Tue, 6 Sep 2005 16:53:49 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1126017216.5469.24.camel@p-dvsi-418-1.rd.francetelecom.fr> References: <1126017216.5469.24.camel@p-dvsi-418-1.rd.francetelecom.fr> Message-ID: <200509061653.50505.gmccaughan@synaptics-uk.com> > > On 9/6/05, Barry Warsaw wrote: > > > printf('$1 forgot to frobnicate the $2!\n', username, file.name, > > > to=sys.stderr) ... > For me, the problem with that proposal is not the precise format syntax, > but the fact that formatting is tied to a specific function which _also_ > outputs stuff to screen. So borrow a trick from Common Lisp and use a destination of None to mean "return the formatted text as a string". >>> x = printf("$2 $1", 123,321) 321 123 >>> print x None >>> x = printf("$2 $1", 123,321, to=None) >>> print x 321 123 Or is that too cryptic? -- g From p.f.moore at gmail.com Tue Sep 6 18:36:19 2005 From: p.f.moore at gmail.com (Paul Moore) Date: Tue, 6 Sep 2005 17:36:19 +0100 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <200509061653.50505.gmccaughan@synaptics-uk.com> References: <1126017216.5469.24.camel@p-dvsi-418-1.rd.francetelecom.fr> <200509061653.50505.gmccaughan@synaptics-uk.com> Message-ID: <79990c6b050906093615dd2a20@mail.gmail.com> On 9/6/05, Gareth McCaughan wrote: > So borrow a trick from Common Lisp and use a destination of None > to mean "return the formatted text as a string". [...] > Or is that too cryptic? Yes. To my mind, formatting (returning a string) and output are separate operations. A "write formatted output" operation is a useful convenience method, but it's not the basic operation. Paul. From kay.schluehr at gmx.net Tue Sep 6 18:39:58 2005 From: kay.schluehr at gmx.net (Kay Schluehr) Date: Tue, 06 Sep 2005 18:39:58 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <431D96AF.7020502@gmail.com> References: <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <431D8F36.2080909@canterbury.ac.nz> <431D96AF.7020502@gmail.com> Message-ID: Nick Coghlan wrote: > Greg Ewing wrote: > >>Guido van Rossum wrote: >> >> >>>So let's call it the "Swiss Army Knife >>>(...Not)" API design pattern. >> >> >>Aha! Maybe this is the long-lost 20th principle from >>the Zen of Python? > > > It also sounds like one of the reasons why the ultimates in programming swiss > army knives (that is, Lisp macros and Ruby blocks) are unlikely to make an > appearance in Python in their full, unconstrained 'glory'. . . In the context of my proposal it seems to imply some variation of Einsteins famous sentence: "Make everything as general as possible but not more general" or "Make everything as powerfull as possible but not more powerfull". The measure of possibility in this context may be serious community requirements. That's why I might have the impression that the language doesn't get any *deeper* but it is still very close to my actual work and highly usable. On the other hand I have to admit that I'm not really glad about 3 functions in favour for one statement. Introducing of a Writer object is just a different way of factoring and handling special cases and defaults. But I don't believe in an absolute truth about that. I'm not an OO stalinist. > There's an interesting comparison with UI design though - having a couple of > different tools in the interface with sensible default behaviour is generally > easier to use than a single tool where you have to tell it which behaviour you > want all the time (or pick one as the default, and have to remember to tell > the application when you want the other behaviour). Hmm.. Guido cited strip, rstrip and lstrip for a good factoring into different functions. To me this is a limit case. It can become annoying soon and an API design antipattern. May I remember about C's vprintf, vfprintf, vsprintf, vsnprintf or the beauty of execl, execle, execlp, execlpe, execv, execve, execvp, execvpe? That's so grotesque that I feel deeply connected to Xah Lees crusade against UNIX in sudden moments ;) Kay From fredrik at pythonware.com Tue Sep 6 19:06:47 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Tue, 6 Sep 2005 19:06:47 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <8328163536998793998@unknownmsgid> Message-ID: putting on my "on the other hand" hat for a moment... ::: Guido van Rossum wrote: > 1. It's always been there > 2. We don't want to type parentheses > 3. We use it a lot > 4. We don't want to change our code there's also: 5. A reserved word makes it easy to grep for (e.g. to weed out debugging prints before release). 6. It does the right thing under the hood: converts things to their string representation, calls write repeatedly instead of allocating a buffer large enough to hold all components, doesn't print un- necessary trailing spaces, etc. 7. Using a statement syntax makes it possible to use a readable syntax for redirection (and other possible extensions); it's obvious that this isn't printing to stdout: print >>file, bacon(ham), spam, "=", eggs(number=count) while print(bacon(ham), spam, "=", eggs(number=count), to=file) is a lot harder to parse, especially if you add more elements and print lines (how fast can you answer the question "do all these redirect to the same place" ?). 8. It can print to anything that implements a "write" method, no matter what it is or what other methods it implements. (etc) ::: and my "on the other hand" gloves... > - I quite often come to a point in the evolution of a program where I > need to change all print statements into logging calls, or calls into > some other I/O or UI library. never happens to me -- in my experience, good logging requires some basic design up front, so you might as well log via functions right from the start. print is reserved for debugging, tracing during development, and console-oriented scripts. I cannot recall ever having converted one of the latter to a component that needed logging. > - Having special syntax puts up a much larger barrier for evolution of > a feature. on the other hand, having special syntax gives you a lot more flexibility in coming up with readable, grokkable solutions. not everything fits into the callable paren list of comma separated stuff some of it with equal signs in it end paren pattern. > - There is a distinct non-linearity in print's ease of use once you > decide that you don't want to have spaces between items in practice, % is a great way to deal with this. > - If it were a function, it would be much easier to replace it within > one module (just def print(*args):...) or even throughout a program > (e.g. by putting a different function in __builtin__.print). if that's an important feature, what keeps us from adding a hook (along the lines of __import__) ? one could even argue that making it easier to override it locally may make it harder to override it globally; consider this local override: def print(fmt, *args): sys.stdout.write("MY MODULE SAYS " + fmt % args) print("blabla") with this in place, changing __builtin__.print will override everything except the prints in "MY MODULE"... so you end up doing a stdout redirect, just as in today's python. (etc) ::: From fredrik at pythonware.com Tue Sep 6 19:09:59 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Tue, 6 Sep 2005 19:09:59 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <431D8F36.2080909@canterbury.ac.nz><431D96AF.7020502@gmail.com> Message-ID: Kay Schluehr wrote: > In the context of my proposal it seems to imply some variation of > Einsteins famous sentence: "Make everything as general as possible but > not more general" or "Make everything as powerfull as possible but not > more powerfull". I prefer McGrath's variation: "Things should be as complex as necessary but not more complex." From trentm at ActiveState.com Tue Sep 6 20:02:44 2005 From: trentm at ActiveState.com (Trent Mick) Date: Tue, 6 Sep 2005 11:02:44 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1125886645.10950.27.camel@geddy.wooz.org> References: <79990c6b05090208367372f705@mail.gmail.com> <4318F8B1.9090701@gmail.com> <79990c6b050903061575d01712@mail.gmail.com> <1125761529.19992.71.camel@presto.wooz.org> <09A815FE-9213-4C6B-B65C-9C4DE4F00137@fuhm.net> <1125852662.10947.5.camel@geddy.wooz.org> <1125886645.10950.27.camel@geddy.wooz.org> Message-ID: <20050906180244.GA23123@ActiveState.com> [Barry Warsaw wrote] > Also, we already have precedence in format+print in the logging > package. I actually think the logging provides a nice, fairly to use > interface that print-ng can be modeled on. The main reason for doing that in the logging package is for performance: processing the args into the format string can be deferred until the logging system knows that the log message will actually be used. I'm not saying that the separation of 'fmt' and args in the logging methods doesn't have the other benefit of clarity: log.debug("%s %s %s %s ...", arg1, arg2, arg3, really_really_long_arg4,) # nicer log.debug("%s %s %s %s ..." % (arg1, arg2, arg3, really_really_long_arg4)) # icky but the performance reason doesn't apply to the printf()/write() discussion here. Trent -- Trent Mick TrentM at ActiveState.com From mcherm at mcherm.com Tue Sep 6 20:45:23 2005 From: mcherm at mcherm.com (Michael Chermside) Date: Tue, 06 Sep 2005 11:45:23 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 Message-ID: <20050906114523.o93q5xcxu6o84o88@login.werra.lunarpages.com> Please bear with me if this has already been stated, or if I ought to be directing this to the wiki instead of to python-dev at this point. I've been trying to follow this whole discussion but have only gotten as far as last Saturday. Two things: First of all, I wanted to encourage Guido. There have been lots of people objecting to "fixing" the print statement, but I for one agree that it's a wart which should be addressed if it can be done elegantly. I'm just not speaking up because others (particularly Guido) have already said most of what I am thinking and I don't want to clutter the discussion with "me too!"'s. And one thing which (as far as I've read) _IS_ a new suggestion. I agree that a new built-in function ought to be named 'print'. This poses problems for those who want to write code NOW that runs in Python 2.x (for large values of x) which will also run in 3.0. We could satisfy both people if in Python 2.x we introduced a built-in function named "print30" (for Python 3.0) with the intended new behavior. People could start coding now using the "print30" builtin. When Python 3.0 was released, 'print' would no longer be a keyword, and both 'print' and 'print30' would be built-ins that both refered to the same function. Sure, it's a tiny bit of backward compatibility cruft to have a second name for the builtin, but it may be worth it because the ability to write in the "Python 3.0 style" (all new-style classes, only raise proper exceptions, etc) in the 2.x series is a VERY useful feature. We want to handle the transition better than Perl. -- Michael Chermside From mcherm at mcherm.com Tue Sep 6 20:54:27 2005 From: mcherm at mcherm.com (Michael Chermside) Date: Tue, 06 Sep 2005 11:54:27 -0700 Subject: [Python-Dev] Hacking print (was: Replacement for print in Python3.0) Message-ID: <20050906115427.ifapqts7qr6s0ogc@login.werra.lunarpages.com> Bill Hanssen writes: > I think the "-ln" > variants made familiar by Pascal and Java were a bad idea, on a par > with the notion of a split between "text" and "binary" file opens. It's a bit off topic, but it wasn't the languages that introduced the difference between "text" and "binary" files. Pascal defined a difference between "text" and "record" files because the operating systems of the time had two distinct file types. Java initially had only one type (binary files which got automagically converted to a stream of unicode characters) and later modified things to allow manual control of the encoding because "modern" operating systems (like Windows) have two distinct file types. Don't blame the language designers, blame the OS folks. -- Michael Chermside From steve at holdenweb.com Tue Sep 6 21:13:11 2005 From: steve at holdenweb.com (Steve Holden) Date: Tue, 06 Sep 2005 15:13:11 -0400 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: <20050906085732.90687.qmail@web53908.mail.yahoo.com> References: <20050906085732.90687.qmail@web53908.mail.yahoo.com> Message-ID: Nick Jacobson wrote: > While we're on the subject of Python 3000, what's the > chance that reference counting when calling C > functions from Python will go away? > > To me this is one of the few annoyances I have with > Python. I know that Ruby somehow gets around the need > for ref. counting. > Reference counting is an implementation detail, and isn't a part of the language specifications. I have no idea why you find it so annoying, but there are other implementations (Jython, Iron Python) that don't use it. regards Steve -- Steve Holden +44 150 684 7255 +1 800 494 3119 Holden Web LLC http://www.holdenweb.com/ From bob at redivi.com Tue Sep 6 21:44:22 2005 From: bob at redivi.com (Bob Ippolito) Date: Tue, 6 Sep 2005 12:44:22 -0700 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: References: <20050906085732.90687.qmail@web53908.mail.yahoo.com> Message-ID: <116A6D3E-3950-4257-81EB-353105E25247@redivi.com> On Sep 6, 2005, at 12:13 PM, Steve Holden wrote: > Nick Jacobson wrote: > >> While we're on the subject of Python 3000, what's the >> chance that reference counting when calling C >> functions from Python will go away? >> >> To me this is one of the few annoyances I have with >> Python. I know that Ruby somehow gets around the need >> for ref. counting. >> >> > Reference counting is an implementation detail, and isn't a part of > the > language specifications. I have no idea why you find it so > annoying, but > there are other implementations (Jython, Iron Python) that don't > use it. Personally I've found that reference counting makes Python really easy to integrate with other systems that may or may not also use reference counting. It is somewhat of a chore to incref/decref all over the place, but you *are* programming in C. -bob From janssen at parc.com Tue Sep 6 21:54:55 2005 From: janssen at parc.com (Bill Janssen) Date: Tue, 6 Sep 2005 12:54:55 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Sun, 04 Sep 2005 08:59:02 PDT." Message-ID: <05Sep6.125458pdt."58617"@synergy1.parc.xerox.com> Guido, I think this is a very nice summary of the arguments for removing print. Let's change the link in PEP 3000 to point to this message. Bill From janssen at parc.com Tue Sep 6 22:01:00 2005 From: janssen at parc.com (Bill Janssen) Date: Tue, 6 Sep 2005 13:01:00 PDT Subject: [Python-Dev] Hacking print (was: Replacement for print in Python3.0) In-Reply-To: Your message of "Tue, 06 Sep 2005 11:54:27 PDT." <20050906115427.ifapqts7qr6s0ogc@login.werra.lunarpages.com> Message-ID: <05Sep6.130103pdt."58617"@synergy1.parc.xerox.com> Sorry to be confusing. I hadn't meant to imply that the split between text and binary files were somehow the fault of any programming languages, just the split between "write" and "writeln". Equally bad ideas with different origins. Though I continue to believe that Python should default to opening a file as "binary", not "text", and that the current default is a defect in Python. Bill > Bill Hanssen writes: > > I think the "-ln" > > variants made familiar by Pascal and Java were a bad idea, on a par > > with the notion of a split between "text" and "binary" file opens. > > It's a bit off topic, but it wasn't the languages that introduced the > difference between "text" and "binary" files. Pascal defined a difference > between "text" and "record" files because the operating systems of the > time had two distinct file types. Java initially had only one type > (binary files which got automagically converted to a stream of unicode > characters) and later modified things to allow manual control of the > encoding because "modern" operating systems (like Windows) have two > distinct file types. > > Don't blame the language designers, blame the OS folks. > > -- Michael Chermside > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/janssen%40parc.com From steve at holdenweb.com Tue Sep 6 22:32:59 2005 From: steve at holdenweb.com (Steve Holden) Date: Tue, 06 Sep 2005 16:32:59 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> <1125973449.24500.6.camel@geddy.wooz.org> <1126008067.13174.11.camel@presto.wooz.org> Message-ID: Guido van Rossum wrote: > On 9/6/05, Barry Warsaw wrote: > >>printf('$1 forgot to frobnicate the $2!\n', username, file.name, >> to=sys.stderr) >> >>While that's a little less self-descriptive for a translator to deal >>with (who would only see the string, not the call site), it certainly >>looks nicer for a non-i18n application, and could certainly work for an >>i18n app too. It's a neat idea worth exploring. > > > Is it worth doing this and completely dropping the %-based formats in > Py3k? (Just asking -- it might be if we can get people to get over the > shock of $ becoming first class ;-). > > >>Also, I think you posted in a separate article a syntactic proposal for >>including detailed formating in $-vars. ${varname:fmt} where 'varname' >>could be an identifier a la PEP 292 or possibly a positional argument. > > > +1 > > I proposed ${varname%fmt} earlier but it prevents you to extend the > varname syntax to arbitrary expressions, which I think is an extension > that will get lots of requests. > I would anticipate security issues with allowing general expressions: you are effectively allowing access to eval(). If a naiive programmer were to use unverified input as a format string unpleasant things could happen ... your call, but it seems dangerous to me. Remember C's printf has been the source of some very dangerous errors. regards Steve -- Steve Holden +44 150 684 7255 +1 800 494 3119 Holden Web LLC http://www.holdenweb.com/ From steven.bethard at gmail.com Tue Sep 6 23:17:35 2005 From: steven.bethard at gmail.com (Steven Bethard) Date: Tue, 6 Sep 2005 15:17:35 -0600 Subject: [Python-Dev] Simplify the file-like-object interface (Replacement for print in Python 3.0) In-Reply-To: <79990c6b050906054943999631@mail.gmail.com> References: <79990c6b05090212453f3b7c77@mail.gmail.com> <431D7BFB.70201@canterbury.ac.nz> <79990c6b050906054943999631@mail.gmail.com> Message-ID: [Greg Ewing] > While we're on the subject, in Py3k I'd like to see > readline(), readlines(), etc. removed from file objects > and made builtin functions instead. It should only > be necessary to implement read() and write() to get > a file-like object having equal status with all > others. [Fredrik Lundh] > maybe some variation of > > http://www.python.org/peps/pep-0246.html > > combined with "default adapters" could come in handy here ? [Paul Moore] > That sounds like a good idea. I'm certainly getting concerned about > the proliferation of methods that people "should" add to file-like > objects, where read/write are the only fundamental ones needed. > > I can't see mixins working, as too many file-like objects are written in C... I'd also prefer something along the lines of Fredrik's suggestion, but I don't write enough C code to understand Paul's last point. Could someone briefly explain why mixins wouldn't work in C code? I scanned the Python/C API Reference Manual, but all I could find was that, for tp_base "only single inheritance is supported; multiple inheritance require dynamically creating a type object by calling the metatype."[1] [1] http://docs.python.org/api/type-structs.html#l2h-983 STeVe -- You can wordify anything if you just verb it. --- Bucky Katt, Get Fuzzy From tdelaney at avaya.com Wed Sep 7 00:37:26 2005 From: tdelaney at avaya.com (Delaney, Timothy (Tim)) Date: Wed, 7 Sep 2005 08:37:26 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 Message-ID: <2773CAC687FD5F4689F526998C7E4E5F4DB5DD@au3010avexu1.global.avaya.com> Michael Chermside wrote: > We could satisfy both people if in Python 2.x we introduced a > built-in function named "print30" (for Python 3.0) with the intended > new behavior. People could start coding now using the "print30" > builtin. When Python 3.0 was released, 'print' would no longer be > a keyword, and both 'print' and 'print30' would be built-ins that > both refered to the same function. -1000 It's ugly, and it doesn't help the transition whatsoever IMO. We *definitely* don't want a print30 function hanging around in Python 3.0 for backwards compatibility with the miniscule number of people who used it in Python 2.x. The simplest solution is (as already stated):: from __future__ import __print_function__ Tim Delaney From janssen at parc.com Wed Sep 7 01:19:07 2005 From: janssen at parc.com (Bill Janssen) Date: Tue, 6 Sep 2005 16:19:07 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Sun, 04 Sep 2005 09:53:49 PDT." <1125852829.10947.10.camel@geddy.wooz.org> Message-ID: <05Sep6.161909pdt."58617"@synergy1.parc.xerox.com> > LOL! That's a great solution for the 5 of us dinosaurs still using the > One True Editor. :) And who also still program in C now and then :-). I think there are more than 5 of us, though. Bill From janssen at parc.com Wed Sep 7 01:28:02 2005 From: janssen at parc.com (Bill Janssen) Date: Tue, 6 Sep 2005 16:28:02 PDT Subject: [Python-Dev] string formatting options and removing basestring.__mod__ (WAS: Replacement for print in Python 3.0) In-Reply-To: Your message of "Mon, 05 Sep 2005 08:48:13 PDT." <200509051648.13855.gmccaughan@synaptics-uk.com> Message-ID: <05Sep6.162804pdt."58617"@synergy1.parc.xerox.com> > Some languages have "picture" formats, where the structure > of the format string more closely mimics that of the desired > output. (This is true, e.g., of some Basics and of one variety > of Perl output.) The trouble with this is that it limits how > much information you can provide about *how* each value is > to be formatted within the available space. COBOL! From the snippet Steven posted about C#, it seems to have a mode of "custom number formatting" which is picture-based. Bill From janssen at parc.com Wed Sep 7 01:37:57 2005 From: janssen at parc.com (Bill Janssen) Date: Tue, 6 Sep 2005 16:37:57 PDT Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: Your message of "Tue, 06 Sep 2005 05:44:38 PDT." <431D8F36.2080909@canterbury.ac.nz> Message-ID: <05Sep6.163759pdt."58617"@synergy1.parc.xerox.com> Guido van Rossum wrote: > So let's call it the "Swiss Army Knife > (...Not)" API design pattern. IIRC, this is one of the design principles which inspired Lisp mixins. The idea was that different interfaces should be separated into different classes. If you needed a class which combined them, you'd just mix two superclasses together. Bill From T.A.Meyer at massey.ac.nz Mon Sep 5 04:47:25 2005 From: T.A.Meyer at massey.ac.nz (Meyer, Tony) Date: Mon, 5 Sep 2005 14:47:25 +1200 Subject: [Python-Dev] Replacement for print in Python 3.0 Message-ID: > In the end the process is not democratic. Which may make it easier: rather than having to convince 50%+ of the people, one only has to convince a single person... > I don't think there's anything that can change my mind > about dropping the statement. As long as "I don't think there's anything" isn't "There isn't anything", there is still hope, and the potential that the one person's opinion that matters can be changed. However, when I wrote the email, I assumed you wouldn't read it (because you said you were leaving the discussion until there was a PEP). What I wanted to know was what the best way of putting together succinct, clear, reasons why you should change your mind would be, so that could be done. Even if you didn't change your mind, at least it would be (judging from previous decision reversals) the best shot. =Tony.Meyer From rbp at isnomore.net Tue Sep 6 22:26:02 2005 From: rbp at isnomore.net (Rodrigo Bernardo Pimentel) Date: Tue, 6 Sep 2005 17:26:02 -0300 Subject: [Python-Dev] Python core documentation Message-ID: <20050906202602.GE4534@isnomore.net> Hi! I sent this to Fred Drake a few weeks ago but got no response. I assume he's busy, or maybe my message never reached him. I hope some of you will have opinions on this (BTW, please Cc: me on any replies, as I am not on python-dev). (Original message below) I was sharing some ideas with Gustavo Niemeyer (who's also receiving a copy of this message) and he told me you'd be the right person to talk to [he was also the one who recommended that I resent it to python-dev]. I'm relatively new to Python, my first project with it started at the beginning of 2004. And, from the start, its documentation bugged me a little. Now I'm completely hooked and am a full-time Python programmer, but I still see the same quirks in documentation. I don't mean to say there's lack of it, but I think it needs some work, it seems quite incomplete. I see some of these characteristics in the tutorial and module documentation, but I'm refering mostly to internal documentation. A simple example: >>> [].sort.__doc__ L.sort(cmpfunc=None) -- stable sort *IN PLACE*; cmpfunc(x, y) -> -1, 0, 1 While it may seem obvious to somewhat experiencied programmers, it should be explicit, at least for newbies, what "-1, 0, 1" means, in term of comparison (and also what happens if cmpfunc is left out - since it defaults to None, one could think no sorting takes place). But this is relatively minor, and not the best example. >>> [].remove.__doc__ L.remove(value) -- remove first occurrence of value What if L doesn't contain 'value', does it raise an exception or does it fail quietly? Does 'remove return anything (the new list, maybe)? >>> [].pop.__doc__ L.pop([index]) -> item -- remove and return item at index (default last) What if L is empty? I think you see what I'm getting at: there's a lack of standardization and completeness that can be frustrating, especially for those new to Python and to programming. When I came to Python, I was already a long-time C and Perl programmer, and I got around these things quite easily, mainly by testing at the prompt or sometimes reading source code, but, still, it doesn't seem like a very pythonic way of doing things (explicit better than implicit?). Not to mention clever editors, which could benefit from standard, complete documentation. There are even some modules with empty docstrings, which I think should be strictly forbidden in core modules. For instance: >>> thread.error.__doc__ >>> As I told Niemeyer, I considered sending documentation patches, but I think a standard should be defined first, and then volunteers (myself included) could sweep over the core language and conform documentation to it. I'm willing to work on it and help however I can, but I wanted to discuss it first (that's why I came to Niemeyer). Well, let me know what you think. Cheers, rbp -- Rodrigo Bernardo Pimentel | GPG KeyId: <0x0DB14978> http://isnomore.net I'll rule you all with an iron fist! [...] You! Obey the fist! -- Invader Zim From fdrake at acm.org Wed Sep 7 04:19:27 2005 From: fdrake at acm.org (Fred L. Drake, Jr.) Date: Tue, 6 Sep 2005 22:19:27 -0400 Subject: [Python-Dev] Python core documentation In-Reply-To: <20050906202602.GE4534@isnomore.net> References: <20050906202602.GE4534@isnomore.net> Message-ID: <200509062219.27971.fdrake@acm.org> On Tuesday 06 September 2005 16:26, Rodrigo Bernardo Pimentel wrote: > I sent this to Fred Drake a few weeks ago but got no response. I > assume he's busy, or maybe my message never reached him. I hope some of It did reach me, but feel into the black hole of "I can't deal with this in the next 5 minutes." Sorry. I do intend to read your message carefully and respond then. -Fred -- Fred L. Drake, Jr. From greg.ewing at canterbury.ac.nz Wed Sep 7 04:36:31 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Wed, 07 Sep 2005 14:36:31 +1200 Subject: [Python-Dev] Example for "property" violates "Python is not a one pass compiler" In-Reply-To: <431DA695.7070401@gmail.com> References: <5.1.1.6.0.20050905120907.01b74500@mail.telecommunity.com> <431D7F97.1010604@canterbury.ac.nz> <431DA695.7070401@gmail.com> Message-ID: <431E522F.6060901@canterbury.ac.nz> Nick Coghlan wrote: > It builds the symbol table before actually trying to compile anything. This is > what allows it to figure out which load commands to use for which symbols. Yes, nowadays I expect it makes two passes over the parse tree for each function, one to build the symbol table and one to generate the bytecode. It used to make a single pass over the parse tree and then post-process the bytecode once it had figured out which variables were local -- which I suppose you could call one-and-a-bit passes. :-) The distinction between one and two passes isn't so important in a dynamic language like Python anyway, since most of the interesting things happen at run time. -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From fdrake at acm.org Wed Sep 7 05:10:09 2005 From: fdrake at acm.org (Fred L. Drake, Jr.) Date: Tue, 6 Sep 2005 23:10:09 -0400 Subject: [Python-Dev] Python core documentation In-Reply-To: <20050906202602.GE4534@isnomore.net> References: <20050906202602.GE4534@isnomore.net> Message-ID: <200509062310.10132.fdrake@acm.org> On Tuesday 06 September 2005 16:26, Rodrigo Bernardo Pimentel wrote: > I sent this to Fred Drake a few weeks ago but got no response. I > assume he's busy, or maybe my message never reached him. I hope some of > you will have opinions on this (BTW, please Cc: me on any replies, as I am > not on python-dev). Again, I'm sorry I haven't had time to reply until now, but reminders/re-posts like this are certainly a welcome reminder! > (Original message below) > > I was sharing some ideas with Gustavo Niemeyer (who's also > receiving a copy of this message) and he told me you'd be the right person > to talk to [he was also the one who recommended that I resent it to > python-dev]. I'll suggest that the Documentation SIG is a better place to discuss this for some reasons, and I've CC'd that list as well. > I'm relatively new to Python, my first project with it started at > the beginning of 2004. And, from the start, its documentation bugged me a > little. Now I'm completely hooked and am a full-time Python programmer, > but I still see the same quirks in documentation. > > I don't mean to say there's lack of it, but I think it needs some > work, it seems quite incomplete. I see some of these characteristics in > the tutorial and module documentation, but I'm refering mostly to internal > documentation. It appears that by "internal" you're referring to the docstrings available from the runtime. I generally only think of those as hints or reminders, and not complete documentation (other minds disagree). For the non-docstring documentation, the same kinds of issues occur, though not always for the same features. I'd categorize the issues you point out into two groups: A) Omissions. You're right; there's a lot of places we haven't been as thorough as we should be. These certainly should be corrected by adding the missing information. B) Vague contracts. There are many places where documentation is omitted because the contracts of the documented feature aren't clearly specified by the code. This may happen for many reasons, but how each should be handled has to be determined on a case-by-case basis. In many cases, it's intentional that edge cases aren't well specified, simply because the treatment hasn't been discussed and decided. This case can usually be resolved by bringing up specific cases; once there's some discussion, useful documentation can be written because the documentation writers learn what the intent was (or the developers have to decide what the contract should be). Historically, I think we've seen a lot of (B) simply because there's an expectation of users will read the source to determine what the feature will do in any given case. As we see more implementations appear, and as the size and range of Python's audience grow, this becomes a less reasonable approach. This is especially the case for features implemented in C, since users are increasingly unlikely to have the C sources handy due to the use of pre-compiled packages on all platforms. [...lots of specific examples elided...] > As I told Niemeyer, I considered sending documentation patches, > but I think a standard should be defined first, and then volunteers > (myself included) could sweep over the core language and conform > documentation to it. I'm willing to work on it and help however I can, but > I wanted to discuss it first (that's why I came to Niemeyer). It would be good to have more specific guidelines for documentation. We've generally avoided trying to specify what exceptions can be raised by various functions or methods, and describe only specific cases that are guaranteed as part of the API. Treatment of edge cases is often left as an accident as well, though not as frequently. As the documentation increasingly becomes the way that programmers learn about the details of the library, we need to think about whether this is the right approach. In addition to this, we should settle the question of completeness of docstrings and document it. Anything missing that should be included according to that decision should then be added. Also, the level of detail regarding edge cases and exceptions that we're willing to make contract should be discussed, and documentation brought up to snuff. This is more likely an issue that will require case-by-case treatment. -Fred -- Fred L. Drake, Jr. From greg.ewing at canterbury.ac.nz Wed Sep 7 05:33:42 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Wed, 07 Sep 2005 15:33:42 +1200 Subject: [Python-Dev] Simplify the file-like-object interface (Replacement for print in Python 3.0) In-Reply-To: References: <79990c6b05090212453f3b7c77@mail.gmail.com> <431D7BFB.70201@canterbury.ac.nz> <79990c6b050906054943999631@mail.gmail.com> Message-ID: <431E5F96.2080802@canterbury.ac.nz> Steven Bethard wrote: > Could > someone briefly explain why mixins wouldn't work in C code? Depends on what you mean by "work in C code". It's only possible for a type object to inherit C struct members from one base class, since the struct has to be an extension of the base C struct. Dynamic attributes and methods can be inherited from multiple base classes, however, if you're willing to write the necessary C code to create the type object dynamically, as would happen if it were being defined with Python code. -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From greg.ewing at canterbury.ac.nz Wed Sep 7 05:47:10 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Wed, 07 Sep 2005 15:47:10 +1200 Subject: [Python-Dev] string formatting options and removing basestring.__mod__ (WAS: Replacement for print in Python 3.0) In-Reply-To: <05Sep6.162804pdt.58617@synergy1.parc.xerox.com> References: <05Sep6.162804pdt.58617@synergy1.parc.xerox.com> Message-ID: <431E62BE.8050201@canterbury.ac.nz> Bill Janssen wrote: > > someone wrote: > > > Some languages have "picture" formats, where the structure > > of the format string more closely mimics that of the desired > > output. > > COBOL! From the snippet Steven posted about C#, it seems to have a > mode of "custom number formatting" which is picture-based. A nice characteristic of a well-designed picture formatting system is that the pictures take up the same amount of space in the format string as the output they generate. So you can write things like headings = "Description Qty Price Amount" format = "AAAAAAAAAAAAAAA ###0 $#,##0.00 $#,###,##0.00" and visually check that the headings and columns will line up, without having to be concerned with the exact numbers of characters in each column. I think a picture-formatting function would be a nice thing to have as an alternative to %-formatting or whatever will replace it. And-then-we-can-call-it-PyBol-3000-ly, -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From stephen at xemacs.org Wed Sep 7 06:20:54 2005 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Wed, 07 Sep 2005 13:20:54 +0900 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: (Guido van Rossum's message of "Tue, 6 Sep 2005 07:15:23 -0700") References: Message-ID: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> >>>>> "Guido" == Guido van Rossum writes: Guido> I'm not at all convinced that we should attempt to find a Guido> solution that handles both use cases [print replacement Guido> and i18n]; most Python code never needs i18n. It's true that the majority of Python applications never need i18n, because they're only used in one language. But Python applications are mostly assembled from a large and growing set of Python-standard and other well-known libraries. It would be very useful to keep the barriers to i18n-ization as low as possible to make those libraries as broadly applicable as possible. You're talking about Python 3.0; I don't know if it can be done within a reasonable amount of effort (and if not, too bad), but in that planning horizon it is surely worth some effort to find a solution. -- School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN Ask not how you can "do" free software business; ask what your business can "do for" free software. From guido at python.org Wed Sep 7 06:45:22 2005 From: guido at python.org (Guido van Rossum) Date: Tue, 6 Sep 2005 21:45:22 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> Message-ID: On 9/6/05, Stephen J. Turnbull wrote: > It's true that the majority of Python applications never need i18n, > because they're only used in one language. But Python applications > are mostly assembled from a large and growing set of Python-standard > and other well-known libraries. It would be very useful to keep the > barriers to i18n-ization as low as possible to make those libraries as > broadly applicable as possible. Sure, we must provide good i18n support. But the burden on users who don't need i18n should be negligeable; they shouldn't have to type or know extra stuff that only exists for the needs of i18n. The same is true for many other needs of library authors and programming-in-the-large: programming-in-the-small should come first and foremost. We don't need another J2EE. > You're talking about Python 3.0; I don't know if it can be done within > a reasonable amount of effort (and if not, too bad), but in that > planning horizon it is surely worth some effort to find a solution. There seem to be many people interested in finding this solution; I see it as my task (among others) to make sure that their solution doesn't negatively affect the life of the majority of users who don't need it. Even if there's a class of users who think they don't need it and in the end find they do. That's too bad, they will have to apply some global transformation to their code. I hope that making print a function will help make that transformation easier. I've seen a couple of responses claiming that with good planning there won't be a need for such transformation (and consequently they don't need the changes I'm proposing). Well duh! I've never had perfect foresight. If you always plan ahead for what you might need, you inevitably end up writing an overly heavy framework. Remember YAGNI! -- --Guido van Rossum (home page: http://www.python.org/~guido/) From greg.ewing at canterbury.ac.nz Wed Sep 7 06:46:09 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Wed, 07 Sep 2005 16:46:09 +1200 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: References: <20050906085732.90687.qmail@web53908.mail.yahoo.com> Message-ID: <431E7091.5070104@canterbury.ac.nz> Guido van Rossum wrote: >>While we're on the subject of Python 3000, what's the >>chance that reference counting when calling C >>functions from Python will go away? > > We'd have to completely change the implementation. We're not planning on that. Also, the refcounting would have to be replaced by something else that would also be fairly intrusive on the C interface, such as having to remember to make all your local variables known to the garbage collector. A better plan would be to build something akin to Pyrex into the scheme of things, so that all the refcount/GC issues are taken care of automatically. -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From guido at python.org Wed Sep 7 06:58:08 2005 From: guido at python.org (Guido van Rossum) Date: Tue, 6 Sep 2005 21:58:08 -0700 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: <431E7091.5070104@canterbury.ac.nz> References: <20050906085732.90687.qmail@web53908.mail.yahoo.com> <431E7091.5070104@canterbury.ac.nz> Message-ID: On 9/6/05, Greg Ewing wrote: > A better plan would be to build something akin to > Pyrex into the scheme of things, so that all the > refcount/GC issues are taken care of automatically. That sounds exciting. I have to admit that despite hearing many enthusiastic reviews, I've never used it myself -- in fact I've written very little C code in the last few years, and zero new extension modules. (Lots of Java, but that's another story. :-) I expect that many standard extensions could benefit from a rewrite in Pyrex, although this might take a lot of work and in some cases not necessarily result in better code (_tkinter comes to mind -- though I don't really know why this would be). So this shouldn't be the goal (yet). Instead, we should encourage folks to write *new* extensions using PyRex. How stable is Pyrex? Would you be willing to integrate it thoroughly with the Python source tree, to the point of contributing the code to the PSF? (Without giving up ownership or responsibility for its maintenance.) -- --Guido van Rossum (home page: http://www.python.org/~guido/) From tdelaney at avaya.com Wed Sep 7 07:45:48 2005 From: tdelaney at avaya.com (Delaney, Timothy (Tim)) Date: Wed, 7 Sep 2005 15:45:48 +1000 Subject: [Python-Dev] reference counting in Py3K Message-ID: <2773CAC687FD5F4689F526998C7E4E5F4DB5E7@au3010avexu1.global.avaya.com> Guido van Rossum wrote: > How stable is Pyrex? Would you be willing to integrate it thoroughly > with the Python source tree, to the point of contributing the code to > the PSF? (Without giving up ownership or responsibility for its > maintenance.) +100 I would be *strongly* in favour of this. Apart from anything else, it would greatly lower the bar for people wanting to add to the standard library. And if much of the stdlib were eventually rewritten in Pyrex it would be even better. Tim Delaney From pje at telecommunity.com Wed Sep 7 08:01:01 2005 From: pje at telecommunity.com (Phillip J. Eby) Date: Wed, 07 Sep 2005 02:01:01 -0400 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: References: <431E7091.5070104@canterbury.ac.nz> <20050906085732.90687.qmail@web53908.mail.yahoo.com> <431E7091.5070104@canterbury.ac.nz> Message-ID: <5.1.1.6.0.20050907014538.01b6e8f8@mail.telecommunity.com> At 09:58 PM 9/6/2005 -0700, Guido van Rossum wrote: >On 9/6/05, Greg Ewing wrote: > > A better plan would be to build something akin to > > Pyrex into the scheme of things, so that all the > > refcount/GC issues are taken care of automatically. > >That sounds exciting. I have to admit that despite hearing many >enthusiastic reviews, I've never used it myself -- in fact I've >written very little C code in the last few years, and zero new >extension modules. (Lots of Java, but that's another story. :-) > >I expect that many standard extensions could benefit from a rewrite in >Pyrex, although this might take a lot of work and in some cases not >necessarily result in better code (_tkinter comes to mind -- though I >don't really know why this would be). So this shouldn't be the goal >(yet). Instead, we should encourage folks to write *new* extensions >using PyRex. Just an FYI; Pyrex certainly makes it relatively painless to write code that interfaces with C, but it doesn't do much for performance, and naively-written Pyrex code can actually be slower than carefully-optimized Python code. So, for existing modules that were written in C for performance reasons, Pyrex isn't currently a substitute. One of the reasons for this is that Pyrex code uses the generic Python/C APIs, like PySequence_GetItem, even in cases where PyList_GetItem or its macro form would be more appropriate. Pyrex has no way currently to say, "this is type X's C API, so use it when you have a variable that's of type X, instead of using the generic object protocols." There are other issues that contribute to the inefficiency as well, like redundant refcounting, assigning None to temporary variables, etc. I haven't used the absolute latest version of Pyrex, but older versions also used C strings for attribute lookups, which was horribly slow. I think the latest version now creates string objects at module initialization to avoid this issue, though. Anyway, don't get me wrong - Pyrex is by *far* my preferred way of writing C extensions for Python. And I'd love to see a version that used your proposed static typing syntax in place of the "cdef" syntax it currently uses, thus paving the way to selective translation of Python to C. I'd just like to set expectations appropriately, for what it is and isn't good at. Sadly, the current state of Pyrex is such that to write efficient code, you have to use the Python C API from Pyrex, which tends to result in something that looks like C code with Python-like syntax. But you don't have to worry about memory allocation, exception labels, or reference counting, so it's more compact and less error-prone than C too. From mwh at python.net Wed Sep 7 09:55:09 2005 From: mwh at python.net (Michael Hudson) Date: Wed, 07 Sep 2005 08:55:09 +0100 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: <431E7091.5070104@canterbury.ac.nz> (Greg Ewing's message of "Wed, 07 Sep 2005 16:46:09 +1200") References: <20050906085732.90687.qmail@web53908.mail.yahoo.com> <431E7091.5070104@canterbury.ac.nz> Message-ID: <2mu0gxmhwi.fsf@starship.python.net> Greg Ewing writes: > Guido van Rossum wrote: > >>>While we're on the subject of Python 3000, what's the >>>chance that reference counting when calling C >>>functions from Python will go away? >> >> We'd have to completely change the implementation. We're not >> planning on that. > > Also, the refcounting would have to be replaced by > something else that would also be fairly intrusive > on the C interface, such as having to remember to > make all your local variables known to the garbage > collector. > > A better plan would be to build something akin to > Pyrex into the scheme of things, so that all the > refcount/GC issues are taken care of automatically. Certainly, one of the goals of the PyPy project is to do experiments on GC strategy... Cheers, mwh -- If trees could scream, would we be so cavalier about cutting them down? We might, if they screamed all the time, for no good reason. -- Jack Handey From jcarlson at uci.edu Wed Sep 7 09:57:36 2005 From: jcarlson at uci.edu (Josiah Carlson) Date: Wed, 07 Sep 2005 00:57:36 -0700 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: References: <431E7091.5070104@canterbury.ac.nz> Message-ID: <20050907001521.8B56.JCARLSON@uci.edu> Guido van Rossum wrote: > On 9/6/05, Greg Ewing wrote: > > A better plan would be to build something akin to > > Pyrex into the scheme of things, so that all the > > refcount/GC issues are taken care of automatically. > > That sounds exciting. I have to admit that despite hearing many > enthusiastic reviews, I've never used it myself -- in fact I've > written very little C code in the last few years, and zero new > extension modules. (Lots of Java, but that's another story. :-) Here's a perspective "from the trenches" as it were. I've been writing quite a bit of code, initially all in Python (27k lines in the last year or so). It worked reasonably well and fast. It wasn't fast enough. I needed a 25x increase in performance, which would have been easily attainable if I were to rewrite everything in C, but writing a module in pure C is a bit of a pain (as others can attest), so I gave Pyrex a shot (after scipy.weave.inline, ick). Initial versions ran around 2-3x as fast as pure Python. With various tricks, we are now running 75-100x faster in the pure Pyrex portions, with another 2-3x improvement possible (even using the VC6 compiler in Windows and old versions of gcc in linux, talk about multi-platform development!). With experience comes wisdom. I write new functionality that needs to be fast in pure C, wrapping it with Pyrex as necessary (which is quite simple), and make it all work with Python. > I expect that many standard extensions could benefit from a rewrite in > Pyrex, although this might take a lot of work and in some cases not > necessarily result in better code (_tkinter comes to mind -- though I > don't really know why this would be). So this shouldn't be the goal > (yet). Instead, we should encourage folks to write *new* extensions > using Pyrex. I'm not sure this is necessarily desireable. In my limited experience, one starts doing a line-by-line translation, getting Python objects as variables, etc. Then one starts predefining C variables and working with them, increasing speed by some measureable amount. Then one starts thinking about the data structures that are being passed (lists of lists, dictionary of lists, lists of dictionaries, ...), at which point one starts digging into PyList_GetItem, etc., manual in/decrefing, ..., and one's code starts getting the ugly of C modules, without the braces and semicolons. Offering it up as a standard library module: cool, +1. Give people one of the the best tools for wrapping C code and writing high-performance Python-accessable software. Encouraging its use for the writing of new extension modules: ick, -1. Writing pretty yet high performing Pyrex is an art that I'm not sure anyone can master. Perhaps a bit into the future, extending import semantics to notice .pyx files, compare their checksum against a stored md5 in the compiled .pyd/.so, and automatically recompiling them if they (or their includes) have changed: +10 (I end up doing this kind of thing by hand with phantom auto-build modules). - Josiah From fredrik at pythonware.com Wed Sep 7 10:17:22 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Wed, 7 Sep 2005 10:17:22 +0200 Subject: [Python-Dev] reference counting in Py3K References: <431E7091.5070104@canterbury.ac.nz> <20050907001521.8B56.JCARLSON@uci.edu> Message-ID: Josiah Carlson wrote: > Perhaps a bit into the future, extending import semantics to notice .pyx > files, compare their checksum against a stored md5 in the compiled > .pyd/.so, and automatically recompiling them if they (or their includes) > have changed: +10 (I end up doing this kind of thing by hand with > phantom auto-build modules). which reminds me... does anyone know what happened to the various "inline C" versions that were quite popular a few years ago. e.g. http://mail.python.org/pipermail/python-dev/2002-January/019178.html (I've been using an extremely simple home-brewn version in a couple of projects, and it's extremely addictive, at least if you're a C/C++ veteran...) From rkern at ucsd.edu Wed Sep 7 10:40:13 2005 From: rkern at ucsd.edu (Robert Kern) Date: Wed, 07 Sep 2005 01:40:13 -0700 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: References: <431E7091.5070104@canterbury.ac.nz> <20050907001521.8B56.JCARLSON@uci.edu> Message-ID: Fredrik Lundh wrote: > Josiah Carlson wrote: > >>Perhaps a bit into the future, extending import semantics to notice .pyx >>files, compare their checksum against a stored md5 in the compiled >>.pyd/.so, and automatically recompiling them if they (or their includes) >>have changed: +10 (I end up doing this kind of thing by hand with >>phantom auto-build modules). http://www.prescod.net/pyximport/ > which reminds me... does anyone know what happened to the various > "inline C" versions that were quite popular a few years ago. e.g. > > http://mail.python.org/pipermail/python-dev/2002-January/019178.html > > (I've been using an extremely simple home-brewn version in a couple of > projects, and it's extremely addictive, at least if you're a C/C++ veteran...) weave is alive and kicking and actively used although it could use some TLC. http://svn.scipy.org/svn/scipy_core -- Robert Kern rkern at ucsd.edu "In the fields of hell where the grass grows high Are the graves of dreams allowed to die." -- Richard Harter From ncoghlan at gmail.com Wed Sep 7 10:59:56 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 07 Sep 2005 18:59:56 +1000 Subject: [Python-Dev] Python 3 design principles In-Reply-To: <43aa6ff705083114004924f4a9@mail.gmail.com> References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <43aa6ff705083114004924f4a9@mail.gmail.com> Message-ID: <431EAC0C.4030800@gmail.com> Collin Winter wrote: > Am 31-Aug 05, Charles Cazabon schrieb: > > >>Perhaps py3k could have a py2compat module. Importing it could have the >>effect of (for instance) putting compile, id, and intern into the global >>namespace, making print an alias for writeln, alias the standard library >>namespace, ... ? > > > from __past__ import python2 If we ever get the ast-branch finished, then keeping a copy of the final 2.x parser around that targets the Python AST should actually be feasible, too. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From stephen at xemacs.org Wed Sep 7 11:23:16 2005 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Wed, 07 Sep 2005 18:23:16 +0900 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: (Guido van Rossum's message of "Tue, 6 Sep 2005 21:45:22 -0700") References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> Message-ID: <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> >>>>> "Guido" == Guido van Rossum writes: Guido> Sure, we must provide good i18n support. But the burden on Guido> users who don't need i18n should be negligeable; they Guido> shouldn't have to type or know extra stuff that only exists Guido> for the needs of i18n. Agreed. That's best for i18n, too, if we can arrange a batteries- included approach to i18n at the same time. >> You're talking about Python 3.0; I don't know if it can be done >> within a reasonable amount of effort (and if not, too bad), but >> in that planning horizon it is surely worth some effort to find >> a solution. Guido> There seem to be many people interested in finding this Guido> solution; I see it as my task (among others) to make sure Guido> that their solution doesn't negatively affect the life of Guido> the majority of users who don't need it. Convenient as a Python optimized for i18n would be for me personally, I agree with that, too. But you wrote, "I'm not at all convinced that we should attempt to find a solution that handles both use cases; most Python code never needs i18n." And now, "That's too bad, [those who need i18n] will have to apply some global transformation to their code." It sounds to me like you have already decided that i18n applications will have to use a different way. But print-ng looks like becoming the OOWTDI for a lot of applications. IMO it's just too early to give up on print-ng becoming the one obvious way to do it for a lot of i18n apps, too. I realize that maybe it won't be solved for Python 3.0. Just, please don't close the door on it yet! Guido> Remember YAGNI! For-values-of-Y=I-A=am-ly y'rs, -- School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN Ask not how you can "do" free software business; ask what your business can "do for" free software. From abo at minkirri.apana.org.au Wed Sep 7 11:53:13 2005 From: abo at minkirri.apana.org.au (Donovan Baarda) Date: Wed, 7 Sep 2005 19:53:13 +1000 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: <5.1.1.6.0.20050907014538.01b6e8f8@mail.telecommunity.com> References: <431E7091.5070104@canterbury.ac.nz> <20050906085732.90687.qmail@web53908.mail.yahoo.com> <431E7091.5070104@canterbury.ac.nz> <5.1.1.6.0.20050907014538.01b6e8f8@mail.telecommunity.com> Message-ID: <20050907095313.GA12458@minkirri.apana.org.au> On Wed, Sep 07, 2005 at 02:01:01AM -0400, Phillip J. Eby wrote: [...] > Just an FYI; Pyrex certainly makes it relatively painless to write code > that interfaces with C, but it doesn't do much for performance, and > naively-written Pyrex code can actually be slower than carefully-optimized > Python code. So, for existing modules that were written in C for > performance reasons, Pyrex isn't currently a substitute. I just want to second this; my experiments with pyrex on pysync produced no speedups. I got a much more noticable speed benefit from psyco. This was admittedly a long time ago... -- ---------------------------------------------------------------- Donovan Baarda http://minkirri.apana.org.au/~abo/ ---------------------------------------------------------------- From radeex at gmail.com Wed Sep 7 12:03:45 2005 From: radeex at gmail.com (Christopher Armstrong) Date: Wed, 7 Sep 2005 20:03:45 +1000 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: <20050907001521.8B56.JCARLSON@uci.edu> References: <431E7091.5070104@canterbury.ac.nz> <20050907001521.8B56.JCARLSON@uci.edu> Message-ID: <60ed19d40509070303449cafad@mail.gmail.com> On 9/7/05, Josiah Carlson wrote: > > Guido van Rossum wrote: > > On 9/6/05, Greg Ewing wrote: > > > A better plan would be to build something akin to > > > Pyrex into the scheme of things, so that all the > > > refcount/GC issues are taken care of automatically. > > > > That sounds exciting. I have to admit that despite hearing many > > enthusiastic reviews, I've never used it myself -- in fact I've > > written very little C code in the last few years, and zero new > > extension modules. (Lots of Java, but that's another story. :-) > > Here's a perspective "from the trenches" as it were. > > Encouraging its use for the writing of new extension modules: ick, -1. > Writing pretty yet high performing Pyrex is an art that I'm not sure > anyone can master. I'd just like to put in that it seems like the suggestions to use Pyrex were aimed at C-library wrapping extensions, not necessarily ones that were written in C for performance (I gather that there are very few of those, comparatively). So the encouragement to use Pyrex for new extension modules still seems perfect, to me; its use should definitely be encouraged when one needs to wrap some third-party library, and I'd bet that that's the common case. -- Twisted | Christopher Armstrong: International Man of Twistery Radix | -- http://radix.twistedmatrix.com | Release Manager, Twisted Project \\\V/// | -- http://twistedmatrix.com |o O| | w----v----w-+ From barry at python.org Wed Sep 7 14:06:48 2005 From: barry at python.org (Barry Warsaw) Date: Wed, 07 Sep 2005 08:06:48 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> Message-ID: <1126094808.12806.2.camel@presto.wooz.org> On Wed, 2005-09-07 at 05:23, Stephen J. Turnbull wrote: > But print-ng looks > like becoming the OOWTDI for a lot of applications. IMO it's just too > early to give up on print-ng becoming the one obvious way to do it for > a lot of i18n apps, too. +1. I have a gut feeling that we can make it easy for monolinguists to use printng without caring or even knowing about i18n, but also make it relatively painless to integrate i18n into an application or library. However I haven't had time to really explore that idea. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050907/adc63da0/attachment.pgp From p.f.moore at gmail.com Wed Sep 7 14:07:23 2005 From: p.f.moore at gmail.com (Paul Moore) Date: Wed, 7 Sep 2005 13:07:23 +0100 Subject: [Python-Dev] Simplify the file-like-object interface (Replacement for print in Python 3.0) In-Reply-To: References: <79990c6b05090212453f3b7c77@mail.gmail.com> <431D7BFB.70201@canterbury.ac.nz> <79990c6b050906054943999631@mail.gmail.com> Message-ID: <79990c6b0509070507dacb4ef@mail.gmail.com> On 9/6/05, Steven Bethard wrote: > I'd also prefer something along the lines of Fredrik's suggestion, but > I don't write enough C code to understand Paul's last point. Could > someone briefly explain why mixins wouldn't work in C code? I had in mind "it would be complicated and messy, and probably no easier than just implementing all of the extra methods by hand" rather than "it's impossible". Sorry for being unclear. I haven't written many C extensions, either, so I may be misremembering. Also, the biggest extension I wrote, which does implement a file-like object, was written when Python 1.4/1.5 was current, and (still) uses the C API from then. (BTW, that's a great tribute to the backward-compatibility that Python provides!) Paul. From ncoghlan at gmail.com Wed Sep 7 14:55:43 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 07 Sep 2005 22:55:43 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1126094808.12806.2.camel@presto.wooz.org> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> Message-ID: <431EE34F.60107@gmail.com> Barry Warsaw wrote: > On Wed, 2005-09-07 at 05:23, Stephen J. Turnbull wrote: > > >>But print-ng looks >>like becoming the OOWTDI for a lot of applications. IMO it's just too >>early to give up on print-ng becoming the one obvious way to do it for >>a lot of i18n apps, too. > > > +1. I have a gut feeling that we can make it easy for monolinguists to > use printng without caring or even knowing about i18n, but also make it > relatively painless to integrate i18n into an application or library. > However I haven't had time to really explore that idea. I found the following to be an interesting experiment: ------------- from string import Template def format(*args, **kwds): fmt = args[0] kwds.update(("p%s" % idx, arg) for idx, arg in enumerate(args)) return Template(fmt).substitute(**kwds) Py> format("$p1: $p2", "Bee count", 0.5) 'Bee count: 0.5' ------------- The leading 'p' (for 'positional') is necessary to get around the fact that $1 is currently an illegal identifier in a Template. If we actually did something like this, I would advocate adding the support for positional arguments directly to string.Template. For il8n output, you would be pulling the format string from somewhere else, so you would stick with the current idiom of using keyword arguments: ------------- Py> fmt = "$item: $count" Py> format(fmt, item="Bee count", count=0.5) 'Bee count: 0.5' ------------- There's also the cute-and-kinda-useless-but-it-also-justifies-the-1-based-indexing: ------------- Py> format("Kinda cute: $p0") 'Kinda cute: Kinda cute: $p0' ------------- Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From ncoghlan at gmail.com Wed Sep 7 15:07:54 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Wed, 07 Sep 2005 23:07:54 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <431EE34F.60107@gmail.com> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <431EE34F.60107@gmail.com> Message-ID: <431EE62A.4060208@gmail.com> Nick Coghlan wrote: > I found the following to be an interesting experiment: > > ------------- > from string import Template > > def format(*args, **kwds): > fmt = args[0] > kwds.update(("p%s" % idx, arg) for idx, arg in enumerate(args)) > return Template(fmt).substitute(**kwds) I forgot to add the following concept: ------------- def printf(*args, **kwds): to = kwds.pop("to", sys.stdout) to.write(format(*args, **kwds)) Py> printf("$p1: $p2\n", 1, 2) 1: 2 Py> printf("$p1: $p2\n", 1, 2, to=sys.stderr) 1: 2 Py> printf("$p1: $p2$to\n", 1, 2, to=sys.stderr) Traceback (most recent call last): File " ", line 1, in ? File " ", line 3, in printf File " ", line 4, in format File "C:\Python24\lib\string.py", line 172, in substitute return self.pattern.sub(convert, self.template) File "C:\Python24\lib\string.py", line 162, in convert val = mapping[named] KeyError: 'to' ------------- If you're dealing with an existing template that uses the 'to' keyword, then it is possible to fall back to using: ------------- def printraw(*args, **kwds): to = kwds.pop("to", sys.stdout) for arg in args: to.write(arg) Py> printraw(format("$p1: $p2$to\n", 1, 2, to="There"), to=sys.stderr) 1: 2There ------------- Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From guido at python.org Wed Sep 7 16:11:43 2005 From: guido at python.org (Guido van Rossum) Date: Wed, 7 Sep 2005 07:11:43 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1126094808.12806.2.camel@presto.wooz.org> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> Message-ID: On 9/7/05, Barry Warsaw wrote: > On Wed, 2005-09-07 at 05:23, Stephen J. Turnbull wrote: > > > But print-ng looks > > like becoming the OOWTDI for a lot of applications. IMO it's just too > > early to give up on print-ng becoming the one obvious way to do it for > > a lot of i18n apps, too. > > +1. I have a gut feeling that we can make it easy for monolinguists to > use printng without caring or even knowing about i18n, but also make it > relatively painless to integrate i18n into an application or library. > However I haven't had time to really explore that idea. I certainly didn't mean to rule that out. But I doubt that the only text to be i18n'd will occur in printf format strings. (In fact, I expect that few apps requiring i18n will be so primitive as to use *any* printf calls at all.) Anyway, let us hear what you had in mind rather than arguing over some abstract principle. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From rrr at ronadam.com Wed Sep 7 17:56:48 2005 From: rrr at ronadam.com (Ron Adam) Date: Wed, 07 Sep 2005 11:56:48 -0400 Subject: [Python-Dev] Python core documentation In-Reply-To: <200509062310.10132.fdrake@acm.org> References: <20050906202602.GE4534@isnomore.net> <200509062310.10132.fdrake@acm.org> Message-ID: <431F0DC0.1050109@ronadam.com> Fred L. Drake, Jr. wrote: > It would be good to have more specific guidelines for documentation. Would it be possible to have each item in the documentation start out by auto quoting that items __doc__ string? Then omissions, errors, and contradictions would be easy to find and the full documentation would compliment the __doc__ strings rather than repeat them. Cheers, Ron From jcarlson at uci.edu Wed Sep 7 19:16:17 2005 From: jcarlson at uci.edu (Josiah Carlson) Date: Wed, 07 Sep 2005 10:16:17 -0700 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: <60ed19d40509070303449cafad@mail.gmail.com> References: <20050907001521.8B56.JCARLSON@uci.edu> <60ed19d40509070303449cafad@mail.gmail.com> Message-ID: <20050907090119.8B5C.JCARLSON@uci.edu> Christopher Armstrong wrote: > On 9/7/05, Josiah Carlson wrote: > > Guido van Rossum wrote: > > > On 9/6/05, Greg Ewing wrote: > > > > A better plan would be to build something akin to > > > > Pyrex into the scheme of things, so that all the > > > > refcount/GC issues are taken care of automatically. > > > > > > That sounds exciting. I have to admit that despite hearing many > > > enthusiastic reviews, I've never used it myself -- in fact I've > > > written very little C code in the last few years, and zero new > > > extension modules. (Lots of Java, but that's another story. :-) > > > > Here's a perspective "from the trenches" as it were. > > > > Encouraging its use for the writing of new extension modules: ick, -1. > > Writing pretty yet high performing Pyrex is an art that I'm not sure > > anyone can master. > > I'd just like to put in that it seems like the suggestions to use > Pyrex were aimed at C-library wrapping extensions, not necessarily > ones that were written in C for performance (I gather that there are > very few of those, comparatively). So the encouragement to use Pyrex > for new extension modules still seems perfect, to me; its use should > definitely be encouraged when one needs to wrap some third-party > library, and I'd bet that that's the common case. To me, "new extension modules" != "wrapping C libraries for use with Python standard library inclusion". The latter is perfectly fine, the former may lead to fast but ugly Pyrex modules... But what if you don't want speed from pure Pyrex modules? Then why write them in Pyrex, why not stick with Python, or go to C for the speed, and Pyrex for the wrapping? - Josiah From bob at redivi.com Wed Sep 7 21:07:07 2005 From: bob at redivi.com (Bob Ippolito) Date: Wed, 7 Sep 2005 12:07:07 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> Message-ID: <335020FA-59D4-4B4C-8777-53BE4F741CC8@redivi.com> On Sep 7, 2005, at 7:11 AM, Guido van Rossum wrote: > On 9/7/05, Barry Warsaw wrote: > >> On Wed, 2005-09-07 at 05:23, Stephen J. Turnbull wrote: >> >> >>> But print-ng looks >>> like becoming the OOWTDI for a lot of applications. IMO it's >>> just too >>> early to give up on print-ng becoming the one obvious way to do >>> it for >>> a lot of i18n apps, too. >>> >> >> +1. I have a gut feeling that we can make it easy for >> monolinguists to >> use printng without caring or even knowing about i18n, but also >> make it >> relatively painless to integrate i18n into an application or library. >> However I haven't had time to really explore that idea. >> > > I certainly didn't mean to rule that out. But I doubt that the only > text to be i18n'd will occur in printf format strings. (In fact, I > expect that few apps requiring i18n will be so primitive as to use > *any* printf calls at all.) In my experience, implementing i18n with *existing* Python (2.3 at the time) features was not a big deal. We used a translation company to translate all of the strings, and they had no problem with normal Python %(format)s strings. We gave it to them in an excel spreadsheet, and converted it to Apple ".strings" style files (which we parse directly in the Windows version). Granted, we highlighted all of the "%(format)s" in the spreadsheet so it was clear what should be preserved. It worked like this: def _(stringToBeLocalized): return anAppropriateString and all formatted strings in the code looked like this: _('default english string %(variable)s') % someDict from real world production code: _(u'Installing this software requires %(requiredSpace)s of space.\n \nYou have selected to install this software on the iPod "%(podName) s" (%(totalFree)s available)') % { u'requiredSpace': self.installer.getRequiredFreeDiskSpace(), u'podName': self.podName, u'totalFree': space, } I was also able to easily automate the process of extracting strings to create that spreadsheet. I wrote a simple script that parsed the Python modules and looked for function calls of "_" whose only argument was a constant string. Worked great, and it was easy to write. -bob From wsanchez at wsanchez.net Thu Sep 8 02:11:08 2005 From: wsanchez at wsanchez.net (=?ISO-8859-1?Q?Wilfredo_S=E1nchez_Vega?=) Date: Wed, 7 Sep 2005 17:11:08 -0700 Subject: [Python-Dev] Exception Reorg PEP checked in In-Reply-To: References: <6AA5C3D1-6AEC-4CE4-AEB9-84FBDA10EFA9@wsanchez.net> <2moe80uxxg.fsf@starship.python.net> Message-ID: <646B6A3B-B46B-41C6-9B88-DA579DCCA744@wsanchez.net> (sorry for the delayed reply; vacation) On Aug 14, 2005, at 12:27 PM, Guido van Rossum wrote: > On 8/14/05, Michael Hudson wrote: > > >> Wilfredo S >> ?nchez Vega writes: >> >> >>> I'm curious about why Python lacks FileNotFoundError, >>> PermissionError and the like as subclasses of IOError. >>> >> >> Good question. Lack of effort/inertia? >> > > Well, I wonder how often it's needed. My typical use is this: > > try: > f = open(filename) > except IOError, err: > print "Can't open %s: %s" % (filename, err) > return > > and the error printed contains all the necessary details (in fact it > even repeats the filename, so I could probably just say "print err"). > That's fine for log output, but weak for handling the error. > Why do you need to know the exact reason for the failure? If you > simply want to know whether the file exists, I'd use os.path.exists() > or isfile(). (Never mind that this is the sometimes-frowned-upon > look-before-you-leap; I think it's often fine.) > If you're going to wave off the look-before-you-leap argument, I guess you're right in that case, but I think it's a pretty valid argument for a lot of applications. os.path.exists() also has a race condition in the case where the file is deleted between your test for is and the subsequent attempt to access it, so you'd still need to handle that error in the exception handling if you care about correctness. I agree that in many (most?) cases, this can be fudged, but if it's easier to do the correct thing, more people would do the correct thing. In any case, os.path.exists() also doesn't catch permissions errors, and I often find myself wanting to handle those errors specially as well. An example where both are useful is in an HTTP server, where a different status code should be returned to the client depending on success, file not found, permission denied, and other cases. Presently, I (and twisted.web) have to check errno in the IOError exception handler, which is really clunky, and I have to do it fairly often. > Also note that providing the right detail can be very OS specific. > Python doesn't just run on Unix and Windows. > File not found is a detectable error case on all platforms, I think. On an OS that doesn't have permissions errors, I wouldn't expect the existence of an exception that isn't used to be a huge portability problem. I can't imagine that checking errno is a more portable solution. These two exist and are quite useful in Java, for whatever that's worth. -wsv -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3057 bytes Desc: not available Url : http://mail.python.org/pipermail/python-dev/attachments/20050907/fa9e1de8/smime.bin From smitty_one_each at bigfoot.com Tue Sep 6 22:53:15 2005 From: smitty_one_each at bigfoot.com (Chris Smith) Date: Tue, 06 Sep 2005 16:53:15 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <17176.26475.644454.492490@montanaro.dyndns.org> <4318F633.6050501@gmail.com> <79990c6b05090306355891f450@mail.gmail.com> <4319AE7E.8020803@gmail.com> <4319C0ED.4060608@libero.it> Message-ID: <874q8xq5ok.fsf@bigfoot.com> >>>>> "Guido" == Guido van Rossum writes: Guido> Guido> In a different thread I mentioned a design principle for Guido> which I have no catchy name, but which has often helped me Guido> design better APIs. One way to state it is to say that Guido> instead of a single "swiss-army-knife" function with Guido> various options that choose different behavior variants, Guido> it's better to have different dedicated functions for each Guido> of the major functionality types. So let's call it the Guido> "Swiss Army Knife (...Not)" API design pattern. I call the idea the 80/20 Split, or 'Convenience Functions'. You have a powerful, highly generalized function that can do most anything, and has an interface to prove it. Then, a collection of 'Convenience Functions' to constrain that "Swiss Army Knife" to handle 80% of the use-cases, while still letting Ye Power User dig a little deeper. The challenge is to keep the Convenience Function population low, so that you don't arrive at 8,020 different functions in the interface. Go, Python! Chris From greg.ewing at canterbury.ac.nz Thu Sep 8 06:06:56 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 08 Sep 2005 16:06:56 +1200 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: References: <20050906085732.90687.qmail@web53908.mail.yahoo.com> <431E7091.5070104@canterbury.ac.nz> Message-ID: <431FB8E0.1090609@canterbury.ac.nz> Guido van Rossum wrote: > How stable is Pyrex? Would you be willing to integrate it thoroughly > with the Python source tree, to the point of contributing the code to > the PSF? (Without giving up ownership or responsibility for its > maintenance.) It's reasonably stable now, I think, although some details might yet change. I'd want another couple of releases before calling it finished. When I do reach that point, I'd be perfectly willing to contribute it to the PSF. -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From greg.ewing at canterbury.ac.nz Thu Sep 8 06:31:17 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 08 Sep 2005 16:31:17 +1200 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: <5.1.1.6.0.20050907014538.01b6e8f8@mail.telecommunity.com> References: <431E7091.5070104@canterbury.ac.nz> <20050906085732.90687.qmail@web53908.mail.yahoo.com> <431E7091.5070104@canterbury.ac.nz> <5.1.1.6.0.20050907014538.01b6e8f8@mail.telecommunity.com> Message-ID: <431FBE95.3060005@canterbury.ac.nz> Phillip J. Eby wrote: > Just an FYI; Pyrex certainly makes it relatively painless to write code > that interfaces with C, but it doesn't do much for performance, and > naively-written Pyrex code can actually be slower than > carefully-optimized Python code. So, for existing modules that were > written in C for performance reasons, Pyrex isn't currently a substitute. If the performance-critical parts of the C code were translated into Pyrex-C (i.e. Pyrex code that only performs C operations) it shouldn't be significantly different. Alternatively, Pyrex could be used to create a wrapper around the existing C code, freeing it from having to deal with the Python/C API, thus making it easier to maintain. > One of the reasons for this is that Pyrex code uses the generic Python/C > APIs, like PySequence_GetItem, even in cases where PyList_GetItem or its > macro form would be more appropriate. Pyrex has no way currently to > say, "this is type X's C API, so use it when you have a variable that's > of type X, instead of using the generic object protocols." I'm thinking of teaching it about some of the builtin types (e.g. list, dict) so it can use more efficient APIs when appropriate. In the meantime, you can declare and call most of the APIs explicitly if you want. The exception is some of the macro forms which have nonstandard refcount behaviour. I'm also thinking of adding some declarations to help with that. > There are other issues that contribute to the inefficiency as well, like > redundant refcounting, assigning None to temporary variables, etc. Yes, it could probably do with a bit of flow analysis and peephole optimisation to deal with things like that. > I haven't used the absolute latest version of Pyrex, but older versions > also used C strings for attribute lookups, which was horribly slow. I > think the latest version now creates string objects at module > initialization to avoid this issue, though. Yes, it now precreates and interns all identifier strings, which should help considerably. -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From kbk at shore.net Thu Sep 8 06:31:14 2005 From: kbk at shore.net (Kurt B. Kaiser) Date: Thu, 8 Sep 2005 00:31:14 -0400 (EDT) Subject: [Python-Dev] Weekly Python Patch/Bug Summary Message-ID: <200509080431.j884VEHG028517@bayview.thirdcreek.com> Patch / Bug Summary ___________________ Patches : 342 open ( +3) / 2923 closed ( +1) / 3265 total ( +4) Bugs : 908 open ( +5) / 5232 closed (+10) / 6140 total (+15) RFE : 188 open ( +1) / 185 closed ( +1) / 373 total ( +2) New / Reopened Patches ______________________ String formatting character for str.join (2005-09-04) CLOSED http://python.org/sf/1281573 opened by Nick Coghlan Speed up gzip.readline (~40%) (2005-09-04) http://python.org/sf/1281707 opened by April King Enable SSL for smtplib (2005-09-05) http://python.org/sf/1282340 opened by Phil Underwood AIX port from Elemental Security (2005-09-07) http://python.org/sf/1284289 opened by Guido van Rossum Patches Closed ______________ String formatting character for str.join (2005-09-04) http://python.org/sf/1281573 closed by ncoghlan New / Reopened Bugs ___________________ time.strptime() fails with unicode date string, de_DE locale (2005-09-01) CLOSED http://python.org/sf/1280061 opened by Adam Monsen tokenize module does not detect inconsistent dedents (2005-06-21) http://python.org/sf/1224621 reopened by arigo PyDateTime_Check references static object (2005-09-02) CLOSED http://python.org/sf/1280924 opened by Skip Montanaro xml.sax.expatreader doesn't pass encoding to ParserCreate (2005-09-02) http://python.org/sf/1281032 opened by Samuel Bayer Erroneous \url command in python.sty (2005-09-03) http://python.org/sf/1281291 opened by Rory Yorke array.arrays are not unpickleable (2005-09-03) CLOSED http://python.org/sf/1281383 opened by Reinhold Birkenfeld Py_BuildValue k format units don't work with big values (2005-09-04) http://python.org/sf/1281408 opened by Adal Chiriliuc exception when unpickling array.array objects (2005-09-04) http://python.org/sf/1281556 opened by John Machin urllib violates rfc 959 (2005-09-04) http://python.org/sf/1281692 opened by Matthias Klose logging.shutdown() not correct for chained handlers (2005-09-05) http://python.org/sf/1282539 opened by Fons Dijkstra socket.getaddrinfo() bug for IPv6 enabled platforms (2005-09-06) http://python.org/sf/1282647 opened by Ganesan Rajagopal PyArg_ParseTupleAndKeywords gives misleading error message (2005-09-06) http://python.org/sf/1283289 opened by japierro nit for builtin sum doc (2005-09-07) http://python.org/sf/1283491 opened by daishi harada os.path.abspath() / os.chdir() buggy with unicode paths (2005-09-07) http://python.org/sf/1283895 opened by Antoine Pitrou re nested conditional matching (?()) doesn't work (2005-09-07) http://python.org/sf/1284341 opened by Erik Demaine Bugs Closed ___________ codecs.StreamRecoder.next doesn't encode (2005-07-10) http://python.org/sf/1235646 closed by doerwalter time.strptime() fails with unicode date string, de_DE locale (2005-09-01) http://python.org/sf/1280061 closed by bcannon crash recursive __getattr__ (2005-08-24) http://python.org/sf/1267884 closed by tjreedy Lambda and deepcopy (2005-08-31) http://python.org/sf/1277718 closed by tjreedy logging module broken for multiple threads? (2005-09-01) http://python.org/sf/1277903 closed by vsajip PyDateTime_Check references static object (2005-09-02) http://python.org/sf/1280924 closed by montanaro bz2module.c compiler warning (2005-08-26) http://python.org/sf/1274069 closed by birkenfeld discrepancy between str.__cmp__ and unicode.__cmp__ (2005-08-29) http://python.org/sf/1275719 closed by rhettinger array.arrays are not unpickleable (2005-09-03) http://python.org/sf/1281383 closed by rhettinger SyntaxError raised on win32 for correct files (2005-05-12) http://python.org/sf/1200686 closed by tim_one New / Reopened RFE __________________ non-sequence map() arguments for optimization (2005-09-02) CLOSED http://python.org/sf/1281053 opened by Ecnassianer of the Green Storm Give __len__() advice for "don't know" (2005-09-06) http://python.org/sf/1283110 opened by Tim Peters RFE Closed __________ non-sequence map() arguments for optimization (2005-09-03) http://python.org/sf/1281053 closed by birkenfeld From wsanchez at apple.com Thu Sep 8 02:06:08 2005 From: wsanchez at apple.com (=?ISO-8859-1?Q?Wilfredo_S=E1nchez_Vega?=) Date: Wed, 7 Sep 2005 17:06:08 -0700 Subject: [Python-Dev] Exception Reorg PEP checked in In-Reply-To: References: <6AA5C3D1-6AEC-4CE4-AEB9-84FBDA10EFA9@wsanchez.net> <2moe80uxxg.fsf@starship.python.net> Message-ID: (sorry for the delayed reply; vacation) On Aug 14, 2005, at 12:27 PM, Guido van Rossum wrote: > On 8/14/05, Michael Hudson wrote: > >> Wilfredo S?nchez Vega writes: >> >>> I'm curious about why Python lacks FileNotFoundError, >>> PermissionError and the like as subclasses of IOError. >> >> Good question. Lack of effort/inertia? > > Well, I wonder how often it's needed. My typical use is this: > > try: > f = open(filename) > except IOError, err: > print "Can't open %s: %s" % (filename, err) > return > > and the error printed contains all the necessary details (in fact it > even repeats the filename, so I could probably just say "print err"). That's fine for log output, but weak for handling the error. > Why do you need to know the exact reason for the failure? If you > simply want to know whether the file exists, I'd use os.path.exists() > or isfile(). (Never mind that this is the sometimes-frowned-upon > look-before-you-leap; I think it's often fine.) If you're going to wave off the look-before-you-leap argument, I guess you're right in that case, but I think it's a pretty valid argument for a lot of applications. os.path.exists() also has a race condition in the case where the file is deleted between your test for is and the subsequent attempt to access it, so you'd still need to handle that error in the exception handling if you care about correctness. I agree that in many (most?) cases, this can be fudged, but if it's easier to do the correct thing, more people would do the correct thing. In any case, os.path.exists() also doesn't catch permissions errors, and I often find myself wanting to handle those errors specially as well. An example where both are useful is in an HTTP server, where a different status code should be returned to the client depending on success, file not found, permission denied, and other cases. Presently, I (and twisted.web) have to check errno in the IOError exception handler, which is really clunky, and I have to do it fairly often. > Also note that providing the right detail can be very OS specific. > Python doesn't just run on Unix and Windows. File not found is a detectable error case on all platforms, I think. On an OS that doesn't have permissions errors, I wouldn't expect the existence of an exception that isn't used to be a huge portability problem. I can't imagine that checking errno is a more portable solution. These two exist and are quite useful in Java, for whatever that's worth. -wsv -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3057 bytes Desc: not available Url : http://mail.python.org/pipermail/python-dev/attachments/20050907/241d7cd0/smime-0001.bin From stephen at xemacs.org Thu Sep 8 12:12:30 2005 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Thu, 08 Sep 2005 19:12:30 +0900 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: (Guido van Rossum's message of "Wed, 7 Sep 2005 07:11:43 -0700") References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> Message-ID: <877jdryik1.fsf@tleepslib.sk.tsukuba.ac.jp> >>>>> "Guido" == Guido van Rossum writes: Guido> I certainly didn't mean to rule that out. Speaking for myself, that's all I really wanted to hear at this time. As Bob Ippolito said, currently it's straightforward to internationalize an application, and well worth the minimal overhead if it's at all serious. It's just that it would be nice if quick and dirty additions for i18n programs could be done as easily and with the same facility as for mono-Euro-lingual programs. I also think that at present Python is to the point where it's natural to write in a style where i18n is nearly costless (I use unicode strings habitually, and prefer %(var)s to positional %s anyway, because I find it easier to read). It would be a shame to regress from that! Why "mono-Euro-lingual"? Well, in teaching Python in Japan, one thing that is really annoying about the current print statement is that automatic spacing. Japanese doesn't use spaces to separate words, so you basically have to start with the '%' operator when teaching Japanese students output using variables. Several of them have said "oh, another typical American software that breaks Japanese". Dunno what to do about that, though, setting that based on the POSIX locale would break my personal usage (when things are broken, I want to see the debug output in English!) Guido> But I doubt that the only text to be i18n'd will occur in Guido> printf format strings. (In fact, I expect that few apps Guido> requiring i18n will be so primitive as to use *any* printf Guido> calls at all.) Personally I don't write complex applications in native Python, I write them for Zope or something like that. Then I don't have to worry about generic Python facilities; I have to use whatever the substrate is using. However, I do write simple CGIs that need to produce English and Japanese pages (at least), and it's often enough to write something like (this is from memory): def addressWarningPage (formDict) simplePageHeader (_("Address Warning")) print _("""\ I'm sorry, %(user)s, but the address you submitted is %(address)s, which appears to be a mobile phone address. Please use a real email address, because the mailing list for %(course)s distributes large attachments.""") % formDict simplePageFooter () where the simplePageFunctions themselves have been inherited from old code that simply 'print'ed to stdout, and formDict is constructed by the underlying CGI handler, so it's always available. I write a fair number of these pages, there are always new ways to go wrong.... This is very similar to what Bob Ippolito describes, and it's easy enough to do. However, my translators _do_ confuse the "s" for "string argument" with English pluralization (they're not native English speakers, usually). It would be nice (for me) if we could use notation that doesn't use stray format characters. It would be nice to be able to lose the "_()" calls to gettext(). The function would look to see if a message catalog was available for the current output stream, and if not, do no translation. (I'm not sure this can work, because it might conflict with things done automatically based on environment settings of POSIX locale.) It would be nice if a single function could support format strings with positional arguments and those with named variable substitution. (Not at the same time, though, that should be an error, I think.) If not, a separate function would be easy enough to support in a conversion script. All that's still pretty abstract, I guess. But so far, I don't see any reason why your proposal for the $1 positional syntax in printf() hinders any of the above. I just wanted to make sure that asking for them is OK. -- School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN Ask not how you can "do" free software business; ask what your business can "do for" free software. From solipsis at pitrou.net Thu Sep 8 13:48:01 2005 From: solipsis at pitrou.net (Antoine Pitrou) Date: Thu, 08 Sep 2005 13:48:01 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <877jdryik1.fsf@tleepslib.sk.tsukuba.ac.jp> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <877jdryik1.fsf@tleepslib.sk.tsukuba.ac.jp> Message-ID: <1126180081.882.9.camel@p-dvsi-418-1.rd.francetelecom.fr> Hi, Le jeudi 08 septembre 2005 ? 19:12 +0900, Stephen J. Turnbull a ?crit : > It would be > nice to be able to lose the "_()" calls to gettext(). The function > would look to see if a message catalog was available for the current > output stream, and if not, do no translation. That doesn't sound right to me. 1. You still need to do automatic extraction of these strings (gettext has tools for that, which rely on the use of the "_()" function - or any other dedicated function (*)). 2. You can't assume that all strings must be i18n'ed. For example if I'm interfacing with the user via a text-based network protocol which has a field named "Length", I don't want that "Length" field to be replaced with the Japanese translation of the word "Length". For i18n, "explicit is better than implicit" ;) The beauty of "_()" is that it's at the same time explicit, easily recognizable, and very short to type and read (it doesn't clutter the source code). If I dare say, the "%" operator has the same qualities. (*) of course more automatization of what gettext does could be a nice improvement too! > But so far, I don't see > any reason why your proposal for the $1 positional syntax in printf() > hinders any of the above. As I said Python needs an operator or function that does string formatting using a simple template, *without* doing output at the same time. The current syntax is the '%' operator, it could change, but it shouldn't be removed in favor of an inflexible print-with-formatting approach. Regards Antoine. From mcherm at mcherm.com Thu Sep 8 14:14:02 2005 From: mcherm at mcherm.com (Michael Chermside) Date: Thu, 08 Sep 2005 05:14:02 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 Message-ID: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> Guido writes: > Is it worth doing this and completely dropping the %-based formats in > Py3k? (Just asking -- it might be if we can get people to get over the > shock of $ becoming first class ;-). In my opinion, YES -- it's worth seriously considering it. A single, well-designed solution for string interpolation (with syntactic support if needed to make it very easy to use) is FAR better than having one good solution and another legacy solution. Just the awkwardness of the trailing s in "%(foo)s" is enough to motivate a search for something better. But this presuposes that there IS a single well-designed solution. PEP 292 templates are an excellent start, but they are not that solution. The largest problem is the lack of a means for formatting numbers. People should think hard about good solutions. He continues: > I proposed ${varname%fmt} earlier but it prevents you to extend the > varname syntax to arbitrary expressions, which I think is an extension > that will get lots of requests. I certainly agree that we should keep open the syntactic possibility to allow arbitrary Python expressions between "${" and "}" in an interpolation string even though it may not be supported today. I favor idea (Barry's?) of using "${ : : }" where is an identifier (but someday might allow expressions), and and behave like the % interpolation modifiers today. I would have suggested it myself, but somehow I failed to realize that slice literals are allowed only within subscripts and thus do not conflict with this use. -- Michael Chermside From barry at python.org Thu Sep 8 14:38:22 2005 From: barry at python.org (Barry Warsaw) Date: Thu, 08 Sep 2005 08:38:22 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <431EE34F.60107@gmail.com> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <431EE34F.60107@gmail.com> Message-ID: <1126183102.12805.48.camel@presto.wooz.org> On Wed, 2005-09-07 at 08:55, Nick Coghlan wrote: > The leading 'p' (for 'positional') is necessary to get around the fact that $1 > is currently an illegal identifier in a Template That should be fixable. Ideally, $1 is better than $p1. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050908/b20f8e73/attachment.pgp From barry at python.org Thu Sep 8 14:42:14 2005 From: barry at python.org (Barry Warsaw) Date: Thu, 08 Sep 2005 08:42:14 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <335020FA-59D4-4B4C-8777-53BE4F741CC8@redivi.com> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <335020FA-59D4-4B4C-8777-53BE4F741CC8@redivi.com> Message-ID: <1126183334.12803.53.camel@presto.wooz.org> On Wed, 2005-09-07 at 15:07, Bob Ippolito wrote: > I was also able to easily automate the process of extracting strings > to create that spreadsheet. I wrote a simple script that parsed the > Python modules and looked for function calls of "_" whose only > argument was a constant string. Worked great, and it was easy to write. I don't think enough people know about Tools/i18n/pygettext. It does all the extractions for you, producing a GNU gettext compatible .pot file. You can even teach it to recognize extraction keywords other than the default _(). printf() should be easy to recognize, although we might have to make a slight modification since IIRC, pygettext will only extract strings from keyword functions with exactly one argument. That should be easy to fix. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050908/eb3405d7/attachment.pgp From barry at python.org Thu Sep 8 14:45:00 2005 From: barry at python.org (Barry Warsaw) Date: Thu, 08 Sep 2005 08:45:00 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1126180081.882.9.camel@p-dvsi-418-1.rd.francetelecom.fr> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <877jdryik1.fsf@tleepslib.sk.tsukuba.ac.jp> <1126180081.882.9.camel@p-dvsi-418-1.rd.francetelecom.fr> Message-ID: <1126183500.12803.56.camel@presto.wooz.org> On Thu, 2005-09-08 at 07:48, Antoine Pitrou wrote: > As I said Python needs an operator or function that does string > formatting using a simple template, *without* doing output at the same > time. The current syntax is the '%' operator, it could change, but it > shouldn't be removed in favor of an inflexible print-with-formatting > approach. I believe we already have that in the constituent parts of stream.write() and Template.substitute(). I don't have any problem with the built-in print() function (or printf()) combining the two for convenience. After all, print's entire purpose is convenience. -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 307 bytes Desc: This is a digitally signed message part Url : http://mail.python.org/pipermail/python-dev/attachments/20050908/759e0b6f/attachment.pgp From ncoghlan at gmail.com Thu Sep 8 15:15:54 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 08 Sep 2005 23:15:54 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1126183102.12805.48.camel@presto.wooz.org> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <431EE34F.60107@gmail.com> <1126183102.12805.48.camel@presto.wooz.org> Message-ID: <4320398A.6090809@gmail.com> Barry Warsaw wrote: > On Wed, 2005-09-07 at 08:55, Nick Coghlan wrote: > > >>The leading 'p' (for 'positional') is necessary to get around the fact that $1 >>is currently an illegal identifier in a Template > > > That should be fixable. Ideally, $1 is better than $p1. Oh, I know. I just didn't feel like cranking my brain up to the point of figuring out the necessary change to the string.Template regex. It turns out the one required change to the pattern is truly trivial though (I guess the grief we gave PEP 292 about easy customisation was actually worthwhile): from string import Template class fmtTemplate(Template): idpattern = '[_a-z0-9]*' def format(*args, **kwds): if kwds and (len(args) > 1): raise ValueError("Cannot use both keyword and positional arguments") fmt = fmtTemplate(args[0]) kwds.update(((str(idx), arg) for idx, arg in enumerate(args))) return fmt.substitute(**kwds) Py> format("$1: $2", "Num bees", 0.5) 'Num bees: 0.5' Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From ncoghlan at gmail.com Thu Sep 8 16:10:19 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 09 Sep 2005 00:10:19 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1126183102.12805.48.camel@presto.wooz.org> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <431EE34F.60107@gmail.com> <1126183102.12805.48.camel@presto.wooz.org> Message-ID: <4320464B.3090100@gmail.com> Barry Warsaw wrote: > On Wed, 2005-09-07 at 08:55, Nick Coghlan wrote: > > >>The leading 'p' (for 'positional') is necessary to get around the fact that $1 >>is currently an illegal identifier in a Template > > > That should be fixable. Ideally, $1 is better than $p1. I also looked into the idea of adding formatting support to the string.Template syntax. Given a reimplementation of string.Template with the following pattern: pattern = r""" %(delim)s(?: (?P %(delim)s) | # An escape sequence of two delimiters, or ( (\[(?P [^%%]*)\])? # an optional simple format string, ( (?P %(id)s) | # and a Python identifier, or {(?P %(id)s)} # a braced identifier ) ) | (?P ) # An ill-formed delimiter expr ) """ And "convert" functions modified to use "fmt" where "'%s'" is currently used, with "fmt" defined via: fmt = mo.group('format') if fmt is None: fmt = '%s' # Default to simple string format else: fmt = '%' + fmt The following works: Py> t = format.Template("$item: $[0.2f]quantity") Py> t.format(quantity=0.5, item='Bees') 'Bees: 0.50' Combining with a 'format' function similar to the one in my previous message, and an id pattern modified to permit numbers as identifiers: Py> format("$1: $[0.2f]2", 'Bees', 0.5) 'Bees: 0.50' Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From dalcinl at gmail.com Thu Sep 8 17:52:35 2005 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Thu, 8 Sep 2005 12:52:35 -0300 Subject: [Python-Dev] PEP 3000 and new style classes Message-ID: PEP 3000 - Core language says (http://www.python.org/peps/pep-3000.html#core-language) : - Support only new-style classes; classic classes will be gone Any possibility to add something like from __future__ import new_style_classes to have newly defined classes implicitly derive from 'object' (I understand this will be the implicit behavior when classic classes go away in Py3.0). -- Lisandro Dalc?n --------------- Centro Internacional de M?todos Computacionales en Ingenier?a (CIMEC) Instituto de Desarrollo Tecnol?gico para la Industria Qu?mica (INTEC) Consejo Nacional de Investigaciones Cient?ficas y T?cnicas (CONICET) PTLC - G?emes 3450, (3000) Santa Fe, Argentina Tel/Fax: +54-(0)342-451.1594 From stephen at xemacs.org Thu Sep 8 18:38:56 2005 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Fri, 09 Sep 2005 01:38:56 +0900 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1126180081.882.9.camel@p-dvsi-418-1.rd.francetelecom.fr> (Antoine Pitrou's message of "Thu, 08 Sep 2005 13:48:01 +0200") References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <877jdryik1.fsf@tleepslib.sk.tsukuba.ac.jp> <1126180081.882.9.camel@p-dvsi-418-1.rd.francetelecom.fr> Message-ID: <87psrjv7j3.fsf@tleepslib.sk.tsukuba.ac.jp> >>>>> "Antoine" == Antoine Pitrou writes: Antoine> Le jeudi 08 septembre 2005 ? 19:12 +0900, Stephen Antoine> J. Turnbull a ?crit : >> It would be nice to be able to lose the "_()" calls to >> gettext(). The function would look to see if a message catalog >> was available for the current output stream, and if not, do no >> translation. I should have been more explicit. I meant only in the context of printf. You're right, gettext (and aliases) are still needed. Antoine> That doesn't sound right to me. 1. You still need to do Antoine> automatic extraction of these strings (gettext has tools Antoine> for that, which rely on the use of the "_()" function - Antoine> or any other dedicated function (*)). I think printf is an excellent candidate for such a function. Antoine> 2. You can't assume that all strings must be i18n'ed. IMO strings that are being printf'd can probably be assumed to be human readable, and therefore candidates for translation. This assumption does impose runtime overhead on non-i18n users, but I suspect it's smaller than that of caching regexps which has been determined to be acceptable. It would also make printf more hazardous for use in implementing protocol messages, but I think that's already pretty hazardous with the print statement. Although Guido has been very firm about not imposing overhead on _all_ users for the sake of i18n, implementing protocols is a similar minority activity, and there might be an acceptable tradeoff there. Antoine> As I said Python needs an operator or function that does Antoine> string formatting using a simple template, *without* Antoine> doing output at the same time. The current syntax is the Antoine> '%' operator, it could change, but it shouldn't be Antoine> removed in favor of an inflexible print-with-formatting Antoine> approach. AFAICT, that is the consensus view. Is there something concrete you're worried about here? Cheers, -- School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN Ask not how you can "do" free software business; ask what your business can "do for" free software. From aahz at pythoncraft.com Thu Sep 8 18:43:12 2005 From: aahz at pythoncraft.com (Aahz) Date: Thu, 8 Sep 2005 09:43:12 -0700 Subject: [Python-Dev] PEP 3000 and new style classes In-Reply-To: References: Message-ID: <20050908164312.GA26993@panix.com> On Thu, Sep 08, 2005, Lisandro Dalcin wrote: > > Any possibility to add something like > > from __future__ import new_style_classes > > to have newly defined classes implicitly derive from 'object' (I > understand this will be the implicit behavior when classic classes go > away in Py3.0). You can already do __metaclass__ = type within each module -- Aahz (aahz at pythoncraft.com) <*> http://www.pythoncraft.com/ The way to build large Python applications is to componentize and loosely-couple the hell out of everything. From ironfroggy at gmail.com Thu Sep 8 18:53:09 2005 From: ironfroggy at gmail.com (Calvin Spealman) Date: Thu, 8 Sep 2005 12:53:09 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <7168d65a050831132415118382@mail.gmail.com> <20050831204439.GA3775@discworld.dyndns.org> <4316749F.6060204@canterbury.ac.nz> <76fd5acf050905212157306c5@mail.gmail.com> Message-ID: <76fd5acf05090809535fb8b43d@mail.gmail.com> On 9/6/05, Guido van Rossum wrote: > On 9/5/05, Calvin Spealman wrote: > > There is a lot of debate over this issue, obviously. Now, I think > > getting rid of the print statement can lead to ugly code, because a > > write function would be called as an expression, so where we'd once > > have prints on their own lines, that wouldn't be the case anymore, and > > things could get ugly. > > Sounds like FUD to me. Lots of functions/methods exist that *could* be > embedded in expressions, and never are. Or if they are, there's > actually a good reason, and then being a mere function (instead of a > statement) would actually be helpful. Anyway, why would it be > important that prints are on their own line where so many other > important actions don't have that privilege? For the same reason any statement is not an expression. Python doesn't allow assignments as expression, even though it has been implemented. Nor imports or function and class definitions. Readability is key. On the other hand, I actually don't like there being a print statement at all. We don't live in the days were console software rules and any other form of interface is an after thought. First-class printing to standard out seems to make a statement (no pun intended) that the language is intended for Unix-emulating operating systems (even Windows does, to some extent) and that anything you don't pipe through stdout or pull from stdin is something extra tossed in for a special crowd. Interface equality and neutrality would be a good thing in the language. But, I guess what I'm getting at is that if you do give special case to anything, give it special case properly. If text console IO is going to be only through functions and not directly in the language syntax, should it even be a built-in? Bring it to the level of any other interface API or keep it at its own status, but any middle ground seems half-hearted. From dalcinl at gmail.com Thu Sep 8 18:56:07 2005 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Thu, 8 Sep 2005 13:56:07 -0300 Subject: [Python-Dev] PEP 3000 and new style classes In-Reply-To: <20050908164312.GA26993@panix.com> References: <20050908164312.GA26993@panix.com> Message-ID: On 9/8/05, Aahz wrote: > > You can already do > > __metaclass__ = type > > within each module > Yes, you are right. But this way, you are making explicit a behavior that will be implicit in the future. For example, we could also do: two = float(4)/float(2) instead of from __future__ import division two = 4/2 -- Lisandro Dalc?n --------------- Centro Internacional de M?todos Computacionales en Ingenier?a (CIMEC) Instituto de Desarrollo Tecnol?gico para la Industria Qu?mica (INTEC) Consejo Nacional de Investigaciones Cient?ficas y T?cnicas (CONICET) PTLC - G?emes 3450, (3000) Santa Fe, Argentina Tel/Fax: +54-(0)342-451.1594 From tommy at ilm.com Thu Sep 8 18:59:28 2005 From: tommy at ilm.com (Tommy Burnette) Date: Thu, 8 Sep 2005 09:59:28 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> References: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> Message-ID: <17184.28144.66163.891298@evoke.lucasfilm.com> Michael Chermside writes: | Guido writes: | > Is it worth doing this and completely dropping the %-based formats in | > Py3k? (Just asking -- it might be if we can get people to get over the | > shock of $ becoming first class ;-). | | In my opinion, YES -- it's worth seriously considering it. A single, | well-designed solution for string interpolation (with syntactic support | if needed to make it very easy to use) is FAR better than having one | good solution and another legacy solution. Just the awkwardness of the | trailing s in "%(foo)s" is enough to motivate a search for something | better. hey folks, I managed to lose a few days worth of python-dev mail so I'm late to this discussion, but I thought I'd toss in a few (possibly outlying) data points form the visual effects/3d animation world. here at ILM we use python as the expression langauge in a number of 3d applications, and we usually end up adding a front-end parser so users can reference variable values inline via $ sytanx. they're still essentially writing python code, but with the extra added suger of $ references. I have first-hand information that the engineers at Pixar chose tcl over python a few years back as the expression language in their commercial shader editor "slim" for exactly this reason as well (i.e tcl already had $ refs, and they didn't want to present their own python-but-not language like we do here). so if replacing '' % () formatting with $ refs is an option in py3k, allow me to offer a +1000 vote for that :) From bob at redivi.com Thu Sep 8 19:08:53 2005 From: bob at redivi.com (Bob Ippolito) Date: Thu, 8 Sep 2005 10:08:53 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1126183334.12803.53.camel@presto.wooz.org> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <335020FA-59D4-4B4C-8777-53BE4F741CC8@redivi.com> <1126183334.12803.53.camel@presto.wooz.org> Message-ID: <082F2FD4-33D7-42AF-94B8-F7746E0C020B@redivi.com> On Sep 8, 2005, at 5:42 AM, Barry Warsaw wrote: > On Wed, 2005-09-07 at 15:07, Bob Ippolito wrote: > > >> I was also able to easily automate the process of extracting strings >> to create that spreadsheet. I wrote a simple script that parsed the >> Python modules and looked for function calls of "_" whose only >> argument was a constant string. Worked great, and it was easy to >> write. >> > > I don't think enough people know about Tools/i18n/pygettext. It does > all the extractions for you, producing a GNU gettext compatible .pot > file. You can even teach it to recognize extraction keywords other > than > the default _(). printf() should be easy to recognize, although we > might have to make a slight modification since IIRC, pygettext will > only > extract strings from keyword functions with exactly one argument. > That > should be easy to fix. You're right, I think Tools is probably a bad place for anything. If it's not part of the stdlib, I'll likely never find it. -bob From fperez.net at gmail.com Thu Sep 8 19:04:13 2005 From: fperez.net at gmail.com (Fernando Perez) Date: Thu, 08 Sep 2005 11:04:13 -0600 Subject: [Python-Dev] reference counting in Py3K References: <431E7091.5070104@canterbury.ac.nz> <20050907001521.8B56.JCARLSON@uci.edu> Message-ID: Josiah Carlson wrote: > Here's a perspective "from the trenches" as it were. > > I've been writing quite a bit of code, initially all in Python (27k > lines in the last year or so). It worked reasonably well and fast. It > wasn't fast enough. I needed a 25x increase in performance, which would > have been easily attainable if I were to rewrite everything in C, but > writing a module in pure C is a bit of a pain (as others can attest), so > I gave Pyrex a shot (after scipy.weave.inline, ick). Would you care to elaborate on the reasons behind the 'ick'? I'm a big fan of weave.inline and have used it very successfully for my own needs, so I'm genuinely curious (as I tend to teach its use, I like to know of potential problems I may not have seen). I should also add that a while ago a number of extremely annoying spurious recompilation bugs were finally fixed, in case this was what bothered you. Those bugs (hard to find) made weave in certain cases useless, as it recompiled everything blindly, thus killing its whole purpose. Feel free to reply off-list if you feel this is not appropriate for python-dev, though I think that a survey of the c-python bridges may be of interest to others. Cheers, f From jcarlson at uci.edu Thu Sep 8 20:10:12 2005 From: jcarlson at uci.edu (Josiah Carlson) Date: Thu, 08 Sep 2005 11:10:12 -0700 Subject: [Python-Dev] reference counting in Py3K In-Reply-To: References: <20050907001521.8B56.JCARLSON@uci.edu> Message-ID: <20050908104228.8B76.JCARLSON@uci.edu> Fernando Perez wrote: > Josiah Carlson wrote: > > Here's a perspective "from the trenches" as it were. > > > > I've been writing quite a bit of code, initially all in Python (27k > > lines in the last year or so). It worked reasonably well and fast. It > > wasn't fast enough. I needed a 25x increase in performance, which would > > have been easily attainable if I were to rewrite everything in C, but > > writing a module in pure C is a bit of a pain (as others can attest), so > > I gave Pyrex a shot (after scipy.weave.inline, ick). > > Would you care to elaborate on the reasons behind the 'ick'? I'm a big fan of > weave.inline and have used it very successfully for my own needs, so I'm > genuinely curious (as I tend to teach its use, I like to know of potential > problems I may not have seen). 1. Mixing multiple languages in a single source file is bad form, yet it seems to be encouraged in weave.inline and other such packages (it becomes a big deal when the handful of Python becomes 20+ lines of C). 2. I experienced some minor but annoying issues in regards to automatic type conversions (strings, mmaps, buffers, and arrays if I remember correctly, it has been since February or March). There were other things, but I'm not sure if I am remembering them correctly or not (I spent around 12 hours over two days wrestling with weave.inline, but in 10 minutes I was using Pyrex effectively). > I should also add that a while ago a number of extremely annoying spurious > recompilation bugs were finally fixed, in case this was what bothered you. > Those bugs (hard to find) made weave in certain cases useless, as it > recompiled everything blindly, thus killing its whole purpose. I was actually finding that weave wasn't recompiling /enough/. I'd change some source, and get old behavior. I'd delete the various cache files, then see the recompilation and new behavior. With Pyrex and a bit of magic, I get auto-recompilation (though will seriously consider switching to Pyximport as another suggested). It also seemed to have some issues with interactive sessions, but I may be misremembering. > Feel free to reply off-list if you feel this is not appropriate for python-dev, > though I think that a survey of the c-python bridges may be of interest to > others. Agreed. I admit that some of my issues would likely be lesser if I were to start to use inline now, with additional experience with such things. But with a few thousand lines of Pyrex and C working right now, I'm hard pressed to convince anyone (including myself) that such a switch is worthwhile. - Josiah From mike at skew.org Thu Sep 8 20:41:39 2005 From: mike at skew.org (Mike Brown) Date: Thu, 8 Sep 2005 12:41:39 -0600 (MDT) Subject: [Python-Dev] bug in urlparse In-Reply-To: <20050904233804.GA2731@unpythonic.net> Message-ID: <200509081841.j88IfdtW023619@chilled.skew.org> jepler at unpythonic.net wrote: > According to RFC 2396[1] section 5.2: RFC 2396 is obsolete. It was superseded by RFC 3986 / STD 66 early this year. In particular, the procedure for removing dot-segments from the path component of a URI reference -- a procedure that is only supposed to be done when 'resolving' a reference to absolute form (i.e., merging it with a base URI, which, being a URI, not a URI reference, is not allowed to contain dot-segments) -- has received a significant overhaul. The implementation guidance you quoted from RFC 2396 is no longer relevant. Technically, it never was relevant, since urlparse only claims to implement RFC 1808 (2396's predecessor, now ten years old). The new procedure says "...dot-segments are intended for use in URI references to express an identifier relative to the hierarchy of names in the base URI. The remove_dot_segments algorithm respects that hierarchy by removing extra dot-segments rather than treat them as an error or leaving them to be misinterpreted by dereference implementations." -Mike From fperez.net at gmail.com Thu Sep 8 22:05:15 2005 From: fperez.net at gmail.com (Fernando Perez) Date: Thu, 08 Sep 2005 14:05:15 -0600 Subject: [Python-Dev] reference counting in Py3K References: <20050907001521.8B56.JCARLSON@uci.edu> <20050908104228.8B76.JCARLSON@uci.edu> Message-ID: Josiah Carlson wrote: > Fernando Perez wrote: >> Would you care to elaborate on the reasons behind the 'ick'? I'm a big fan >> of weave.inline and have used it very successfully for my own needs, so I'm >> genuinely curious (as I tend to teach its use, I like to know of potential >> problems I may not have seen). > > 1. Mixing multiple languages in a single source file is bad form, yet it > seems to be encouraged in weave.inline and other such packages (it > becomes a big deal when the handful of Python becomes 20+ lines of C). Agreed. I only use inline with explicit C strings for very short stuff, and typically use a little load_C_snippet() utility I wrote. That lets me keep the C sources in real C files, with proper syntax highlighting in Xemacs and whatnot. [... summary of weave problems] > Agreed. I admit that some of my issues would likely be lesser if I were > to start to use inline now, with additional experience with such things. > But with a few thousand lines of Pyrex and C working right now, I'm hard > pressed to convince anyone (including myself) that such a switch is > worthwhile. Thanks for your input. I certainly wasn't trying to suggest you change, I was just curious about your experiences. If you ever see this again, specific feedback on the scipy list would be very welcome. While I'm not 'officially' a scipy developer, I care enough about weave that occasionally I dig in and go in bugfixing expeditions. With proper bug reports we could improve a system which I think has a place (especially for scientific computing, with the Blitz support for arrays, which gives Numpy-like arrays in C++). I don't see weave as a competitor to pyrex, but rather as an alternate tool which can be excellent in certain contexts, and which I'd like to see improve whre possible. Regards, f From mithrandi-python-dev at mithrandi.za.net Thu Sep 8 23:13:07 2005 From: mithrandi-python-dev at mithrandi.za.net (Tristan Seligmann) Date: Thu, 8 Sep 2005 23:13:07 +0200 Subject: [Python-Dev] PEP 3000 and new style classes In-Reply-To: References: <20050908164312.GA26993@panix.com> Message-ID: <20050908211307.GA506@mithrandi.za.net> * Lisandro Dalcin [2005-09-08 13:56:07 -0300]: > Yes, you are right. But this way, you are making explicit a behavior > that will be implicit in the future. > > For example, we could also do: > > two = float(4)/float(2) > > instead of > > from __future__ import division > two = 4/2 Why does it matter if the single statement you insert is spelled "__metaclass__ = type" instead of "from __future__ import whatever"? Remember, unlike the division example, you would only have to insert one statement, as opposed to changing every use of integer division. -- mithrandi, i Ainil en-Balandor, a faer Ambar -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 189 bytes Desc: Digital signature Url : http://mail.python.org/pipermail/python-dev/attachments/20050908/73486fa9/attachment.pgp From jepler at unpythonic.net Fri Sep 9 01:35:32 2005 From: jepler at unpythonic.net (jepler@unpythonic.net) Date: Thu, 8 Sep 2005 18:35:32 -0500 Subject: [Python-Dev] bug in urlparse In-Reply-To: <200509081841.j88IfdtW023619@chilled.skew.org> References: <20050904233804.GA2731@unpythonic.net> <200509081841.j88IfdtW023619@chilled.skew.org> Message-ID: <20050908233528.GA19942@unpythonic.net> On Thu, Sep 08, 2005 at 12:41:39PM -0600, Mike Brown wrote: > jepler at unpythonic.net wrote: > > According to RFC 2396[1] section 5.2: > > RFC 2396 is obsolete. It was superseded by RFC 3986 / STD 66 early this year. Thanks for the correction. Jeff -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 189 bytes Desc: not available Url : http://mail.python.org/pipermail/python-dev/attachments/20050908/af83e008/attachment.pgp From t-meyer at ihug.co.nz Fri Sep 9 03:23:16 2005 From: t-meyer at ihug.co.nz (Tony Meyer) Date: Fri, 9 Sep 2005 13:23:16 +1200 Subject: [Python-Dev] Tools directory (Was RE: Replacement for print in Python 3.0) In-Reply-To: Message-ID: [finding Tools/i18n/pygettext.py] > You're right, I think Tools is probably a bad place for > anything. If it's not part of the stdlib, I'll likely never > find it. Agreed. Maybe with the introduction of -m in Python 2.4, some of the Tools/ scripts could be put in __main__ sections of appropriate modules? So that "python -m gettext" would be equivilant to "python Tools/i18n/pygettext.py"? (However, pyggettext.py is 22KB, which is a big addition to the module; not everything in Tools/Scripts might be used enough for this, or have an appopriate module to be put in either). Are there other ideas about how Tools/ could be improved? Either moving things, or making it more likely that people will look there for scripts? =Tony.Meyer From bcannon at gmail.com Fri Sep 9 03:52:59 2005 From: bcannon at gmail.com (Brett Cannon) Date: Thu, 8 Sep 2005 18:52:59 -0700 Subject: [Python-Dev] Tools directory (Was RE: Replacement for print in Python 3.0) In-Reply-To: References: Message-ID: On 9/8/05, Tony Meyer wrote: > [finding Tools/i18n/pygettext.py] > > You're right, I think Tools is probably a bad place for > > anything. If it's not part of the stdlib, I'll likely never > > find it. > > Agreed. Maybe with the introduction of -m in Python 2.4, some of the Tools/ > scripts could be put in __main__ sections of appropriate modules? So that > "python -m gettext" would be equivilant to "python Tools/i18n/pygettext.py"? > > (However, pyggettext.py is 22KB, which is a big addition to the module; not > everything in Tools/Scripts might be used enough for this, or have an > appopriate module to be put in either). > > Are there other ideas about how Tools/ could be improved? Either moving > things, or making it more likely that people will look there for scripts? > I assume that the Windows installer includes the Tools/ directory. If it doesn't that is one problem. =) Otherwise it is mostly a lack of advertisement and them not being installed by ``make install``. If you just download the soure and install you will never know the directory even exists. It needs to be made obvious to people that it is even there. Probably the only way is to document the directory. -Brett From greg.ewing at canterbury.ac.nz Fri Sep 9 04:12:51 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 09 Sep 2005 14:12:51 +1200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> References: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> Message-ID: <4320EFA3.8070607@canterbury.ac.nz> Michael Chermside wrote: > In my opinion, YES -- it's worth seriously considering it. A single, > well-designed solution for string interpolation (with syntactic support > if needed to make it very easy to use) is FAR better than having one > good solution and another legacy solution. Maybe backquotes could be repurposed in Py3k for interpolated string literals? -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From greg.ewing at canterbury.ac.nz Fri Sep 9 04:21:35 2005 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 09 Sep 2005 14:21:35 +1200 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <87psrjv7j3.fsf@tleepslib.sk.tsukuba.ac.jp> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <877jdryik1.fsf@tleepslib.sk.tsukuba.ac.jp> <1126180081.882.9.camel@p-dvsi-418-1.rd.francetelecom.fr> <87psrjv7j3.fsf@tleepslib.sk.tsukuba.ac.jp> Message-ID: <4320F1AF.6090802@canterbury.ac.nz> Stephen J. Turnbull wrote: > IMO strings that are being printf'd can probably be assumed to be > human readable, and therefore candidates for translation. This That's a dangerous assumption to make, I think. I'd be uncomfortable with having some strings in my program translated automatically and others not. EIBTI here, I feel. -- Greg Ewing, Computer Science Dept, +--------------------------------------+ University of Canterbury, | A citizen of NewZealandCorp, a | Christchurch, New Zealand | wholly-owned subsidiary of USA Inc. | greg.ewing at canterbury.ac.nz +--------------------------------------+ From stephen at xemacs.org Fri Sep 9 08:29:29 2005 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Fri, 09 Sep 2005 15:29:29 +0900 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <4320F1AF.6090802@canterbury.ac.nz> (Greg Ewing's message of "Fri, 09 Sep 2005 14:21:35 +1200") References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <877jdryik1.fsf@tleepslib.sk.tsukuba.ac.jp> <1126180081.882.9.camel@p-dvsi-418-1.rd.francetelecom.fr> <87psrjv7j3.fsf@tleepslib.sk.tsukuba.ac.jp> <4320F1AF.6090802@canterbury.ac.nz> Message-ID: <878xy6ycs6.fsf@tleepslib.sk.tsukuba.ac.jp> >>>>> "Greg" == Greg Ewing writes: Greg> Stephen J. Turnbull wrote: >> IMO strings that are being printf'd can probably be assumed to >> be human readable, and therefore candidates for translation. >> This Greg> That's a dangerous assumption to make, I think. Could be. For me, the name "print" is associated with a long history of magical behavior that only a human could possibly feel comfortable with. One of the great sins of Pascal was tarring the name "write" with the same brush! Greg> I'd be uncomfortable with having some strings in my program Greg> translated automatically and others not. EIBTI here, I Greg> feel. If printf is going to be part of a magical family of print* functions that do things like insert interword spacing and EOLs, I have no problem with documenting that among the other magical things that printf does, it translates strings. This is no less explicit than any other function that bundles several more primitive functions. If instead, we come up with a sufficiently excellent set of formatting and interpolation notations that printf isn't magic at all, simply a function that interprets a precisely defined set of explicit notations, then i18n should have its own notation, too. On reviewing the thread, the latter seems to be the direction things are going. Although several people have defended print's magical behaviors, most of them (and several others) seem at least as excited about a printf with a more economical yet powerful set of operators. -- School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN Ask not how you can "do" free software business; ask what your business can "do for" free software. From fredrik at pythonware.com Fri Sep 9 08:30:36 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Fri, 9 Sep 2005 08:30:36 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> <4320EFA3.8070607@canterbury.ac.nz> Message-ID: Greg Ewing wrote: > Maybe backquotes could be repurposed in Py3k for interpolated > string literals? backquotes are a PITA to type on many non-US keyboards. From theller at python.net Fri Sep 9 09:19:55 2005 From: theller at python.net (Thomas Heller) Date: Fri, 09 Sep 2005 09:19:55 +0200 Subject: [Python-Dev] Replacement for print in Python 3.0 References: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> <4320EFA3.8070607@canterbury.ac.nz> Message-ID: "Fredrik Lundh" writes: > Greg Ewing wrote: > >> Maybe backquotes could be repurposed in Py3k for interpolated >> string literals? > > backquotes are a PITA to type on many non-US keyboards. Even more since they are especially broken in Windows XEmacs. Thomas From ncoghlan at gmail.com Fri Sep 9 10:16:24 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Fri, 09 Sep 2005 18:16:24 +1000 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> <4320EFA3.8070607@canterbury.ac.nz> Message-ID: <432144D8.8020903@gmail.com> Fredrik Lundh wrote: > Greg Ewing wrote: > > >>Maybe backquotes could be repurposed in Py3k for interpolated >>string literals? > > > backquotes are a PITA to type on many non-US keyboards. Not to mention the annoyingly large number of fonts that make '`' and ''' look virtually identical :( Besides, backquotes don't give you a way to supply the values to feed into the interpolated literal the way string.Template does, or a 'format' builtin would. This does make me think of the interesting prospect of an internationalised string literal, though (e.g., _"This an il8n string"). I'm not sure it would be enough of a win over the status quo though, since doing the language conversion at compile time could make it interesting to try and switch languages at run time. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From solipsis at pitrou.net Fri Sep 9 11:28:03 2005 From: solipsis at pitrou.net (Antoine Pitrou) Date: Fri, 09 Sep 2005 11:28:03 +0200 Subject: [Python-Dev] international python In-Reply-To: <432144D8.8020903@gmail.com> References: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> <4320EFA3.8070607@canterbury.ac.nz> <432144D8.8020903@gmail.com> Message-ID: <1126258083.14863.11.camel@p-dvsi-418-1.rd.francetelecom.fr> > This does make me think of the interesting prospect of an internationalised > string literal, though (e.g., _"This an il8n string"). I'm not sure it would > be enough of a win over the status quo though, I don't think so either. i18n doesn't require its specific string notation (in addition, dropping "_()" may make it harder to interface with standard gettext tools). On the hand, international support in Python apps will benefit from: - seamless unicode support: how about making the default Python charset utf-8 instead of ascii ? right now, someone (say an American or English) who does not design his app with non-ascii characters in mind may have a surprise when users enter those characters in customizable fields: for example, debug print statements which end up crashing the app with an UnicodeException on the user's machine, without even a way to diagnose this when the app is a GUI app and stdout is not shown ;)) - simple formatting syntax (the current "%" operator is quite fine in that regard) As for seamless unicode support, there are also problems sometimes with filenames and filepaths: see e.g. https://sourceforge.net/tracker/?func=detail&aid=1283895&group_id=5470&atid=105470 Regards Antoine. From fredrik at pythonware.com Fri Sep 9 12:15:25 2005 From: fredrik at pythonware.com (Fredrik Lundh) Date: Fri, 9 Sep 2005 12:15:25 +0200 Subject: [Python-Dev] international python References: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com><4320EFA3.8070607@canterbury.ac.nz> <432144D8.8020903@gmail.com> <1126258083.14863.11.camel@p-dvsi-418-1.rd.francetelecom.fr> Message-ID: Antoine Pitrou wrote: > - seamless unicode support: how about making the default Python > charset utf-8 instead of ascii ? right now, someone (say an American or > English) who does not design his app with non-ascii characters in mind > may have a surprise when users enter those characters in customizable > fields: for example, debug print statements which end up crashing the > app with an UnicodeException on the user's machine, without even a way > to diagnose this when the app is a GUI app and stdout is not shown ;)) using a variable-width default encoding will break stuff that expect string lengths to be constant, or just prefer their character and slice indices to stay where they are. defaulting to "replace" (or better, an escaping UnicodeEncodeError handler) on the standard output channels would be a better idea. From amk at amk.ca Fri Sep 9 15:13:06 2005 From: amk at amk.ca (A.M. Kuchling) Date: Fri, 9 Sep 2005 09:13:06 -0400 Subject: [Python-Dev] Tools directory (Was RE: Replacement for print in Python 3.0) In-Reply-To: References: Message-ID: <20050909131306.GA22016@rogue.amk.ca> On Thu, Sep 08, 2005 at 06:52:59PM -0700, Brett Cannon wrote: > Otherwise it is mostly a lack of advertisement and them not being > installed by ``make install``. If you just download the soure and Agreed. I've often wished that reindent.py was installed somewhere. > Probably the only way > is to document the directory. How should we document it? Writing man pages for the scripts and installing them is probably the minimum. Would there also need to be a LaTeX document for all the scripts, or is that overkill? --amk From nyamatongwe at gmail.com Fri Sep 9 15:09:13 2005 From: nyamatongwe at gmail.com (Neil Hodgson) Date: Fri, 9 Sep 2005 23:09:13 +1000 Subject: [Python-Dev] international python In-Reply-To: <1126258083.14863.11.camel@p-dvsi-418-1.rd.francetelecom.fr> References: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> <4320EFA3.8070607@canterbury.ac.nz> <432144D8.8020903@gmail.com> <1126258083.14863.11.camel@p-dvsi-418-1.rd.francetelecom.fr> Message-ID: <50862ebd05090906091457689c@mail.gmail.com> Antoine Pitrou: > As for seamless unicode support, there are also problems sometimes with > filenames and filepaths: see e.g. > https://sourceforge.net/tracker/?func=detail&aid=1283895&group_id=5470&atid=105470 This bug report is using byte string arguments causing byte string processing rather than unicode calls with unicode processing. Windows code that may encounter file paths outside the default locale should stick to unicode for paths. Try converting os.curdir to unicode before calling other functions: os.path.abspath(unicode(os.curdir)) Neil From tim.peters at gmail.com Fri Sep 9 16:20:10 2005 From: tim.peters at gmail.com (Tim Peters) Date: Fri, 9 Sep 2005 10:20:10 -0400 Subject: [Python-Dev] Tools directory (Was RE: Replacement for print in Python 3.0) In-Reply-To: References: Message-ID: <1f7befae05090907204e6a8db6@mail.gmail.com> [Brett Cannon] > I assume that the Windows installer includes the Tools/ directory. It installs part of it, not all: C:\Python24\Tools>dir/b i18n pynche Scripts versioncheck webchecker So it's missing these Tools directories: audiopy bgen compiler faqwiz framer freeze modulator msi unicode world Historically, a Tools directory got into the Windows installer iff somone asked for it. From solipsis at pitrou.net Fri Sep 9 16:45:44 2005 From: solipsis at pitrou.net (Antoine Pitrou) Date: Fri, 09 Sep 2005 16:45:44 +0200 Subject: [Python-Dev] international python In-Reply-To: <50862ebd05090906091457689c@mail.gmail.com> References: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> <4320EFA3.8070607@canterbury.ac.nz> <432144D8.8020903@gmail.com> <1126258083.14863.11.camel@p-dvsi-418-1.rd.francetelecom.fr> <50862ebd05090906091457689c@mail.gmail.com> Message-ID: <1126277144.14863.22.camel@p-dvsi-418-1.rd.francetelecom.fr> Le vendredi 09 septembre 2005 ? 23:09 +1000, Neil Hodgson a ?crit : > Antoine Pitrou: > > > As for seamless unicode support, there are also problems sometimes with > > filenames and filepaths: see e.g. > > https://sourceforge.net/tracker/?func=detail&aid=1283895&group_id=5470&atid=105470 > > This bug report is using byte string arguments causing byte string > processing rather than unicode calls with unicode processing. Windows > code that may encounter file paths outside the default locale should > stick to unicode for paths. Try converting os.curdir to unicode before > calling other functions: > > os.path.abspath(unicode(os.curdir)) I don't have a Windows machine at hand right now to test it, but, even if this solution works, it breaks the principle of least astonishment: os.path.abspath() should do the Right Thing regardless of what the current locale is. Regards Antoine. From guido at python.org Fri Sep 9 17:28:15 2005 From: guido at python.org (Guido van Rossum) Date: Fri, 9 Sep 2005 08:28:15 -0700 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <878xy6ycs6.fsf@tleepslib.sk.tsukuba.ac.jp> References: <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <877jdryik1.fsf@tleepslib.sk.tsukuba.ac.jp> <1126180081.882.9.camel@p-dvsi-418-1.rd.francetelecom.fr> <87psrjv7j3.fsf@tleepslib.sk.tsukuba.ac.jp> <4320F1AF.6090802@canterbury.ac.nz> <878xy6ycs6.fsf@tleepslib.sk.tsukuba.ac.jp> Message-ID: On 9/8/05, Stephen J. Turnbull wrote: > Could be. For me, the name "print" is associated with a long history > of magical behavior that only a human could possibly feel comfortable > with. One of the great sins of Pascal was tarring the name "write" > with the same brush! Well, apart from your personal history, and in the light of future developments, for most people who aren't programmers using dinosaur languages, "print" will probably mean "convert a document to bits of ink on paper" or perhaps by extension into the third dimension "produce a physical object from a virtual one". (I've seen some amazing demos of the latter at foocamp, even though the equipment is still a bit big to fit in a typical kitchen. :) While I laugh at the naive view of people who write things like "Interface equality and neutrality would be a good thing in the language" and seriously (? I didn't see a smiley) use this argument to plead for not making print() a built-in, I do think that avoiding the 'print' name would be a good thing if it could be done without ticking off the old-timers. On the third hand, I notice that Java uses read()/write() and class names ending in Stream for a byte-oriented API, and print()/println() with class names ending in Reader/Writer for a text/character-based API. (Some classes provide both print() and write() methods and there the distinction is clearest.) Since Python 3000 is heading in the same direction, I wouldn't mind having some API distinction so it's clearer to the reader whether we are writing binary or or text data. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From skip at pobox.com Fri Sep 9 17:48:05 2005 From: skip at pobox.com (skip@pobox.com) Date: Fri, 9 Sep 2005 10:48:05 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: References: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> <4320EFA3.8070607@canterbury.ac.nz> Message-ID: <17185.44725.259767.288163@montanaro.dyndns.org> Fredrik> backquotes are a PITA to type on many non-US keyboards. Interesting. On US keyboards they are often easier to type than parens... Skip From jimjjewett at gmail.com Fri Sep 9 17:47:52 2005 From: jimjjewett at gmail.com (Jim Jewett) Date: Fri, 9 Sep 2005 11:47:52 -0400 Subject: [Python-Dev] Tools directory (Was RE: Replacement for print in Python 3.0) Message-ID: > How should we document [the tools directory] At the interactive prompt, help() lets me get a list of topics (not including tools), keywords, or modules -- but no mention of tools. I didn't find any references at http://python.org/doc/ The tutorial does mention the standard library (and the library reference documents it), but I didn't find any suggestion in either that there was another library out there under a Tools or Scripts directory. -jJ From rowen at cesmail.net Fri Sep 9 20:31:47 2005 From: rowen at cesmail.net (Russell E. Owen) Date: Fri, 09 Sep 2005 11:31:47 -0700 Subject: [Python-Dev] PEP 3000 and new style classes References: <20050908164312.GA26993@panix.com> <20050908211307.GA506@mithrandi.za.net> Message-ID: In article <20050908211307.GA506 at mithrandi.za.net>, Tristan Seligmann wrote: > * Lisandro Dalcin [2005-09-08 13:56:07 -0300]: > > > Yes, you are right. But this way, you are making explicit a behavior > > that will be implicit in the future. > > > > For example, we could also do: > > > > two = float(4)/float(2) > > > > instead of > > > > from future import division > > two = 4/2 > > Why does it matter if the single statement you insert is spelled > " metaclass = type" instead of "from future import whatever"? > Remember, unlike the division example, you would only have to insert one > statement, as opposed to changing every use of integer division. It matters because "metaclass = type" is completely obscure. How would any non-expert have a clue what it means? -- Russell From hpk at trillke.net Fri Sep 9 22:30:37 2005 From: hpk at trillke.net (holger krekel) Date: Fri, 9 Sep 2005 22:30:37 +0200 Subject: [Python-Dev] PEP 3000 and new style classes In-Reply-To: References: <20050908164312.GA26993@panix.com> <20050908211307.GA506@mithrandi.za.net> Message-ID: <20050909203037.GA7577@solar.trillke.net> On Fri, Sep 09, 2005 at 11:31 -0700, Russell E. Owen wrote: > In article <20050908211307.GA506 at mithrandi.za.net>, > Tristan Seligmann wrote: > > > > Why does it matter if the single statement you insert is spelled > > " metaclass = type" instead of "from future import whatever"? > > Remember, unlike the division example, you would only have to insert one > > statement, as opposed to changing every use of integer division. > > It matters because "metaclass = type" is completely obscure. How would > any non-expert have a clue what it means? How would this non-expert have a clue what "from __future__ import new_style_classes" means? holger From martin.blais at gmail.com Fri Sep 9 22:35:06 2005 From: martin.blais at gmail.com (Martin Blais) Date: Fri, 9 Sep 2005 16:35:06 -0400 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <1126180081.882.9.camel@p-dvsi-418-1.rd.francetelecom.fr> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <877jdryik1.fsf@tleepslib.sk.tsukuba.ac.jp> <1126180081.882.9.camel@p-dvsi-418-1.rd.francetelecom.fr> Message-ID: <8393fff05090913353f6133dc@mail.gmail.com> On 9/8/05, Antoine Pitrou wrote: > > Hi, > > Le jeudi 08 septembre 2005 ? 19:12 +0900, Stephen J. Turnbull a ?crit : > > > It would be > > nice to be able to lose the "_()" calls to gettext(). The function > > would look to see if a message catalog was available for the current > > output stream, and if not, do no translation. > > That doesn't sound right to me. > 1. You still need to do automatic extraction of these strings (gettext > has tools for that, which rely on the use of the "_()" function - or any > other dedicated function (*)). > 2. You can't assume that all strings must be i18n'ed. For example if I'm > interfacing with the user via a text-based network protocol which has a > field named "Length", I don't want that "Length" field to be replaced > with the Japanese translation of the word "Length". > > For i18n, "explicit is better than implicit" ;) The beauty of "_()" is > that it's at the same time explicit, easily recognizable, and very short > to type and read (it doesn't clutter the source code). If I dare say, > the "%" operator has the same qualities. > > (*) of course more automatization of what gettext does could be a nice > improvement too! Here goes something: for applications targeted to the web, where newlines don't matter, the line breaks in _()'ed strings are superfluous. In order to avoid the problem of not being able to "fix" my strings when reindenting the source, and to avoid the need in general to have newlines in the po files, I added an option to pygettext that allows it to flatten the strings in a single line. This does not break the old functionality, just allows you more flexbility in the input source (you can break strings on multiple lines) and the strings in the catalogs are nicer too (no newlines clutter). I submitted a patch on 2005-01-08, nobody has had time to review/integrate it yet. If you're interested, see [ 1098749 ] Single-line option to pygettext.py http://sourceforge.net/tracker/index.php?func=detail&aid=1098749&group_id=5470&atid=305470 cheers, From mcherm at mcherm.com Fri Sep 9 23:12:20 2005 From: mcherm at mcherm.com (Michael Chermside) Date: Fri, 09 Sep 2005 14:12:20 -0700 Subject: [Python-Dev] PEP 3000 and new style classes Message-ID: <20050909141220.agwrsjrqikbowso4@login.werra.lunarpages.com> Lisandro Dalc?n proposes: > Any possibility to add something like > > from __future__ import new_style_classes Tristan Seligmann writes: > Why does it matter if the single statement you insert is spelled > " metaclass = type" instead of "from future import whatever"? Russell Owen responds: > It matters because "metaclass = type" is completely obscure. How would > any non-expert have a clue what it means? Holger asks: > How would this non-expert have a clue what > "from __future__ import new_style_classes" means? Mon-expert users can safely assume that any "from __future__ import" statements are there to future-proof a program or make use of advanced features early. Non-expert users cannot safely assume anything about assignments to __metaclass__ and, in fact, may well break into a cold sweat any time they hear the word "metaclass". I'm not saying that it's necessary, but if it were my call (and it isn't) I'd probably not bother to code "from __future__ import new_style_classes" but I'd probably accept a patch if someone wrote one. I think it would provide a REALLY nice migration path if it were possible to write Python 3.0 code in Python 2.x (for large values of x) so long as you included an appropriate preamble of "from __future__ import" statements. I don't believe we'll ever get it perfect because there would be a few minor incompatibilities no matter how hard we try, but just imagine how the Perl5 users today would feel if they were told that they could use Perl6 code in the Perl5 interpreter by using the "@ .fture. <<" command. I love making Perl users jealous, so I certainly wouldn't vote less than -0 ("I don't care so why bother") on a proposal like this one. -- Michael Chermside From rowen at cesmail.net Fri Sep 9 23:23:19 2005 From: rowen at cesmail.net (Russell E. Owen) Date: Fri, 09 Sep 2005 14:23:19 -0700 Subject: [Python-Dev] PEP 3000 and new style classes References: <20050908164312.GA26993@panix.com> <20050908211307.GA506@mithrandi.za.net> <20050909203037.GA7577@solar.trillke.net> Message-ID: In article <20050909203037.GA7577 at solar.trillke.net>, hpk at trillke.net (holger krekel) wrote: > On Fri, Sep 09, 2005 at 11:31 -0700, Russell E. Owen wrote: > > In article <20050908211307.GA506 at mithrandi.za.net>, > > Tristan Seligmann wrote: > > > > > > Why does it matter if the single statement you insert is spelled > > > " metaclass = type" instead of "from future import whatever"? > > > Remember, unlike the division example, you would only have to insert one > > > statement, as opposed to changing every use of integer division. > > > > It matters because "metaclass = type" is completely obscure. How would > > any non-expert have a clue what it means? > > How would this non-expert have a clue what > "from __future__ import new_style_classes" means? Because it's plain english. Also because it's easy to look up. For example: google for: - python "from __future__ import"; the third link is useful, though a bit technical; presumably it's is in the manual somewhere as well - python "new style classes" python; the first link is useful If and when the __future__ directive under discussion is added, I would try googling for the whole line and probably hit it on the first go. Now try that with "metaclass = type". Good luck. I tried all sorts of variants and came up with nothing except a tutorial on metaclasses, which was interesting, but NOT a ready explanation of what "metaclass = type" does. -- Russell From dalcinl at gmail.com Sat Sep 10 00:15:01 2005 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Fri, 9 Sep 2005 19:15:01 -0300 Subject: [Python-Dev] PEP 3000 and iterators Message-ID: PEP 3000 says (http://www.python.org/peps/pep-3000.html) : Core language - Return iterators instead of lists where appropriate for atomic type methods (e.g. dict.keys(), dict.values(), dict.items(), etc.) Built-in Namespace - Make built-ins return an iterator where appropriate (e.g. range(), zip(), etc.) - Relevant functions should consume iterators (e.g. min(), max()) To be removed: - xrange(): use range() instead [1] Any possibility to add one (or more) __future__ statement to implicitly get this behavior? Any suggestion about naming? -- Lisandro Dalc?n --------------- Centro Internacional de M?todos Computacionales en Ingenier?a (CIMEC) Instituto de Desarrollo Tecnol?gico para la Industria Qu?mica (INTEC) Consejo Nacional de Investigaciones Cient?ficas y T?cnicas (CONICET) PTLC - G?emes 3450, (3000) Santa Fe, Argentina Tel/Fax: +54-(0)342-451.1594 From gustavo at niemeyer.net Sat Sep 10 01:47:09 2005 From: gustavo at niemeyer.net (Gustavo Niemeyer) Date: Fri, 9 Sep 2005 20:47:09 -0300 Subject: [Python-Dev] SIGPIPE => SIG_IGN? Message-ID: <20050909234709.GA24789@localhost.localdomain> Greetings, I was wondering, why are we setting SIGPIPE to SIG_IGN in initsigs(): static void initsigs(void) { #ifdef SIGPIPE PyOS_setsig(SIGPIPE, SIG_IGN); #endif [...] } One of the side effects is: >>> os.system("yes | read any") yes: standard output: Broken pipe yes: write error 0 >>> os.system("yes | head -1") y yes: standard output: Broken pipe yes: write error 0 That stops when setting to SIG_DFL: >>> signal.signal(signal.SIGPIPE, signal.SIG_DFL) 1 >>> os.system("yes | head -1") y 0 >>> os.system("yes | read any") 0 Out of curiosity, many of the google results for "yes: standard output: Broken pipe" are from Python programs. :-) Regards, -- Gustavo Niemeyer http://niemeyer.net From guido at python.org Sat Sep 10 02:42:02 2005 From: guido at python.org (Guido van Rossum) Date: Fri, 9 Sep 2005 17:42:02 -0700 Subject: [Python-Dev] PEP 3000 and new style classes In-Reply-To: References: <20050908164312.GA26993@panix.com> <20050908211307.GA506@mithrandi.za.net> <20050909203037.GA7577@solar.trillke.net> Message-ID: Can you all just stop discussing this? In the last 4 contributions nothing has been added that hasn't been said yet. It's not going to change. Get used to it.There are more important issues. On 9/9/05, Russell E. Owen wrote: > In article <20050909203037.GA7577 at solar.trillke.net>, > hpk at trillke.net (holger krekel) wrote: > > > On Fri, Sep 09, 2005 at 11:31 -0700, Russell E. Owen wrote: > > > In article <20050908211307.GA506 at mithrandi.za.net>, > > > Tristan Seligmann wrote: > > > > > > > > Why does it matter if the single statement you insert is spelled > > > > " metaclass = type" instead of "from future import whatever"? > > > > Remember, unlike the division example, you would only have to insert one > > > > statement, as opposed to changing every use of integer division. > > > > > > It matters because "metaclass = type" is completely obscure. How would > > > any non-expert have a clue what it means? > > > > How would this non-expert have a clue what > > "from __future__ import new_style_classes" means? > > Because it's plain english. > > Also because it's easy to look up. For example: > > google for: > - python "from __future__ import"; the third link is useful, though a > bit technical; presumably it's is in the manual somewhere as well > - python "new style classes" python; the first link is useful > > If and when the __future__ directive under discussion is added, I would > try googling for the whole line and probably hit it on the first go. > > Now try that with "metaclass = type". Good luck. I tried all sorts of > variants and came up with nothing except a tutorial on metaclasses, > which was interesting, but NOT a ready explanation of what "metaclass = > type" does. > > -- Russell > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/guido%40python.org > -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Sat Sep 10 02:49:39 2005 From: guido at python.org (Guido van Rossum) Date: Fri, 9 Sep 2005 17:49:39 -0700 Subject: [Python-Dev] PEP 3000 and iterators In-Reply-To: References: Message-ID: On 9/9/05, Lisandro Dalcin wrote: > PEP 3000 says > (http://www.python.org/peps/pep-3000.html) : > > Core language > - Return iterators instead of lists where appropriate for atomic type > methods (e.g. dict.keys(), dict.values(), dict.items(), etc.) > > Built-in Namespace > - Make built-ins return an iterator where appropriate (e.g. range(), > zip(), etc.) > - Relevant functions should consume iterators (e.g. min(), max()) > To be removed: > - xrange(): use range() instead [1] > > Any possibility to add one (or more) __future__ statement to > implicitly get this behavior? Any suggestion about naming? For the builtins, it would actually be possible to do this by simply importing an alternate builtins module. Something like from future_builtins import min, max, zip, range For methods on standard objects like dicts it's not really possible either way; the type of a dict is determined by the module containing the code creating it, not the module containing the code using it. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Sat Sep 10 02:51:13 2005 From: guido at python.org (Guido van Rossum) Date: Fri, 9 Sep 2005 17:51:13 -0700 Subject: [Python-Dev] SIGPIPE => SIG_IGN? In-Reply-To: <20050909234709.GA24789@localhost.localdomain> References: <20050909234709.GA24789@localhost.localdomain> Message-ID: > I was wondering, why are we setting SIGPIPE to SIG_IGN > in initsigs(): Because you can get a SIGPIPE from writing to a socket whose other side has shut down, and we want to turn that into an error. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From ncoghlan at gmail.com Sat Sep 10 03:54:31 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 10 Sep 2005 11:54:31 +1000 Subject: [Python-Dev] Tools directory (Was RE: Replacement for print in Python 3.0) In-Reply-To: References: Message-ID: <43223CD7.8060507@gmail.com> Jim Jewett wrote: >>How should we document [the tools directory] > > > At the interactive prompt, help() lets me get a list > of topics (not including tools), keywords, or modules -- > but no mention of tools. > > I didn't find any references at http://python.org/doc/ > > The tutorial does mention the standard library (and > the library reference documents it), but I didn't find > any suggestion in either that there was another > library out there under a Tools or Scripts directory. Even adding something (e.g., Tools/README) to the "undocumented modules" section of the standard library would be an improvement on the status quo. I also noticed that the Windows installer does *not* install "Tools/README.txt", so there isn't even a synopsis of the tools which _are_ included with the Windows installer. Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From ncoghlan at gmail.com Sat Sep 10 04:17:05 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 10 Sep 2005 12:17:05 +1000 Subject: [Python-Dev] PEP 3000 and iterators In-Reply-To: References: Message-ID: <43224221.1090506@gmail.com> Guido van Rossum wrote: > On 9/9/05, Lisandro Dalcin wrote: >>Any possibility to add one (or more) __future__ statement to >>implicitly get this behavior? Any suggestion about naming? > > > For the builtins, it would actually be possible to do this by simply > importing an alternate builtins module. Something like > > from future_builtins import min, max, zip, range > > For methods on standard objects like dicts it's not really possible > either way; the type of a dict is determined by the module containing > the code creating it, not the module containing the code using it. However, such a "future_builtins" module could still include modified versions of those standard objects - such "future-proofed" code would simply still need to deal with the fact that other libraries or clients may pass in the old-style components (e.g. just as unicode-aware code needs to deal with the fact that other libraries or clients may produce 8-bit strings rather than unicode text). Also, an alternative to changing the builtins piecemeal would be to have "__python3_builtin__" in sys.modules and do: try: import __python3_builtin__ __builtins__ = __python3_builtin__ except ImportError: # What you do here depends on whether or not __python3_builtin__ # stays around in Py3k or not. You could then write a script to extract all known changes to the Py3k builtins by looking for differences between the two modules. Another trick would be to have an "everything in Python 3" option for any syntax changes too: from __future__ import __python3__ Cheers, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From nyamatongwe at gmail.com Sat Sep 10 05:17:55 2005 From: nyamatongwe at gmail.com (Neil Hodgson) Date: Sat, 10 Sep 2005 13:17:55 +1000 Subject: [Python-Dev] international python In-Reply-To: <1126277144.14863.22.camel@p-dvsi-418-1.rd.francetelecom.fr> References: <20050908051402.m7j55nypuvwg448k@login.werra.lunarpages.com> <4320EFA3.8070607@canterbury.ac.nz> <432144D8.8020903@gmail.com> <1126258083.14863.11.camel@p-dvsi-418-1.rd.francetelecom.fr> <50862ebd05090906091457689c@mail.gmail.com> <1126277144.14863.22.camel@p-dvsi-418-1.rd.francetelecom.fr> Message-ID: <50862ebd050909201713dae184@mail.gmail.com> Antoine Pitrou: > I don't have a Windows machine at hand right now to test it, but, even > if this solution works, it breaks the principle of least astonishment: Astonishment is subjective and so a poor tool to measure by. At one stage Ruby tried to follow the more common formulation "principle of least surprise" (POLS) but this produced arguments of the following form: I am surprised by X. Therefore, X contradicts POLS. Therefore, X must be fixed. POLS was then abandoned. > os.path.abspath() should do the Right Thing regardless of what the > current locale is. This was discussed recently and the consensus position was for functions that can not return a value in the default encoding to instead return a unicode value. Correct implementation of this would require not only changing the behaviour of functions returning strings but also those receiving strings (which should treat byte strings as being in the default encoding). This would require a large amount of work, and is unlikely to be performed in the near future. Neil From t-meyer at ihug.co.nz Sat Sep 10 05:41:22 2005 From: t-meyer at ihug.co.nz (Tony Meyer) Date: Sat, 10 Sep 2005 15:41:22 +1200 Subject: [Python-Dev] [draft] python-dev Summary for 2005-08-16 through 2005-08-31 Message-ID: If anyone would like to take a break from all this Py3k discussion, please feel free to read through the following draft for the second August summary. Checking over the "O(N**2) behaviour in StreamReader.readline" summary would be particularly appreciated. As always, any corrections/suggestions should be sent to me or Steve (steven.bethard at gmail.com). Thanks! ============= Announcements ============= ------------------ PyPy release 0.7.0 ------------------ PyPy_ has a new release, 0.7.0, which is now a fully self-contained Python implementation. It includes whole-program type inference and translation to both C and LLVM, a choice of refcounting or Boehm garbage collectors, language-level compliancy with CPython 2.4.1 and much more. If you haven't already, now's the time to check it out! .. _PyPy: http://codespeak.net/pypy/ Contributing thread: - `PyPy release 0.7.0 `__ [SJB] ------------------ New mailbox module ------------------ Gregory K. Johnson, who's been working with A.M. Kuchling for Google's Summer of Code, has completed a new version of the mailbox module that allows the adding and removing of messages. It will be double-checked for code quality, complete documentation, and full backwards-compatibility and then hopefully checked in. Contributing thread: - `New mailbox module `__ [SJB] ========= Summaries ========= -------- str.find -------- Terry Reedy suggested that str.find() be removed in Python 3.0, in favour of str.index(); the main issue with str.find() is that it returns -1 on failure, leading to the common "if s.find(sub):" bug (which should be "if sub in s:"); -1 is also a valid index into a string. Guido agreed that removal would be a good idea, however Tim Peters pointed out that the requirement to use a try/except clause can lead to another kind of sloppy code. Raymond Hettinger suggested that the ideal solution would be to replace str.find() with new methods, str.partition() and str.rpartition(), which work like:: >>> s = ' http://www.python.org' >>> s.partition('://') ('http', '://', 'www.python.org') >>> s.rpartition('.') (' http://www.python', '.', 'org') >>> s.partition('?') ('' http://www.python.org', '', '') Replacing str.find() with str.partition() in the standard library generally resulted in much cleaner and clearer code, without requiring the addition of try/except blocks. Comments were overwhelmingly in favour of this new method. "part" and "cut" were suggested as alternative names to "partition", although Raymond is very attached to the "partition" name. Contributing threads: - `Remove str.find in 3.0? `__ - `partition() (was: Remove str.find in 3.0?) `__ - `Proof of the pudding: str.partition() `__ - `partition() `__ - `Alternative name for str.partition() `__ [TAM] ------------------------------------------------ PEP 348: Exception Reorganization for Python 3.0 ------------------------------------------------ This fortnight saw the final demise of `PEP 348`_. This began with `Guido's agreement`_ to remove bare "except:" from Python 3.0 entirely. Introducing a transition plan for this change in Python 3.0 proved problematic, however. To quote Michael Chermside, "no syntax will work in BOTH 2.5 and 3.0". For example, the proposed Python 3.0 code:: try: my_result = call_some_library(my_data) except Exception: # doesn't catch KeyboardInterrupt or SystemExit report_nonterminal_error() would need to be written in Python 2.5 as:: try: my_result = call_some_library(my_data) except (KeyboardInterrupt, SystemExit): raise except: report_nonterminal_error() Note that the final ``except:`` in the 2.5 code can't be written as ``except Exception:`` - Python 2.5 will still allow exceptions that do not derive from Exception (e.g. string exceptions). Thus deprecating bare ``except:`` would mean that some code would produce warnings, and yet not have any way to be rewritten that would be upwards-compatible. As a result, Guido rejected the entire PEP. .. _PEP 348: http://www.python.org/peps/pep-0348.html .. _Guido's agreement: http://mail.python.org/pipermail/python-dev/2005-August/055620.html Contributing threads: - `Bare except clauses in PEP 348 `__ - `FW: Bare except clauses in PEP 348 `__ - `rev. 1.9 of PEP 348: Raymond tested, Guido approved `__ [SJB] ----------------------------------------------- PEP 347: Migrating the Python CVS to Subversion ----------------------------------------------- Discussion about the conversion to subversion and subsequent move of the Python source repository to svn.python.org (outlined in `PEP 347`_) continued this fortnight. Discussion particularly covered the means of authentication that would be used to access svn.python.org, how names would appear in revision logs, and other minor details like that. Martin has set up a test installation (for current Python developers; there is no anonymous access) on svn.python.org, to check that the system will work as described in the PEP. Assuming that things go well with this test installation, it seems likely that the PEP will be accepted and the migration will take place at some point in the future. .. _PEP 347: http://www.python.org/peps/pep-0347.html Contributing threads: - `PEP 347: Migration to Subversion `__ - `Admin access using svn+ssh `__ - `Collecting SSH keys `__ - `On distributed vs centralised SCM for Python `__ - `Fwd: Distributed RCS `__ - `wush.net details `__ - `Subversion instructions `__ [TAM] -------------------------- Partial method application -------------------------- Ian Bicking suggested a partialmethod() function along the lines of the operator module's itemgetter() and attrgetter(). The partialmethod() function would allow the "self" argument of a method to be supplied later, e.g.:: lst = ['A', 'b', 'C'] lst.sort(key=partialmethod('lower')) Martin v. L?wis argued (convincing Guido at least) that a better style for delayed method calls would look something like:: lst.sort(key=virtual.lower) where the "virtual" object would serve as a virtual instance so that the "self" argument to the "lower" method could be supplied later. There was a brief discussion about consistency between this proposal and the operator module's itemgetter() and attrgetter() which, unlike Martin's proposal, use argument strings instead of attributes to determine the appropriate function to produce. Additionally, in Python 2.5, both itemgetter() and attrgetter() will allow multiple arguments, while none of the method-calling solutions above extended reasonably to multiple methods. However, people seemed in general agreement that the use case was a single method call, and that supporting multiple method calls was unnecessary. The thread concluded without coming to a full resolution. For the moment at least, it seems that defining a regular Python function for the key= argument is still considered the best style. Contributing thread: - `PEP 309: Partial method application `__ [SJB] ----------------------------- Moving id() to the sys module ----------------------------- Christian Robottom Reis suggested that the built-in function id() should be moved into the sys module, as "id" is a useful and common name, to avoid shadowing built-ins. This is also list as one of `Guido's regrets`_. He asked whether adding sys.id() would be possible in 2.5, and adding a deprecation warning to __builtins__.id() (to be removed in Python 3.0). This gathered quite a lot of support, and few comments against the proposal. However, Anthony Baxter warned that using the warnings module is expensive, and so issuing a deprecation warning might not be a good idea. Interestingly, Guido's opinion (not universally shared) is that shadowing at least some built-in names is perfectly acceptable. It wasn't clear whether this meant that he would be against the move, or that his reasons for the move were different (e.g. simply a more appropriate place, reducing the number of built-ins). .. _Guido's regrets: http://www.python.org/doc/essays/ppt/regrets/PythonRegrets.ppt Contributing thread: - `Deprecating builtin id (and moving it to sys()) `__ [TAM] ----------------------------------------- O(N**2) behavior in StreamReader.readline ----------------------------------------- Keir Mierle reported a problem where _PyUnicodeUCS2_IsLinebreak was called excessively, resulting in a huge slowdown of a CGI script. The code that caused the slowdown was adding the encoding line "# -*- coding: iso8859-1 -*-"; this is caused by changes to codecs in 2.4, which no longer rely on C's readline() to do line splitting, but use unicode.splitlines() instead, and also that StreamReader.readline performs splitline on the entire input, only to fetch the first line, and also uses splitlines on a single line to remove any trailing line breaks. As a result, for a file with N lines, IsLinebreak is invoked up to N*N/2 times per character. Walter D?rwald and Martin v. L?wis worked on solutions to this problem. Martin's `eventual solution`_ keeps the splitlines result and only invokes IsLinebreak once per character, and copies each character only one. This should be much faster than the current code. .. _eventual solution: http://www.python.org/sf/1268314 Contributing threads: - `51 Million calls to _PyUnicodeUCS2_IsLinebreak() (???) `__ - `[Argon] Re: 51 Million calls to _PyUnicodeUCS2_IsLinebreak() (???) `__ [TAM] ---------------------------------------------- PEP 349: Allow str() to return unicode strings ---------------------------------------------- Neil Schemenauer updated `PEP 349`_, which had previously proposed a text() builtin, to instead propose that str() be allowed to return unicode strings. The new str() would remove two types of UnicodeEncodeErrors that str() had previously raised: * No UnicodeEncodeError would be raised if the argument to str() is unicode. Instead, the unicode object would be returned unmodified. * No UnicodeEncodeError would be raised if the argument's __str__() method returns a unicode object. Instead, this returned object would be in turn returned from the str() call. In the following brief discussion, it was suggested that unicode.__str__ should be changed to return "self" instead of trying to encode itself into ascii. (Otherwise subclasses of unicode would likely get UnicodeEncodeErrors when their __str__() methods were called.) There was a little feedback on the proposal, with a few people wanting to go back to the text() builtin instead of changing str(), but no final decisions were made. .. _PEP 349: http://mail.python.org/pipermail/python-dev/2005-August/055557.html Contributing thread: - `Revised PEP 349: Allow str() to return unicode strings `__ [SJB] --------------------------------- One argument form of setdefault() --------------------------------- Tim Peters asked if anyone remembered why setdefault's second argument is optional, given that it doesn't seem at all useful, and that he wasn't able to find any use cases outside of the test suite. The likely explanation seemed that it was a result of setdefault() following the behaviour of dict.get(). Tim suggested dropping the optional nature of the second argument for Python 3.0 - Raymond upped this by suggesting that it could be done earlier (e.g. with a deprecation warning in 2.5 and gone in 2.6). Tim did later find a use (in Zope), but the author of the code, David Goodger, indicated that it would probably be better written in other ways, and that if dict.pop() could be used (it was introduced in Python 2.3) then that would be preferable, so it still seems likely that the second argument will be made mandatory. Contributing thread: - `setdefault's second argument `__ [TAM] --------------------------- dir() returning non-strings --------------------------- As a result of a suggested patch by Michael Krasnyk, the question of whether dir() should only return strings was raised. Guido's position was that dir() should hide non-strings, as these are not attributes if you use the definition that an attribute name is a valid parameter to a getattr() or setattr() call. Guido suggested that a useful relationship (excluding where __getattr__ or __getattribute__ is overridden) is:: name in dir(x) <==> getattr(x, name) is valid Contributing threads: - `SWIG and rlcompleter `__ [TAM] ----------------- file.readblocks() ----------------- Raymond Hettinger wanted to move away from the current empty-string API that file objects use for indicating that the end of the file has been reached. To cover at least some of the use-cases, he suggested a readblocks() method, so that code like:: while 1: block = f.read(20) if line == '': break ... could be instead written as:: for block in f.readblocks(20): ... Guido couldn't see a use case for this though, and suggested that there were other issues with files/streams that were more important (e.g. buffering transparency and character set encodings) some of which he'd been working on in the sandbox_. .. _sandbox: http://cvs.sourceforge.net/viewcvs.py/python/python/nondist/sandbox/sio/ Contributing thread: - `empty string api for files `__ [SJB] ---------------------------- Python 3.0 design principles ---------------------------- Raymond Hettinger is planning to put together a draft list of Python design principles. For example, "don't let the *type* of the return value depend on the *value* of the arguments". These will complement the Zen of Python, and provide a document to refer people to when proposing new/changed features. Although in design principle discussion, there has been discussion about the Python 2.x->Python 3.0 transition, and whether it will be possible to write code that runs in both Python 2.x and 3.0. [TAM] Contributing threads: - `Design Principles `__ - `Python 3 design principles `__ =============== Skipped Threads =============== - `Extension to dl module to allow passing strings from native function `__ - `implementation of copy standard lib `__ - `dev listinfo page (was: Re: Python + Ping) `__ - `remote debugging with pdb `__ - `A testing challenge `__ - `On decorators implementation `__ - `[Python-checkins] python/dist/src setup.py, 1.219, 1.220 `__ - `Weekly Python Patch/Bug Summary `__ - `[Python-checkins] python/dist/src/Modules _hashopenssl.c, NONE, 2.1 sha256module.c, NONE, 2.1 sha512module.c, NONE, 2.1 md5module.c, 2.35, 2.36 shamodule.c, 2.22, 2.23 `__ - `PEP 342 Implementation `__ - `Modules _hashopenssl, sha256, sha512 compile in MinGW, test_hmac.py passes `__ - `python/dist/src/Doc/tut tut.tex,1.276,1.277 `__ - `Docs/Pointer to Tools/scripts? `__ - `python-dev Summary for 2005-08-01 through 2005-08-15 [draft] `__ - `Style for raising exceptions (python-dev Summary for 2005-08-01 through 2005-08-15 [draft]) `__ - `PEP 342: simple example, closure alternative `__ - `operator.c for release24-maint and test_bz2 on Python 2.4.1 `__ - `test_bz2 on Python 2.4.1 `__ - `[Python-checkins] python/dist/src/Lib/test test_bz2.py, 1.18, 1.19 `__ - `test_bz2 fails on Python 2.4.1 from CVS, passes on same from source archieve `__ - `Python 3.0 blocks? `__ - `Any detail list of change between version 2.1-2.2-2.3-2.4 of Python? `__ - `info/advices about python readline implementation `__ - `test_bz2 and Python 2.4.1 `__ - `[Python-checkins] python/dist/src/Doc/whatsnew whatsnew25.tex, 1.18, 1.19 `__ - `Revising RE docs (was: partition() (was: Remove str.find in 3.0?)) `__ - `Switching re and sre `__ From __peter__ at web.de Sat Sep 10 09:22:59 2005 From: __peter__ at web.de (Peter Otten) Date: Sat, 10 Sep 2005 09:22:59 +0200 Subject: [Python-Dev] [Python-checkins] python/dist/src/Lib urllib.py, 1.169, 1.170 In-Reply-To: <20050910022744.A46E21E4004@bag.python.org> References: <20050910022744.A46E21E4004@bag.python.org> Message-ID: <200509100923.00060.__peter__@web.de> Am Samstag, 10. September 2005 04:27 schrieb rhettinger at users.sourceforge.net: > Update of /cvsroot/python/python/dist/src/Lib > In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv3622 > > Modified Files: > urllib.py > Log Message: > Simplify and speed-up quote_plus(). > > Index: urllib.py > =================================================================== > RCS file: /cvsroot/python/python/dist/src/Lib/urllib.py,v > retrieving revision 1.169 > retrieving revision 1.170 > diff -u -d -r1.169 -r1.170 > --- urllib.py 9 Sep 2005 22:27:13 -0000 1.169 > +++ urllib.py 10 Sep 2005 02:27:41 -0000 1.170 > @@ -1115,12 +1115,9 @@ > def quote_plus(s, safe = ''): > """Quote the query fragment of a URL; replacing ' ' with '+'""" > if ' ' in s: > - l = s.split(' ') > - for i in range(len(l)): > - l[i] = quote(l[i], safe) > - return '+'.join(l) > - else: > - return quote(s, safe) > + s = s.replace(' ', '+') > + safe += '+' > + return quote(s, safe) > > def urlencode(query,doseq=0): > """Encode a sequence of two-element tuples or dictionary into a URL > query string. You also change the behaviour. Before: >>> urllib.quote_plus("alpha+beta gamma") 'alpha%2Bbeta+gamma' After: >>> urllib.quote_plus("alpha+beta gamma") 'alpha+beta+gamma' Is that intentional? If so, you also have to update the documentation, which currently reads: quote_plus(string[, safe]) ... Plus signs in the original string are escaped unless they are included in safe. ... Peter From widjason8 at bellsouth.net Fri Sep 9 10:27:42 2005 From: widjason8 at bellsouth.net (Jason) Date: Fri, 09 Sep 2005 08:27:42 +0000 Subject: [Python-Dev] Wanting to learn Message-ID: <1126254462.3888.5.camel@localhost.localdomain> Hi My name is Jason & i have a great interest in progamming whether it be python or what have you. From my understanding Python is written in C right ? I am willing to do grunt work just to learn .I a quick to catch on given the right path to follow.Please let me know if you will let me learn help in my endeavor to learn to program. I am eager to hear back .Thanks for your time.Or if not Maybe you could point me in the right direction Jason From dalcinl at gmail.com Fri Sep 9 23:17:54 2005 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Fri, 9 Sep 2005 18:17:54 -0300 Subject: [Python-Dev] PEP 3000 and new style classes In-Reply-To: <20050909203037.GA7577@solar.trillke.net> References: <20050908164312.GA26993@panix.com> <20050908211307.GA506@mithrandi.za.net> <20050909203037.GA7577@solar.trillke.net> Message-ID: On 9/9/05, holger krekel wrote: > > > > It matters because "metaclass = type" is completely obscure. How would > > any non-expert have a clue what it means? > > How would this non-expert have a clue what > "from __future__ import new_style_classes" means? > That is the point!!! If I am a developer, I think is better to have a __future__ statement for things that are planned to change in the future. If you are a non-expert, you will google for new style classes, and I think is far easier to understand what a new style class is than metaclasses. How many non-expert knows about nested scopes subjects introduced in Py2.1? it has a __future__ statement. Additionaly, a __future__ statement can be easily removed when a new Python release cames. We could simply use the script Tools/scripts/cleanfuture.py to eliminate those __future__ imports when Py3.0 is available. The same applies for generators in Py2.4 ... -- Lisandro Dalc?n --------------- Centro Internacional de M?todos Computacionales en Ingenier?a (CIMEC) Instituto de Desarrollo Tecnol?gico para la Industria Qu?mica (INTEC) Consejo Nacional de Investigaciones Cient?ficas y T?cnicas (CONICET) PTLC - G?emes 3450, (3000) Santa Fe, Argentina Tel/Fax: +54-(0)342-451.1594 From dalcinl at gmail.com Fri Sep 9 23:38:18 2005 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Fri, 9 Sep 2005 18:38:18 -0300 Subject: [Python-Dev] PEP 3000 and new style classes In-Reply-To: <20050909141220.agwrsjrqikbowso4@login.werra.lunarpages.com> References: <20050909141220.agwrsjrqikbowso4@login.werra.lunarpages.com> Message-ID: On 9/9/05, Michael Chermside wrote: > I think it would > provide a REALLY nice migration path if it were possible to write > Python 3.0 code in Python 2.x (for large values of x) so long as you > included an appropriate preamble of "from __future__ import" statements. Perhaps I was not clear, but that was the reason of my proposal. I agree it is not necesary, but this will help developers to transition from Py2.X to Py3.0 in a consistent way. I will vote +1 for any attemps to populate __future__ module if that enables writing working Py3K code as soon as possible in Py2.X .. > I love making Perl users jealous, So I love! ;) -- Lisandro Dalc?n --------------- Centro Internacional de M?todos Computacionales en Ingenier?a (CIMEC) Instituto de Desarrollo Tecnol?gico para la Industria Qu?mica (INTEC) Consejo Nacional de Investigaciones Cient?ficas y T?cnicas (CONICET) PTLC - G?emes 3450, (3000) Santa Fe, Argentina Tel/Fax: +54-(0)342-451.1594 From pjones at redhat.com Sat Sep 10 21:29:31 2005 From: pjones at redhat.com (Peter Jones) Date: Sat, 10 Sep 2005 15:29:31 -0400 Subject: [Python-Dev] unintentional and unsafe use of realpath() Message-ID: <1126380571.21655.18.camel@localhost.localdomain> Hi, In Python 2.4.1, Python/sysmodule.c includes a function PySys_SetArgv(). One of the things it does is attempt to resolve symbolic links into absolute paths. Currently, it uses readlink() if configure found that your system supports it, and then it tries to do the same thing again using realpath() if you system supports that. This seems wrong; there's really no reason to do both. So here's a patch to move the realpath() usage into a #else following the HAVE_READLINK test: --- Python-2.4.1/Python/sysmodule.c.readlink 2005-09-10 14:05:26.000000000 -0400 +++ Python-2.4.1/Python/sysmodule.c 2005-09-10 14:06:00.000000000 -0400 @@ -1211,7 +1211,7 @@ } } } -#endif /* HAVE_READLINK */ +#else /* HAVE_READLINK */ #if SEP == '\\' /* Special case for MS filename syntax */ if (argc > 0 && argv0 != NULL) { char *q; @@ -1244,6 +1244,7 @@ #endif p = strrchr(argv0, SEP); } +#endif /* HAVE_READLINK */ if (p != NULL) { #ifndef RISCOS n = p + 1 - argv0; Another problem (which I have not fixed) is that when realpath() is used, in some cases MAXPATHLEN is smaller than the system's PATH_MAX/pathconf(path, _PC_PATH_MAX). When that happens, the realpath() usage is a potential buffer overflow, which can be selectively aggravated using a carefully constructed symbolic link. This is the case currently on Fedora Core (Linux) at least, and possibly other OSes. Recent versions of gcc include a feature to check for this kind of bug at runtime, and if you build python with gcc+glibc and the gcc command line arguments "-Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector" it will give you a failure that looks something like: case $MAKEFLAGS in \ *-s*) LD_LIBRARY_PATH=/home/pjones/build/BUILD/Python-2.4.1: CC='gcc -pthread' LDSHARED='gcc -pthread -shared' OPT='-O2 -g -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=4 -m32 -march=i386 -mtune=pentium4 -fasynchronous-unwind-tables -D_GNU_SOURCE -fPIC -I/usr/kerberos/include ' ./python -E ./setup.py -q build;; \ *) LD_LIBRARY_PATH=/home/pjones/build/BUILD/Python-2.4.1: CC='gcc -pthread' LDSHARED='gcc -pthread -shared' OPT='-O2 -g -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=4 -m32 -march=i386 -mtune=pentium4 -fasynchronous-unwind-tables -D_GNU_SOURCE -fPIC -I/usr/kerberos/include ' ./python -E ./setup.py build;; \ esac *** buffer overflow detected ***: ./python terminated ======= Backtrace: ========= /lib/libc.so.6(__chk_fail+0x41)[0x590495] /lib/libc.so.6(__ptsname_r_chk+0x0)[0x590ac0] /home/pjones/build/BUILD/Python-2.4.1/libpython2.4.so.1.0(PySys_SetArgv +0x1a1)[0x228cf5] /home/pjones/build/BUILD/Python-2.4.1/libpython2.4.so.1.0(Py_Main +0x671)[0x22bedd] ./python(main+0x2a)[0x804859a] /lib/libc.so.6(__libc_start_main+0xdf)[0x4c74ff] ./python[0x80484ed] ======= Memory map: ======== 00185000-0026e000 r-xp 00000000 03:03 278531 /home/pjones/build/BUILD/Python-2.4.1/libpython2.4.so.1.0 0026e000-00295000 rwxp 000e8000 03:03 278531 /home/pjones/build/BUILD/Python-2.4.1/libpython2.4.so.1.0 00295000-00298000 rwxp 00295000 00:00 0 00495000-004ae000 r-xp 00000000 03:03 1966150 /lib/ld-2.3.90.so 004ae000-004af000 r-xp 00018000 03:03 1966150 /lib/ld-2.3.90.so 004af000-004b0000 rwxp 00019000 03:03 1966150 /lib/ld-2.3.90.so 004b2000-005d7000 r-xp 00000000 03:03 1966261 /lib/libc-2.3.90.so 005d7000-005d9000 r-xp 00124000 03:03 1966261 /lib/libc-2.3.90.so 005d9000-005db000 rwxp 00126000 03:03 1966261 /lib/libc-2.3.90.so 005db000-005dd000 rwxp 005db000 00:00 0 005df000-00602000 r-xp 00000000 03:03 1966272 /lib/libm-2.3.90.so 00602000-00603000 r-xp 00022000 03:03 1966272 /lib/libm-2.3.90.so 00603000-00604000 rwxp 00023000 03:03 1966272 /lib/libm-2.3.90.so 00606000-00608000 r-xp 00000000 03:03 1966270 /lib/libdl-2.3.90.so 00608000-00609000 r-xp 00001000 03:03 1966270 /lib/libdl-2.3.90.so 00609000-0060a000 rwxp 00002000 03:03 1966270 /lib/libdl-2.3.90.so 006f7000-00705000 r-xp 00000000 03:03 1966283 /lib/libpthread-2.3.90.so 00705000-00706000 r-xp 0000d000 03:03 1966283 /lib/libpthread-2.3.90.so 00706000-00707000 rwxp 0000e000 03:03 1966283 /lib/libpthread-2.3.90.so 00707000-00709000 rwxp 00707000 00:00 0 0073f000-00743000 r-xp 00000000 03:03 1442230 /home/pjones/build/BUILD/Python-2.4.1/Modules/stropmodule.so 00743000-00745000 rwxp 00004000 03:03 1442230 /home/pjones/build/BUILD/Python-2.4.1/Modules/stropmodule.so 00a22000-00a2b000 r-xp 00000000 03:03 1966127 /lib/libgcc_s-4.0.1-20050906.so.1 00a2b000-00a2c000 rwxp 00009000 03:03 1966127 /lib/libgcc_s-4.0.1-20050906.so.1 00df7000-00df9000 r-xp 00000000 03:03 1968751 /lib/libutil-2.3.90.so 00df9000-00dfa000 r-xp 00001000 03:03 1968751 /lib/libutil-2.3.90.so 00dfa000-00dfb000 rwxp 00002000 03:03 1968751 /lib/libutil-2.3.90.so 00e33000-00e34000 r-xp 00e33000 00:00 0 [vdso] 08048000-08049000 r-xp 00000000 03:03 278551 /home/pjones/build/BUILD/Python-2.4.1/python 08049000-0804a000 rw-p 00000000 03:03 278551 /home/pjones/build/BUILD/Python-2.4.1/python 08086000-0810b000 rw-p 08086000 00:00 0 [heap] b7e69000-b7f2c000 rw-p b7e69000 00:00 0 b7f2d000-b7fb1000 rw-p b7f2d000 00:00 0 b7fc4000-b7fc5000 rw-p b7fc4000 00:00 0 bffaf000-bffc5000 rw-p bffaf000 00:00 0 [stack] /bin/sh: line 1: 9510 Aborted (core dumped) LD_LIBRARY_PATH=/home/pjones/build/BUILD/Python-2.4.1: CC='gcc -pthread' LDSHARED='gcc -pthread -shared' OPT='-O2 -g -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=4 -m32 -march=i386 -mtune=pentium4 -fasynchronous-unwind-tables -D_GNU_SOURCE -fPIC -I/usr/kerberos/include ' ./python -E ./setup.py -q build make: *** [sharedmods] Error 134 -- Peter From dalcinl at gmail.com Sun Sep 11 00:07:35 2005 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Sat, 10 Sep 2005 19:07:35 -0300 Subject: [Python-Dev] PEP 3000 and iterators In-Reply-To: References: Message-ID: On 9/9/05, Guido van Rossum wrote: > > For the builtins, it would actually be possible to do this by simply > importing an alternate builtins module. Something like > > from future_builtins import min, max, zip, range > Yes. A straightforward solution... > For methods on standard objects like dicts it's not really possible > either way; the type of a dict is determined by the module containing > the code creating it, not the module containing the code using it. > I had that in mind when I wrote my post; changing types is not the way, that will not work. That is why I proposed __future__ (I really do not know very well the implementation details of that feature) because I think the parser/compiler can (magically) make the replacements, e.g. dict.items -> dict.iteritems for Py2.X series in codes *using* dicts . Do you think something like this could be implemented in a safer way? -- Lisandro Dalc?n --------------- Centro Internacional de M?todos Computacionales en Ingenier?a (CIMEC) Instituto de Desarrollo Tecnol?gico para la Industria Qu?mica (INTEC) Consejo Nacional de Investigaciones Cient?ficas y T?cnicas (CONICET) PTLC - G?emes 3450, (3000) Santa Fe, Argentina Tel/Fax: +54-(0)342-451.1594 From dalcinl at gmail.com Sun Sep 11 00:13:39 2005 From: dalcinl at gmail.com (Lisandro Dalcin) Date: Sat, 10 Sep 2005 19:13:39 -0300 Subject: [Python-Dev] Wanting to learn In-Reply-To: <1126254462.3888.5.camel@localhost.localdomain> References: <1126254462.3888.5.camel@localhost.localdomain> Message-ID: Jason, this mailing list is related to Python development. If you are a new at Python, a far better place for help is comp.lang.python group. Please go to Google Grups and take a look. If you do a search in those archives, you will find many good links. -- Lisandro Dalc?n --------------- Centro Internacional de M?todos Computacionales en Ingenier?a (CIMEC) Instituto de Desarrollo Tecnol?gico para la Industria Qu?mica (INTEC) Consejo Nacional de Investigaciones Cient?ficas y T?cnicas (CONICET) PTLC - G?emes 3450, (3000) Santa Fe, Argentina Tel/Fax: +54-(0)342-451.1594 From mark.dufour at gmail.com Sun Sep 11 00:36:41 2005 From: mark.dufour at gmail.com (Mark Dufour) Date: Sun, 11 Sep 2005 00:36:41 +0200 Subject: [Python-Dev] First release of Shed Skin, a Python-to-C++ compiler. Message-ID: <8180ef6905091015361d3ffdf7@mail.gmail.com> After nine months of hard work, I am proud to introduce my baby to the world: an experimental Python-to-C++ compiler. It can convert many Python programs into optimized C++ code, without any user intervention such as adding type declarations. It uses rather advanced static type inference techniques to deduce type information by itself. In addition, it determines whether deduced types may be parameterized, and if so, it generates corresponding C++ generics. Based on deduced type information, it also attempts to convert heap allocation into stack and static preallocation (falling back to libgc in case this fails.) The compiler was motivated by the belief that in many cases it should be possible to automatically deduce C++ versions of Python programs, enabling users to enjoy both the productivity of Python and the efficiency of C++. It works best for Python programs written in a relatively static C++-style, in essence enabling users to specify C++ programs at a higher level. At the moment the compiler correctly handles 124 unit tests, six of which are serious programs of between 100 and 200 lines: -an othello player -two satisfiability solvers -a japanese puzzle solver -a sudoku solver -a neural network simulator Unfortunately I am just a single person, and much work remains to be done. At the moment, there are several limitations to the type of Python programs that the compiler accepts. Even so, there is enough of Python left to be able to remain highly productive in many cases. However, for most larger programs, there are probably some minor problems that need to be fixed first, and some external dependencies to be implemented/bridged in C++. With this initial release, I hope to attract other people to help me locate remaining problems, help implement external dependencies, and in the end hopefully even to contribute to the compiler itself. I would be very happy to receive small programs that the compiler does or should be able to handle. If you are a C++ template wizard, and you would be interested in working on the C++ implementation of builtin types, I would also love to get in contact with you. Actually, I'd like to talk to anyone even slightly interested in the compiler, as this would be highly motivating to me. The source code is available at the following site. Please check the README for simple installation/usage instructions. Let me know if you would like to create ebuild/debian packages. Sourceforge site: http://shedskin.sourceforge.net Shed Skin blog: http://shed-skin.blogspot.com Should you reply to this mail, please also reply to me directly. Thanks! Credits Parts of the compiler have been sponsored by Google, via its Summer of Code program. I am very grateful to them for keeping me motivated during a difficult period. I am also grateful to the Python Software Foundation for chosing my project for the Summer of Code. Finally, I would like to thank my university advisor Koen Langendoen for guiding this project. Details The following describes in a bit more detail various aspects of the compiler. Before seriously using the compiler, please make sure to understand especially its limitations. Main Features -very precise, efficient static type inference (iterative object contour splitting, where each iteration performs the cartesian product algorithm) -stack and static pre-allocation (libgc is used as a fall-back) -support for list comprehensions, tuple assignments, anonymous funcs -generation of arbitrarily complex class and function templates (even member templates, or generic, nested list comprehensions) -binary tuples are internally analyzed -some understanding of inheritance (e.g. list(dict/list) becomes list >) -hierarchical project support: generation of corresponding C++ hierarchy, including (nested) Makefiles; C++ namespaces -annotation of source code with deduced types -builtin classes, functions (enumerate, sum, min, max, range, zip..) -polymorphic inline caches or virtual vars/calls (not well tested) -always unbox scalars (compiler bails out with error if scalars are mixed with pointer types) -full source code available under the MIT license Main Limitations/TODO's -Windows support (I don't have Windows, sorry) -reflection (getattr, hasattr), dynamic inheritance, eval, .. -mixing scalars with pointer types (e.g. int and None in a single variable) -mixing unrelated types in single container instance variable other than tuple-2 -holding different types of objects in tuples with length >2; builtin 'zip' can only take 2 arguments. -exceptions, generators, nested functions, operator overloading -recursive types (e.g. a = []; a.append(a)) -expect some problems when mixing floats and ints together -varargs (*x) are not very well supported; keyword args are not supported yet -arbitrary-size arithmetic -possible non-termination ('recursive customization', have not encountered it yet) -profiling will be required for scaling to very large programs -combining binary-type tuples with single-type tuples (e.g. (1,1.0)+(2,)) -unboxing of small tuples (should form a nice speedup) -foreign code has to be modeled and implemented/bridged in C++ -some builtins are not implemented yet, e.g. 'reduce' and 'map' From noamraph at gmail.com Sun Sep 11 01:54:08 2005 From: noamraph at gmail.com (Noam Raphael) Date: Sun, 11 Sep 2005 02:54:08 +0300 Subject: [Python-Dev] IDLE development Message-ID: Hello, More than a year and a half ago, I posted a big patch to IDLE which adds support for completion and much better calltips, along with some other improvements. Since then, I had some mail conversations with Kurt B. Kaiser, who is responsible for IDLE, which resulted in nothing. My last mail, from Jul 10, saying (with more details) "I made the minor changes you asked for, let's get it in, it's not very complicated" was unanswered. This is just an example of the fact that IDLE development was virtually nonexistent in the last months, because most patches were simply ignored. I and my colleges use IDLE intensively - that is, a heavily patched IDLE. It includes my patch and many other improvements made by me and my friends. The improved IDLE is MUCH better than the standard IDLE, especially for interactive work. Since we would like to share our work with the rest of the world, if nothing is changed we would start a new IDLE fork soon, perhaps at python-hosting.com. I really don't like that - maintaining a fork requires a lot of extra work, and it is certain that many more people will enjoy our work if it integrated in the standard Python distribution. But sending patches and watching them stay open despite a continuous nagging is worse. Please, either convince KBK to invest more time in IDLE development, or find someone else who would take care of it. If you like, I would happily help in the development. I hope I am not sounding offensive. It's actually quite simple: if the excellent development environment IDLE can't develop inside standard Python, it should be developed outside it. As I said, I prefer the first option. Have a good week, Noam Raphael From guido at python.org Sun Sep 11 04:11:05 2005 From: guido at python.org (Guido van Rossum) Date: Sat, 10 Sep 2005 19:11:05 -0700 Subject: [Python-Dev] IDLE development In-Reply-To: References: Message-ID: On 9/10/05, Noam Raphael wrote: > I and my colleges use IDLE intensively - that is, a heavily patched > IDLE. It includes my patch and many other improvements made by me and > my friends. > > The improved IDLE is MUCH better than the standard IDLE, especially > for interactive work. Could it be that this is a rather subjective judgement? It wouldn't be the first time that someone pushing for their personal set of functionality changes is overlooking the needs of other user groups. > Since we would like to share our work with the > rest of the world, if nothing is changed we would start a new IDLE > fork soon, perhaps at python-hosting.com. I have no problem with this. You might be able to save yourself some maintenance work by structuring your version as a set of subclasses rather than a set of patches (even if you distribute it as a complete working program). Many people have needs that aren't met by standard Python; they write their own modules or extensions and distribute these independently from Python; your case probably isn't all that different. Often the needs of certain user groups and the development speeds of such 3rd party modules are so different that it simply doesn't make sense to fold them in the Python distribution anyway -- consider what you would have to do if Kurt accepted your patches: you'll still have to wait until Python 2.5 is released before others can benefit from your changes, and if you come up with an improvement after that release, your next chance will be 18 months later... -- --Guido van Rossum (home page: http://www.python.org/~guido/) From guido at python.org Sun Sep 11 04:13:33 2005 From: guido at python.org (Guido van Rossum) Date: Sat, 10 Sep 2005 19:13:33 -0700 Subject: [Python-Dev] PEP 3000 and iterators In-Reply-To: References: Message-ID: On 9/10/05, Lisandro Dalcin wrote: > On 9/9/05, Guido van Rossum wrote: > > For methods on standard objects like dicts it's not really possible > > either way; the type of a dict is determined by the module containing > > the code creating it, not the module containing the code using it. > > I had that in mind when I wrote my post; changing types is not the > way, that will not work. That is why I proposed __future__ (I really > do not know very well the implementation details of that feature) > because I think the parser/compiler can (magically) make the > replacements, e.g. dict.items -> dict.iteritems for Py2.X series in > codes *using* dicts . Do you think something like this could be > implemented in a safer way? Please trust me. It can't be made to work. The compiler doesn't know the types of the variables so it doesn't know whether in a particular occurrence of the expression 'x.items", x is a dict or not. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From ncoghlan at gmail.com Sun Sep 11 05:10:56 2005 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 11 Sep 2005 13:10:56 +1000 Subject: [Python-Dev] IDLE development In-Reply-To: References: Message-ID: <4323A040.90504@gmail.com> Guido van Rossum wrote: > Often the needs of certain user groups and the development speeds of > such 3rd party modules are so different that it simply doesn't make > sense to fold them in the Python distribution anyway -- consider what > you would have to do if Kurt accepted your patches: you'll still have > to wait until Python 2.5 is released before others can benefit from > your changes, and if you come up with an improvement after that > release, your next chance will be 18 months later... Isn't separate distribution the way the *current* version of Idle was developed? I seem to recall it existing as IDLEFork for a long time so that it could have a more rapid release cycle before being rolled into the main distribution. This approach also allows a wider audience to asess the subjective benefits of any changes made - many more people will download and try out a separate IDE than will download and try out a patch to the main distribution. I'm such a one, even though I believe my main problems with Idle lie in the Tcl/tk toolkit (so I don't expect any application level changes to alter my opinion much). Regards, Nick. -- Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia --------------------------------------------------------------- http://boredomandlaziness.blogspot.com From victor.stinner-linux at haypocalc.com Sun Sep 11 05:16:23 2005 From: victor.stinner-linux at haypocalc.com (Victor STINNER) Date: Sun, 11 Sep 2005 05:16:23 +0200 Subject: [Python-Dev] Python code.interact() and UTF-8 locale Message-ID: <1126408583.12608.34.camel@haypopc> Hi, I found a bug in Python interactive command line (program python alone: looks to be code.interact() function in code.py). With UTF-8 locale, the command << u"?" >> returns << u'\xc3\xa9' >> and not << u'\xE9' >>. Remember: the french e with acute is Unicode 233 (0xE9), encoded \xC3 \xA9 in UTF-8. Another example of the bug: #-*- coding: UTF-8 -*- code = "u\"%s\"" % "\xc3\xa9" compiled = compile(code,' ',"single") exec compiled Result : u'\xc3\xa9' Excepted result : u'\xe9' After long hours of debuging (read Python documentation, debug Python with gdb, read Python C source code, ...) I found the origin of the bug: function parsestr() in Python/compile.c. This function translate a string to a unicode string (or a classic string). The problem is when the encoding declaration doesn't exist: the string isn't converted. Solution to the first code: #-*- coding: ascii -*- code = """#-*- coding: UTF-8 -*- u\"%s\"""" % "\xc3\xa9" compiled = compile(code,' ',"single") exec compiled Proposition: u"..." and unicode("...") should use sys.stdin.encoding by default. They will work as unicode("...", sys.stdin.encoding). Or easier, the compiler should use sys.stdin.encoding and not ascii as default encoding. Sorry if someone already reported this bug. And, is it a bug or a feature ? ;-) Bye, Haypo (who just have subscribed to the mailing list) From foom at fuhm.net Sun Sep 11 05:57:58 2005 From: foom at fuhm.net (James Y Knight) Date: Sat, 10 Sep 2005 23:57:58 -0400 Subject: [Python-Dev] PEP 3000 and iterators In-Reply-To: References: Message-ID: <3F65B538-4D54-4DC9-9136-9F6BAB941EC8@fuhm.net> On Sep 10, 2005, at 6:07 PM, Lisandro Dalcin wrote: > I had that in mind when I wrote my post; changing types is not the > way, that will not work. That is why I proposed __future__ (I really > do not know very well the implementation details of that feature) > because I think the parser/compiler can (magically) make the > replacements, e.g. dict.items -> dict.iteritems for Py2.X series in > codes *using* dicts . Do you think something like this could be > implemented in a safer way? > No, that cannot work. However, there is a very obvious and trivial solution. Do not remove dict.iteritems in Py 3.0. Py2.X programs wishing forward compat can avoid dict.items and use instead dict.iteritems. In Py3.0, dict.items becomes a synonym for dict.iteritems and programs that don't care about compat with 2.X can just use dict.items from then on. And everybody can be happy. A small number of redundant methods is a small price to pay for compatibility. James From noamraph at gmail.com Sun Sep 11 07:06:36 2005 From: noamraph at gmail.com (Noam Raphael) Date: Sun, 11 Sep 2005 08:06:36 +0300 Subject: [Python-Dev] IDLE development In-Reply-To: References: Message-ID: On 9/11/05, Guido van Rossum wrote: > On 9/10/05, Noam Raphael wrote: > > I and my colleges use IDLE intensively - that is, a heavily patched > > IDLE. It includes my patch and many other improvements made by me and > > my friends. > > > > The improved IDLE is MUCH better than the standard IDLE, especially > > for interactive work. > > Could it be that this is a rather subjective judgement? It wouldn't be > the first time that someone pushing for their personal set of > functionality changes is overlooking the needs of other user groups. > I don't think so, since: 1. These are added features, not functionality changes. 2. There are quite a lot of people using the improved IDLE where I work, and I never heard anyone saying he prefers the standard IDLE - on the contrary, many are asking how they can use the improved IDLE in their homes. 3. Kurt agreed to integrate the change - he just didn't do it. > > Since we would like to share our work with the > > rest of the world, if nothing is changed we would start a new IDLE > > fork soon, perhaps at python-hosting.com. > > I have no problem with this. You might be able to save yourself some > maintenance work by structuring your version as a set of subclasses > rather than a set of patches (even if you distribute it as a complete > working program). Many people have needs that aren't met by standard > Python; they write their own modules or extensions and distribute > these independently from Python; your case probably isn't all that > different. > I think that rewriting the patches as subclasses will be a lot of work, and won't be a very good idea - if you change one line in a function, copy-pasting it to a subclass and changing the line seems a little weird for me - not to mention the cases where some refactoring needs to be done. I think we will be talking about a separate package - say, idleforklib instead of idlelib. You can always run diff to find the differences between the two packages. > Often the needs of certain user groups and the development speeds of > such 3rd party modules are so different that it simply doesn't make > sense to fold them in the Python distribution anyway -- consider what > you would have to do if Kurt accepted your patches: you'll still have > to wait until Python 2.5 is released before others can benefit from > your changes, and if you come up with an improvement after that > release, your next chance will be 18 months later... > I don't think so - if IDLE is developed on the Python CVS, we can still distribute a stand-alone package with IDLE from the CVS head, for eager people. All others will get the changes a year later, which isn't that bad. Perhaps it can even be less than a year - since IDLE is a GUI application and not a library, so there isn't a lot of backward compatibility to maintain, it seems to me that updated versions can be shipped also with new minor versions of Python. The advantages of developing IDLE on the Python CVS are that there is no need to synchronize two versions, and a wider audience. Of course, after you see the improved IDLE you will surely decide to immediately import it into the Python CVS, so there's not much of a problem... :) Noam From noamraph at gmail.com Sun Sep 11 07:13:53 2005 From: noamraph at gmail.com (Noam Raphael) Date: Sun, 11 Sep 2005 08:13:53 +0300 Subject: [Python-Dev] IDLE development In-Reply-To: <4323A040.90504@gmail.com> References: <4323A040.90504@gmail.com> Message-ID: On 9/11/05, Nick Coghlan wrote: > Guido van Rossum wrote: > > Often the needs of certain user groups and the development speeds of > > such 3rd party modules are so different that it simply doesn't make > > sense to fold them in the Python distribution anyway -- consider what > > you would have to do if Kurt accepted your patches: you'll still have > > to wait until Python 2.5 is released before others can benefit from > > your changes, and if you come up with an improvement after that > > release, your next chance will be 18 months later... > > Isn't separate distribution the way the *current* version of Idle was > developed? I seem to recall it existing as IDLEFork for a long time so that it > could have a more rapid release cycle before being rolled into the main > distribution. Yes, it is. I answered on the way to maintain a more rapid release cycle of IDLE when developed in the Python CVS on my post in reply to Guido. > > This approach also allows a wider audience to asess the subjective benefits of > any changes made - many more people will download and try out a separate IDE > than will download and try out a patch to the main distribution. I'm such a > one, even though I believe my main problems with Idle lie in the Tcl/tk > toolkit (so I don't expect any application level changes to alter my opinion > much). Can you please explain what are these problems? A big problem with Tcl/tk is that only one function call can be triggered by an event, and I solved it for IDLE by writing a wrapper around Tkinter classes, which calls all binded function calls on an event. This, for example, allows the yellow CallTip windows to disappear when the IDLE window loses focus, instead of staying above all other windows. Thanks, Noam From skip at pobox.com Sun Sep 11 16:56:02 2005 From: skip at pobox.com (skip@pobox.com) Date: Sun, 11 Sep 2005 09:56:02 -0500 Subject: [Python-Dev] Replacement for print in Python 3.0 In-Reply-To: <8393fff05090913353f6133dc@mail.gmail.com> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <877jdryik1.fsf@tleepslib.sk.tsukuba.ac.jp> <1126180081.882.9.camel@p-dvsi-418-1.rd.francetelecom.fr> <8393fff05090913353f6133dc@mail.gmail.com> Message-ID: <17188.17794.574463.31979@montanaro.dyndns.org> (Maybe someone else has already raised this point. I'm a bit behind.) Martin> Here goes something: for applications targeted to the web, where Martin> newlines don't matter, the line breaks in _()'ed strings are Martin> superfluous. How will you know you're generating output that goes between

and

(where newlines do matter)? Skip From guido at python.org Sun Sep 11 17:24:53 2005 From: guido at python.org (Guido van Rossum) Date: Sun, 11 Sep 2005 08:24:53 -0700 Subject: [Python-Dev] PEP 3000 and iterators In-Reply-To: <3F65B538-4D54-4DC9-9136-9F6BAB941EC8@fuhm.net> References: <3F65B538-4D54-4DC9-9136-9F6BAB941EC8@fuhm.net> Message-ID: On 9/10/05, James Y Knight wrote: > No, that cannot work. However, there is a very obvious and trivial > solution. Do not remove dict.iteritems in Py 3.0. Py2.X programs > wishing forward compat can avoid dict.items and use instead > dict.iteritems. In Py3.0, dict.items becomes a synonym for > dict.iteritems and programs that don't care about compat with 2.X can > just use dict.items from then on. And everybody can be happy. A small > number of redundant methods is a small price to pay for compatibility. But it breaks the desire to keep the Python 3.0 language clean from deprecated features. Given that I don't expect there will be much compatibility *anyway*, I don't want to promise this. I expect that we'll have to write a source-level translator -- which could replace all iteritems() calls to items(), for example. Such a source-level translator may not be able to reach perfection, but it should take care of the tedious tasks and leave the rest up to manual polishing. This doesn't mean that there's no point in trying to introduce certain 3.0 features in 2.x; it's always good to have early experience with a new feature, and in some cases it *will* improve forward compatibility. But just installing python3.0 as python and expecting nothing will break is not a goal -- it would be too constraining. -- --Guido van Rossum (home page: http://www.python.org/~guido/) From foom at fuhm.net Sun Sep 11 18:09:26 2005 From: foom at fuhm.net (James Y Knight) Date: Sun, 11 Sep 2005 12:09:26 -0400 Subject: [Python-Dev] PEP 3000 and iterators In-Reply-To: References: <3F65B538-4D54-4DC9-9136-9F6BAB941EC8@fuhm.net> Message-ID: <9C872F06-AED6-49FC-BFF1-AD56C84493D5@fuhm.net> On Sep 11, 2005, at 11:24 AM, Guido van Rossum wrote: > But it breaks the desire to keep the Python 3.0 language clean from > deprecated features. That is a nice goal, another nice goal is to not unnecessarily break things. > But just installing python3.0 as python and expecting > nothing will break is not a goal -- it would be too constraining. > Just to be clear, I do not want nor expect this. I wish to be able to specifically modify code with full knowledge of what has changed in Py3.0 such that it will work with both Py2.X and Py3.0. And, now is probably not really the right time to discuss such minor issues as whether to keep iteritems in Py3.0, but, if it is kept, it becomes easier to write such code. It is of course still possible to write compatible code without keeping iteritems, you just have to replace all the method calls with a function wrapper which calls one of items or iteritems depending on the version. James From martin.blais at gmail.com Sun Sep 11 18:35:08 2005 From: martin.blais at gmail.com (Martin Blais) Date: Sun, 11 Sep 2005 12:35:08 -0400 Subject: [Python-Dev] pygettext() without newlines (Was: Re: Replacement for print in Python 3.0) In-Reply-To: <17188.17794.574463.31979@montanaro.dyndns.org> References: <874q8xzexl.fsf@tleepslib.sk.tsukuba.ac.jp> <87wtltxmd7.fsf@tleepslib.sk.tsukuba.ac.jp> <1126094808.12806.2.camel@presto.wooz.org> <877jdryik1.fsf@tleepslib.sk.tsukuba.ac.jp> <1126180081.882.9.camel@p-dvsi-418-1.rd.francetelecom.fr> <8393fff05090913353f6133dc@mail.gmail.com> <17188.17794.574463.31979@montanaro.dyndns.org> Message-ID: <8393fff0509110935b12cbb9@mail.gmail.com> On 9/11/05, skip at pobox.com wrote: > > (Maybe someone else has already raised this point. I'm a bit behind.) > > Martin> Here goes something: for applications targeted to the web, where > Martin> newlines don't matter, the line breaks in _()'ed strings are > Martin> superfluous. > > How will you know you're generating output that goes between

 and
>

RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4