InfoQ

News

The IronPython, Unicode, and Fragmentation Debate

Posted by Jonathan Allen on Jun 15, 2007 06:50 AM

Community
.NET
Topics
Dynamic Languages
Tags
IronPython

Unlike the canonical implementation, IronPython implements the str class as Unicode rather than an ASCII byte stream. While some are saying this is a serious fragmentation issue, others say it is no big deal.

The root of the issue is whether or not the str class must to be implemented as ASCII. While ASCII is severely limited in terms of supporting non-English languages, it can do so using code pages. ASCII strings can also be used to represent binary data as byte streams.

In non-canonical implementations like IronPython and JPython, the str class is implemented using the platform defined String class. In this case, that means Unicode strings. While this will not affect most applications, those that use str for non-English code pages or as byte arrays won't work.

Calvin Spealman argues this is a serious problem:

IronPython takes the syntax, but stops short of the language. The problem is one for both Python and IronPython lovers. In Python land, we're seeing what appears to be an influx of interest from the IronPython (also, via Silverlight) world, but all those new developers are creating completely incompatible code. IronPython advocates, on the other hand, look silly to think they are promoting the Python language, and are completely missing out on hundreds of great libraries, years of built up community, and synergy that isn't just a buzzword.

Response from the community have not been quite so negative. Michael Foord responds:

To make it clear - at Resolver we have a 'large' IronPython application, which uses many modules from the Python standard library as well as third party Python libraries - and it works *great*.

Manuel also defends the decision:

First of all, Python, CPython, IronPython, Jython, PyPy are all changing entities. As changing entities, we can only legitimately criticize their trajectories. I see zero evidence that IronPython will "fork" the Python community. So much activity is being spent to make the Python library run in IronPython, in the FePy community and in the Microsoft sanctioned community.

Even Guido van Rossum, the creator of Python himself, stepped in:

You realize that Jython has exactly the same str==unicode issue, right? I've endorsed this approach for both versions from the start. So I don't know what you are so bent out of shape about.

For the time being, it looks like the Python community is willing to accept some incompatibilities in exchange for support on more platforms.

3 comments

Reply

Check out the pythonic-syntax Boo .NET language. by Cedric Vivier Posted Jun 15, 2007 2:47 PM
How about a Python section in InfoQ? by Zeev B Posted Jun 19, 2007 1:16 AM
Re: How about a Python section in InfoQ? by berkay NiQuiL Posted Jun 30, 2008 3:37 PM
  1. Back to top

    Check out the pythonic-syntax Boo .NET language.

    Jun 15, 2007 2:47 PM by Cedric Vivier

    If you do not know it, maybe you should try the Boo Language for .NET instead of IronPython. Boo is a new object oriented statically typed programming language for the .NET/Mono with a python inspired syntax and a special focus on language and compiler extensibility. Thought from the ground-up for the .NET/Mono world everything is Unicode by default. Oh, I forgot to add, no more "self"-plague in your classes YAY! :)

    class MyClass:
        y = 1
    
        def AddToFieldY(x):
            y += x
    
    c = MyClass()
    c.AddToFieldY(2)
    print c.y // displays 3
    
    The official website is here: http://boo.codehaus.org/ Oh, and if you like it, please join the community on the mailing-list :)

  2. Back to top

    How about a Python section in InfoQ?

    Jun 19, 2007 1:16 AM by Zeev B

    I see there is a Ruby section in InfoQ - Why not add a python section also?

  3. Back to top

    Re: How about a Python section in InfoQ?

    Jun 30, 2008 3:37 PM by berkay NiQuiL

Exclusive Content

Rationalizing the Presentation Tier

Thin client paradigm characterized by web applications is a kludge that needs to be repudiated. Old compromises are no longer needed and it's time to move the presentation tier to where it belongs.

Agile Project Management: Lessons Learned at Google

In this presentation filmed during QCon 2007, Jeff Sutherland, the creator of Scrum, talks about his visit at Google to do an analysis of Google's first implementation of Scrum.

AtomServer – The Power of Publishing for Data Distribution

In this article, Bryon Jacob and Chris Berry introduce AtomServer, their implementation of a full-fledged Atom Store based on Apache Abdera, which is now available as open source.

An Introduction to Virtualization

It is easy to think that virtualization applies only to servers. In reality the recent resurgence of the concept is also being applied to networking, storage, and application infrastructure.

REST Anti-Patterns

In this article, Stefan Tilkov explains some of the most common anti-patterns found in applications that claim to follow a "RESTful" design and suggests ways to avoid them.

Choosing between Routing and Orchestration in an ESB

In this article, Adrien Louis and Marc Dutoo discuss the differences and relative merits of using orchestration vs. routing in a typical ESB setup, and discuss various implementation options.

Enterprise Batch Processing with Spring

Wayne Lund discusses batch processing, Spring Batch objectives and features, scenarios for usage, Spring Batch architecture, scaling, example code, failures and retrying, and the future roadmap.

User Story Estimation Techniques

Developer Jay Fields draws on his experiences as a ThoughtWorks consultant to describe effective user story estimation techniques.