About the fixation of syntax to concepts

Apr 19, 2010 13:04

Coders seem to be unable to distinguish the concepts of a programming language from its syntax. One coder might argue for Python on the grounds that it has certain innovative language features, but another might “refute” this by arguing about the shortcomings of whitespace-based syntax. The two are talking about entirely separate things ( Read more... )

Leave a comment

pne April 19 2010, 15:47:53 UTC
Perhaps a better example would be "syntactic sugar" -- for example, in Perl, while (<>) { ... } is equivalent to while (defined($_ = <>) { ... }, and in Java, java.util.Collection varlist = ...; for (Integer var : varlist) { var = var + 1; } is short for something like java.util.Collection varlist = ...; java.util.Iterator $tmpiter = varlist.iterator(); while ($tmpiter.hasNext()) { Integer var = (Integer) $tmpiter.next(); var = new Integer(var.intValue() + 1); }.

Sometimes you'd want to see autoboxing and auto-unboxing; sometimes you'd like to see explicit casts resulting from generics; sometimes you'd like to see "foreach" with explicit iterators -- but often, you might not want to see those things. (After all, part of the reason for syntactic sugar is to make code more readable by encapsuling a common idiom into a shorter form.)

Since syntactic sugar is fairly 1:1, it's possible to abbreviate always or to always show the long form, but it's not possible to mix the two (sometimes short, sometimes long).

Or compare the effect of Java import or (I think) C# using; I don't think this is reflected in the generated "object code", so it would probably be missing in the AST as well. But I wouldn't want to read code that always used fully-qualified package names such as java.util.Collection or System.String -- yet if you use, say, java.util.Date and java.sql.Date, then the "decompiler" can't know which of the two it can abbreviate using import.

So, perhaps people will get used to the limitations of automatic retranslation, but I think they will - at least initially - be perceived as limitations.

Reply

timwi April 19 2010, 17:40:50 UTC
The meanings of “while (<>)” and “while (defined($_ = <>))” may be the same, but the parse tree certainly isn’t. I used the term AST assuming it is the same thing as parse tree, but if the two terms are used slightly differently, maybe I should edit the post and call it parse tree. I’m thinking of the result of the very first step in a compiler, which is a simple parse according to a meaning-agnostic grammar - no substitutions of syntactic sugar or expansion of fully-qualified names. It is probably the case that many compilers do such expansions/substitutions at the same time as parsing, but there’s no reason why it can’t be separated and you could still have a parse tree that contains the import/using clauses and that distinguishes the for/foreach loop from its expanded iterator pattern. There is no “automatic retranslation”, most certainly not from “object code” (compiled binaries).

Reply


Leave a comment

Up