Things Rust shipped without (graydon2.dreamwidth.org)
408 points by AndrewDucker on July 3, 2015 | hide | past | favorite | 316 comments


> goto (not even as a reserved word)

I haven't done this for a while, but once upon a graduate program I wrote a compiler from a made-up-language (MUP) to C. MUP had some strange control structures, and if C did not have "goto", it would have been a lot more difficult to implement those structures. Since then, I have always thought languages should have a "goto" statement that human-written code is not allowed to use. :)


It often simplifies a lot of the compiler implementation if you only have reducible control flow. LLVM is fine with irreducible control flow, because it has to handle C, but rustc uses a CFG for the borrow checker, which is flow-sensitive. You can convert irreducible control flow to reducible control flow, but it can explode the size of the graph in pathological cases.

(I don't recall whether the borrow checker actually depends on the control flow being reducible, but some of the possible future improvements we've talked about with "non-lexical lifetimes" definitely do. There are also nasty interactions between RAII and irreducible control flow…)


Non-lexical lifetimes don't depend on reducibility. The RFC I wrote works fine for irreducible CFGs:

https://github.com/rust-lang/rfcs/pull/396

It does have the effect of picking a single dominating entry point for a loop with multiple entry points, but if you want a single-entry region that's necessarily true. Multiple-entry regions are probably possible, but they could be counterintuitive and produce even stranger error messages than the current region system.

The borrow checker is based on dataflow analyses that should work on arbitrary CFGs, assuming an appropriately generalized notion of region.


I think the point of the parent comment is "it's fine not to have goto in Rust, but merely not having goto is nothing to be proud of". Goto is an important enough operator that it doesn't deserve blind hate.


Agreed... I honestly tend to break code into a lot of discrete functions and combine them for workflows, but sometimes you just need to easily jump back a few places, and where goto is available it's not always a bad option. Just one that should be used sparingly: once in a complex workflow is fine; more than twenty times in a few-thousand-line method, not so much.


Don Knuth, "Structured Programming with go to Statements":

"Just recently, however, Hoare has shown that there is, in fact, a rather simple way to give an axiomatic definition of go to statements; indeed, he wishes quite frankly that it hadn't been quite so simple....

"Informally, α(L) represents the desired state of affairs at label L; this definition says essentially that a program is correct if α(L) holds at L and before all "go to L" statements, and that control never "falls through" a go to statement to the following text. Stating the assertions α(L) is analogous to formulating loop invariants. Thus, it is not difficult to deal formally with tortuous program structure if it turns out to be necessary; all we need to know is the "meaning" of each label."


Yeah. Even basic things like building SSA can be done more simply on "structured" CFGs.

The benefits are enough that data structure and algorithm designs in the JVM compiler world often take advantage of assuming reducible control flow even though Java bytecode can express irreducible CFGs. Instead, such programs are left to the interpreter.


Rust's CFGs are reducible with unbounded treewidth, not structured, because Rust has named exits from loops that nest arbitrarily.

Most problems on reducible graphs are not easier than for general graphs, because you can always compute a loop forest (for generalized loops, not natural loops with a single entry point) and just consider a derived acyclic graph.


The structured SSA building algorithm I had in mind has a fairly simple extension that covers break, continue, and early return. It's more complicated, but still way simpler than iterated dominance frontiers.

> you can always compute a loop forest

It's easier to not have to.


Technically speaking, break / continue / early return are unstructured by the classical definition. The algorithm you are likely thinking of can easily be extended to handle arbitrary control-flow:

https://pp.info.uni-karlsruhe.de/uploads/publikationen/braun...

In the case of irreducible control-flow it may produce redundant cyclic phis, but those can be eliminated with a pretty simple post-pass.


At least reserve the word, in case you change your mind in the future? I guess it's too late now...


Burn the boats and never look back, I guess.


Since when does Rust care about breaking changes to the language?


Seven weeks ago today.


For version 1? Can we break on 2.0?



> You can convert irreducible control flow to reducible control flow, but it can explode the size of the graph in pathological cases.

True, but only in node-splitting approaches. If you use a label threading variable (as emscripten's relooper does), there is a guaranteed reasonable limit on code size increase (at the cost of performance).


I've always defended goto and gotten a lot of flak for it. As soon as I say, "I wish X-language had gotos", I see jaws drop. Response: "Wow, haven't you heard the news?! GOTOs are considered harmful!"

I think goto should be in almost every language. It's one of the most primitive instructions; why shouldn't it be available when needed? Yes, it can be misused, just like any other feature in the language, but it can also be used to great benefit. State machines, for example, can make good use of gotos.

Hell, look at any unix library your machine uses every day. The ones you don't even think about. Start with libncurses: I promise there's a few gotos where needed.

    $ find ~/ncurses-5.9 -name '*.c' -print0 | xargs -0 grep goto | wc -l
    70
Raise your hand if you plan to stop using ncurses because of how "harmful" its goto statements are.

Oh, here's tmux, if you're interested (one of the most beautifully written C programs): https://github.com/tmux/tmux/search?utf8=%E2%9C%93&q=goto


Rust is not a language that looks favorably on things that are easy to misuse and hard to use correctly. You can make the same argument about, say, untagged / C-style unions, which are much more common in C, but Rust doesn't have those either.

I don't plan to stop using ncurses in particular, but one of the reasons I am supportive of Rust is that I would like to stop using all C software sometime in my lifetime.


Rust should have untagged unions. Obviously, ones that allow pointer abuse would be restricted to code marked unsafe.

Note that a union of two pointer-containing structs is safe as long as the pointers line up, having the same type and offset in each struct.


They've been proposed:

https://github.com/rust-lang/rfcs/pull/724

If you really need them, you can implement them in a library by declaring a suitably sized byte array and having methods that return cast pointers to it. This would be illegal in C due to strict aliasing, but IIRC Rust does not have a strict aliasing rule (because almost all pointers being the equivalent of 'restrict' drastically reduces the benefit).

There are issues with size_of not being a constant expression and such (at least in stable), but those are definitely going to be fixed.


Do you have a compelling use case? I can't imagine a use for untagged unions in Rust that isn't subsumed by other features in the language.


The most common use case would be interop with C libraries that use untagged unions. This is a significant hassle when writing C bindings in Rust today.


I can think of many, mostly revolving around cases where the tag is not stored inline with the structure in question. Where space efficiency is important this is quite common. C interoperability is obviously another important use case (the current solution of passing around [u8] arrays is pretty atrocious).


> It's one of the most primitive instructions, why shouldn't it be available when needed?

I see what you're saying, though I don't find this a compelling argument. By this logic, why not allow direct register access in all languages?

Just because something is a primitive operation doesn't mean you want to include it, especially not if it's more difficult to enforce guarantees your language would like to make about valid programs.


All of the gotos I see on that page are simulations of RAII or labeled break/continue.


The kernel has plenty of goto, mostly (as far as I know) to handle errors and safe resource deallocation. There is no reason to keep using GOTO if you have better mechanisms to handle these things.


The target of a goto statement is a specific place in code. It is not code that refers to the task you are trying to perform.

The main advantage of using higher-level languages is that you can talk about manipulations of data. Goto doesn't manipulate data, it manipulates the machine. And if you want to _really_ manipulate the machine, you're going to want more than just goto.

>Raise your hand if you plan to stop using ncurses because of how "harmful" its goto statements are.

Goto statements are not harmful to computers or to code. Ncurses isn't what is harmed by goto, nor does "using ncurses" imply any interaction with goto at all.

Muhammad Ali was a good boxer. He was Muslim. Am I to conclude that one must be Muslim to be a good boxer, now?

Yes, ncurses is a good program. Yes, it uses goto. But that doesn't imply that goto is necessary to its being a good program, or that it is even a mildly efficient way of doing it well. (Especially not when there is a whole universe of alternatives.)

If you're going to defend goto (and there are reasons to do this) you should probably do so without employing such blatant logical fallacies. It's irresponsible and detracts from your point.


C's goto has a bad rep it only partially deserves; however, I can't agree that it should be in every language. I use it fairly frequently, but virtually all the uses are about cleanup after an error. I like that it requires a unique label for identification, as it allows for language-enforced documentation. But my gotos always go "down" the source code, never "up". Through goto I can basically set up an error-condition code flow for every function, with multiple conditional entry points into that flow, and if there were another language feature that allowed just this particular usage in cleanup, without multiple nested scopes and with clear labelling, then I would see no reason for keeping goto. Throughout the years we developed a way to neatly use an unsafe feature, but it's always better to have such things enforced by language, not by tradition.


I do not use goto in my code. What am I doing wrong? :) Ok, I have to admit that I used it on the C64 in the 80s.

Anyways, exceptions are sort of gotos or at least they can behave that way.


If you're not implementing fast FSMs or fast bytecode interpreters (and dynamic dispatchers in general), not using metaprogramming to its full power, and not implementing multiple embedded DSLs, then you can live without goto. Otherwise it is essential.


Metaprogramming to the full power sounds like Lisp. It doesn't have goto.


> Metaprogramming to the full power sounds like Lisp.

It does, yes, although many Lisps have a pretty limited backend for this sort of thing. My preferred metaprogramming environment must have a fallthrough mode allowing the generation of low-level code where Lisp semantics are not sufficient. Rust seems like a very good target platform in this sense, so the lack of goto really hurts.

> It doesn't have goto.

Of course they do (tagbody in Common Lisp, for example). And when they don't, it's often relatively easy to add one.


> Of course they do (tagbody in Common Lisp, for example). And when they don't, it's often relatively easy to add one.

I was wrong; however, it is much more limited than the goto-everywhere of C, since where the labels may be located is clearly defined by the tagbody rather than being any possible statement.


No, exceptions are incomparably more harmful, because they don't have a destination; they just run hysterically, crashing everything in their way.


I said "sort of"...


> Raise your hand if you plan to stop using ncurses because of how "harmful" its goto statements are.

What if I just avoid contributing to the ncurses codebase? I've used plenty of useful tools with absolutely horrific codebases that I'd never want to touch in a million years. Not sure if ncurses is one of them.

The whole "it gets used, ergo it must be a good idea" argument doesn't carry much weight with me, even if I do think that in C, using goto to enforce single-exit style and work around the lack of RAII constructs makes it the lesser evil.

I started with GOTO in BASIC. I used it a lot. It structured my initial reasoning about control flow. Despite this, in the past few years I've used a naked goto maybe once or twice, and in all cases later rewrote it without the goto, which in my opinion increased its readability. (I generally always have the option of C++ over C, and choose it, rendering single-exit style 'useless'.)

> Oh, here's tmux, if you're interested (one of the most beautifully written C programs): https://github.com/tmux/tmux/search?utf8=%E2%9C%93&q=goto

Most of those are single exit style gotos. Those that aren't, do cause some concern, despite being "one of the most beautifully written C programs", even to their original author from the looks of it:

  if (errno == ENOMEM)
    goto retry; /* possible infinite loop? */
For what it's worth, it seems unlikely to be an infinite loop, short of encountering a bug in sysctl, or another process/thread constantly adding data. I had to google the header path to find an appropriate manpage (i.e. not _sysctl, not sysctl the program) to figure this out...

I've encountered worse edge cases before, however, and I'd really prefer my programs crash properly, instead of hanging when they do.

> State machines, for example, can make good use of gotos.

The performance complaint about an additional branch misprediction when using the "for(;;) switch(...)" style without gotos is one of the few arguments that moves me, if only slightly. That seems like a case your standard optimizing compiler really should be able to handle, however. I'll assume they don't, as I'm too lazy to test whether this is merely hearsay...

That said, I'll even use "goto case" in C# on occasion where I'd use case fallthrough in C++, if I'm feeling particularly lazy and don't want to turn things into proper method calls that can simply call each other just yet. I usually clean it up before I start to confuse myself.

It's not something I'd miss if it were gone, however. It's something I use only rarely, and only as a crutch to stave off cleanup. Not exactly a ringing endorsement.

EDIT: Code formatting, proper insertion of subject...


I wish C/C++ had a labeled break construct, like JavaScript, Java, Rust, and other languages have. It's surprisingly powerful, while still remaining structured. I personally have enjoyed learning about it, and about just how rarely an actual goto is really needed.


I've never had to use 'goto' in C++ except to break from a nested loop. In C++ labeled breaks would make 'goto' completely obsolete.

In C it would still have use in the implementation of orderly error handling -- the pattern where you hand-implement exception handling in C by putting an on_error: label at the end of the function that is goto'd on error. The addition of some orderly construct for this in C would eliminate that case, leaving no real role for goto there either.


A bytecode interpreter is another place where it's nice to have gotos.

Here's the base code without gotos:

  typedef enum { ADD, MUL, ..., END } opcode;

  void run() {
    opcode ins;

    while (1) {
      ins = fetch_next_inst();
      switch (ins) {
        case ADD:
          perform_addition();
          break;
        case MUL:
          perform_multiplication();
          break;
        ...
        case END:
          wrap_up();
          return;
      }
    }
  }
You have 3 jumps on each loop. From the break to the end of the loop, then from the end to the top, and one from the switch to the right case. The first one might be optimized away, but let's remove it explicitly.

  typedef enum { ADD, MUL, ..., END } opcode;

  void run() {
    opcode ins;

    start:
    ins = fetch_next_inst();
    switch (ins) {
      case ADD:
        perform_addition();
        goto start;
      case MUL:
        perform_multiplication();
        goto start;
      ...
      case END:
        wrap_up();
        return;
    }
  }
Assuming a non lousy compiler, we haven't improved anything yet. But now the fun starts. We can go down to one jump for each iteration.

  typedef enum { ADD, MUL, ..., END } opcode;

  #define NEXT() \
  do { \
    ins = fetch_next_inst(); \
    goto *jump_table[ins]; \
  } while(0)

  void run() {
    opcode ins;
    static void *jump_table[] = { &&add_l, &&mul_l, ..., &&end_l };

    NEXT();
    add_l:
      perform_addition();
      NEXT();
    mul_l:
      perform_multiplication();
      NEXT();
    ...
    end_l:
      wrap_up();
      return;
  }
Voilà! A single jump every time around. Now, depending on what kind of architecture you're running on, the size of the cache, etc., this may or may not be faster.

Granted, this is not the kind of code you write everyday. But sometimes speed matters, and good luck writing this without gotos.


I really dislike 'clever' code like this, even when speed is paramount you will find that by obscuring the flow you make it harder, not easier to really optimize the code.

Over time it tends to evolve into ever messier and harder to understand versions of the initial run after which a future maintainer will end up losing sleep and or hair chasing some production bug.

Consider this (slower!) much clearer alternative, and consider too that if you need to eliminate one goto for speed reasons that you're most likely doing something wrong:

  typedef enum { ADD, MUL, ..., END } opcode;

  int process_instruction(opcode ins) {

      switch (ins) {
        case ADD:
          perform_addition();
          break;

        case MUL:
          perform_multiplication();
          break;
        ...
        case END:
          return FALSE;
      }

      return TRUE;
  }

  void run() {

    do {
    } while (process_instruction(fetch_next_inst()));

    wrap_up();
  }
That's a whole function call of overhead (though you can eliminate that with 'inline'); it's easier to test and much easier to follow what it does.

It would be interesting to see what the actual difference is in speed when comparing those two versions, I suspect that the contents of 'perform_addition' and 'perform_multiplication' are going to be the key here, not whether or not the loop uses a short-cut or an instruction (or two) less. Oh, and you could have eliminated that 'ins' variable.


I imagine that if you have more instructions (a few hundred) the overhead of the switch starts to become significant. At 256 opcodes you need 8 jumps, and hence 8 CPU cycles. That's non-trivial, especially if most of the opcodes can be implemented more quickly than this. For example, a quick Google search says that for an Intel CPU, addition is 1 cycle and multiplication is 3 cycles.


Switches over enums are usually compiled to jump tables. What you see in source and what you get after the optimizer is done with it are sometimes very far apart. Looking at the output of gcc -S can be very enlightening at various levels of optimization.


Since you asked, there's an assembly comparison on pages 5-6 of http://www.jilp.org/vol5/v5paper12.pdf and a performance comparison on page 12 of http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.90.... . Note that the "inline threading" in the second paper is a further optimization that hasn't been brought up here yet.

In summary, the assembly of the goto version is much shorter and there's a 10% speedup over SPEC JVM98. The optimization works out because: a) instruction bodies are often short; b) instruction dispatch is the hottest code path of any interpreter; c) it's orthogonal to all or almost all other optimizations.


That's about what I expected, contrary to the 2-4x speedup mentioned above. Still, I can imagine in isolated cases you really would get a better number than that 10%. So, for code where performance is super important and where the effort of the optimization and lack of transparency is outweighed by the performance boost, this makes sense.


Yes, that 10% is an average over seven benchmarks. The numerical one gets a 25% speedup, the one with a bunch of exceptions doesn't benefit at all. The funny thing is, if you're interested in performance, you're probably going to be writing a JIT, so this optimization is really just a stop-gap measure. In fact, the inline threading I mentioned is a kind of cheap JIT: it copies the in-memory executable code from the instructions at these goto labels to form basic blocks in an executable CFG. That gets you 1.6x speedup on those benchmarks, but it's a lot more work.


> The funny thing is, if you're interested in performance, you're probably going to be writing a JIT, so this optimization is really just a stop-gap measure.

Exactly. And stop-gap measures are fine if and when they're followed up by a proper solution. Unfortunately stop-gap measures tend to be a lot more permanent than originally intended in practice.


Okay, but a much more readable and maintainable way to achieve this same optimization is with function pointers.

In your example:

  void (*function_table[])(void) = {
      &perform_addition,
      &perform_multiplication,
      ...
  };

  while (1) {
      ins = fetch_next_inst();
      (*function_table[ins])();
  }
Edit: Seeing it written like this now, it's clear you could save yet another jump by defining a macro for what's in the while loop and putting it at the end of every function.


That's an additional pointer-chase per loop. And more function prefix / suffix work. In actuality, I suspect that it'd be optimized out - but you cannot say "you cannot do X because it relies on compiler optimizations" and replace that with Y that relies on compiler optimizations.


A function call is nothing more than putting your return address and parameters on the stack and jumping to the address of the function. By referring directly to the function's address, there's no "additional pointer chase", since calling the function already does exactly that.

If you were to inline all of the perform functions in the GOTO version vs putting a macro at the end of each function, you're right that there's some function overhead, but I think it's as small as a single instruction to put the return address on the stack. Maybe that would be optimized away, maybe not.

To your point: My argument isn't that you can't do the GOTO version. With optimizations, it's essentially identical. My point is, that is a lot more hard-to-grok code to maintain for something that can be achieved in a simpler way.


You're forgetting the overhead of popping / pushing registers. Both in terms of direct instruction overhead, and indirectly through code size and working set bloat. Which, especially for smaller functions, can be significant. It's one of the problems with a register-oriented architecture.

Sometimes this can be optimized away, but not always.


~20 years of using various combinations of C and C++, and I never knew you could take the address of a label! Reminds me of computed GOSUBs from my BASIC days :)



There is actually a neat feature where you can take the address of the thing you're assigning in a static context:

static void* address = &address;

This is helpful for things that take void* context pointers that you want to match with logical equality.


And they say C is a simple language... :)

To be fair, I've seen code like this exactly once.


They say C is a low level language. That is not the same as simple.


Who says that?


Before you use such constructs make sure your team lead or boss is ok with it, it's incompatible with many compilers and ugly to boot. This is the kind of code that gives C its bad reputation.


This is the kind of code which makes OCaml the fastest bytecode interpreter out there, and selection between an ad hoc switch and a computed goto is done transparently with a single ifdef.


Yes, there may be exceptions when this kind of code is preferable. But it's definitely not the rule and the speed difference between the one and the other is so small that only a profiler can guide you to optimizations like these.


> Yes, there may be exceptions when this kind of code is preferable.

In my practice these exceptions are ubiquitous. Multiple tiny interpreted DSLs (which cannot be compiled more efficiently for the latency reasons), efficient protocols, all that stuff. System programming, in other words, and it's exactly the stuff Rust was supposed to be designed for.

> But it's definitely not the rule and the speed difference between the one and the other is so small

A 2-4x difference is not "small" in systems-level programming.


2-4x is a spectacular speed-up; of course the inner loop of an interpreter is important, but you're implying the rest of the code path is only 3-4 instructions long (otherwise you can't get that kind of speed-up), which I find hard to believe. Even the inner loop of an interpreter usually calls routines that are longer than the inner loop itself. Is there a particular benchmark you have in mind for that 2-4x number?


Take a look at the OCaml bytecode interpreter: case bodies are all very small there, so the instruction dispatch time really counts. I got this 2x difference when comparing this bytecode interpreter built with and without computed goto, but I don't remember any other details at the moment. In my own bytecode interpreters the difference was up to 4x, again, because each bytecode instruction implementation was tiny and trivial (like moving something from one register to another, or performing an arithmetic operation).

> Even the inner loop of an interpreter usually calls routines

Those are interpreters for languages that are too high-level and too dynamic; they're beyond any hope in terms of performance, by design. I'm talking about simpler and better-designed things (like OCaml, for example).


There is one next step - eliminate jump_table and rewrite your bytecode before executing it, replacing opcodes with the jump addresses.


That's pretty nifty, but it seems like something a really good compiler could achieve automatically. Of course I'm not sure if any compilers actually are that good.


This is not exactly how fast bytecode interpreters are implemented. There is one further step: pre-caching the jump table straight into the bytecode. And there is absolutely no way a compiler can transform your ad hoc switch-based interpreter into a threaded-code interpreter automatically; it is not allowed to rewrite your data willy-nilly.


Hadn't thought of that, but it sounds interesting. I guess this means that opcodes in your bytecode need to be large enough to store a pointer, this could involve some tradeoffs in terms of minimizing instructions vs fitting in the cache, but I suppose it could fall on the right side of things often enough.


32-bit opcodes are ok, even on 64-bit architectures, if you add a constant offset - see how it's done in OCaml.


An offset from the first label, or some nearby alignment point? Makes sense.

Do you have a pointer to the relevant part of OCaml's source? The whole thing is probably worth a read, but don't think I can set aside enough time for that soon, and it could take me a while to find the right part.



Nice. Simple and efficient. I am going to read more from that code base. Looks like this might be a nice read as well: https://ocaml.org/docs/papers.html#BytecodeCompilerandByteco...


Well, this is what is going into Python 2.7.11 for a 15% speed up, so apparently compilers aren't that good yet.

https://lwn.net/Articles/646888/


This is a fairly mechanical conversion, so I suppose a sufficiently smart compiler could do it.

But assuming a sufficiently smart compiler when you depend on your code being fast tends to cause issues, especially if you expect it to run on a bunch of platforms with compilers of varying quality. The last time I saw code like this, we certainly couldn't rely on compilers being particularly smart, and we did care about speed.


>In C++ labeled breaks would make 'goto' completely obsolete.

In 20 years of using C/C++, I've found one use for "goto" that's hard to substitute: simulating coroutines that yield to an outer context. (Similar to C# yield return).

A "break label" gets you out of a loop. I needed to use "goto" to jump back into the middle of a loop to resume where the "coroutine" previously left off. The keywords "break/setjmp/longjmp" wouldn't have been substitutes for this particular use case.


I've used a switch statement essentially as a jump table for this purpose. I'm curious whether that would have worked, or if you were doing something different enough that you needed a goto.


I don't quite understand how it's not a setjmp/longjmp usecase. Isn't this exactly what longjmp is for?


> The addition of some orderly construct for this in C would eliminate that case, leaving no real role for goto there either.

I like the way this is handled in Go with the defer keyword: https://blog.golang.org/defer-panic-and-recover

This construct gives you most of the power of C++ RAII without the overhead. Except that you can't use it to clean up resources on exiting an anonymous block; it strictly defers to function exit.


This construct gives you most of the power of C++ RAII without the overhead.

But it leaves out the most useful part: in C++ releasing the resources is completely implicit:

    {
        std::ifstream in(fn);
        // Do something with 'in'
    } // Released
while with Go's defer, you have to explicitly name the method you want called. If you forget, you still have a resource leak.

Sure, it's an improvement over languages where you have to cover every possible scope exit, but it definitely doesn't give you 'most of the power' of C++ RAII.


defer has unavoidable runtime overhead due to its dynamic semantics. It's strictly slower than RAII as implemented in C++ or Rust.


That's not true. The only case where the runtime overhead is unavoidable is calling defer in a loop, because that might require allocating the defer chain on the heap; in all other cases, it's just a matter of writing the correct optimization passes in the compiler.


Yes, that's what I meant by "strictly slower". At best, it can be optimized to something similar to what RAII can give you. RAII never has the overhead of the bad case.


I don't want to be excessively picky, but you said that it has "unavoidable runtime overhead", and that's true only in its rarest form (defer within a loop), which is a feature you can't implement with RAII. IOW, defer is a superset of RAII.

In all other cases (which is almost all usages), it is semantically equivalent to RAII, so the language doesn't force any runtime overhead. The only difference is that the compiler is less mature than an average C++ compiler, but this is an implementation problem, not a design problem.

RAII has no overhead because it is a pattern designed within the context of a zero-overhead language. defer allows you to implement a superset of cases that RAII handles, including those with runtime overhead.


defer isn't a superset of RAII. defer is function-scoped. RAII is block-scoped. If you want to run code at the end of a block, you can do that with RAII and you can't do that with defer—and note that this is what causes all the codegen issues with defer.
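For anyone who hasn't seen the Rust side of this: block-scoped cleanup falls out of Drop. A minimal sketch (the `Guard` and `trace` names are made up for illustration):

```rust
use std::cell::RefCell;
use std::rc::Rc;

// A value whose Drop impl records when cleanup runs.
struct Guard {
    log: Rc<RefCell<Vec<&'static str>>>,
}

impl Drop for Guard {
    fn drop(&mut self) {
        self.log.borrow_mut().push("released");
    }
}

fn trace() -> Vec<&'static str> {
    let log = Rc::new(RefCell::new(Vec::new()));
    {
        let _g = Guard { log: log.clone() };
        log.borrow_mut().push("inside block");
    } // `_g` is dropped right here, at block exit, not at function exit
    log.borrow_mut().push("after block");
    Rc::try_unwrap(log).unwrap().into_inner()
}

fn main() {
    assert_eq!(trace(), ["inside block", "released", "after block"]);
}
```

With Go's defer, "released" would only show up after "after block", since the deferred call waits for the enclosing function to return.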


To clarify, I meant the "overhead" of defining and instantiating a container class for RAII purposes.

I hadn't considered the type of overhead you're talking about, which is admittedly more important for the kind of software that tends to get written in C.


There was such a thing (std::finally), but it was removed due to lack of use and maintenance. When your standard library is completely built on RAII, you rarely need it.


I don't think I've written a goto since the 1970s, and I've written a lot of code since then. If you need to bail out of something in the middle, make it a function.


As mentioned above, in C goto is often useful for error handling (where you need to free some resources before exiting the function). It's way more convenient and readable to write the freeing code once and jump to it than to have multiple return exits and duplicate the cleanup code before every single one of them.


What the parent said still applies in your case. You just make a wrapper function that does the cleanup. I have used this pattern many times:

  static int foo_impl(int arg, int **resource1, int **resource2) {
    *resource1 = (int *)malloc(sizeof(**resource1));
    if (*resource1 == NULL) return EXIT_FAILURE;
    /* Do something with resource1 (omitted)... */
    *resource2 = (int *)malloc(sizeof(**resource2));
    if (*resource2 == NULL) return EXIT_FAILURE;
    /* Do something with resource1 and resource2 (e.g.,
       stick them in a global hash table as key/value pairs,
       or whatever, omitted)... */
    return EXIT_SUCCESS;
  }

  int foo(int arg) {
    int *resource1;
    int *resource2;
    int ret;
    resource1 = resource2 = NULL;
    ret = foo_impl(arg, &resource1, &resource2);
    if (ret != EXIT_SUCCESS) {
      free(resource2);
      free(resource1);
    }
    return ret;
  }
The advantages are that it makes it very explicit which resources need to be cleaned up (making it easier to review to be sure you haven't forgotten one, or forgotten to initialize one, because they're in a nice list), and it's harder to screw up the control flow. You can't get out of the function without passing by the clean-up code, and you can't accidentally fall into the clean-up code without explicitly returning an error.
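Since the thread is about Rust: a hypothetical Rust rendering of the same shape needs neither a goto nor a wrapper function, because each resource is freed by its owner going out of scope on any return path. This is purely illustrative (names and the Box stand-ins are made up, not the C code's semantics):

```rust
// Early returns free already-acquired resources automatically; there is
// no cleanup label or wrapper function to forget.
fn foo_impl(arg: i32) -> Result<(Box<i32>, Box<i32>), ()> {
    let resource1 = Box::new(arg);
    // ... do something with resource1; a failing step would `return Err(())`,
    // and resource1 would be freed on the way out ...
    let resource2 = Box::new(arg * 2);
    // ... do something with both ...
    Ok((resource1, resource2))
}

fn main() {
    assert_eq!(foo_impl(1), Ok((Box::new(1), Box::new(2))));
}
```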


I know it is just an example to illustrate the point, but this seems to have a bug in it: resource2 won't be initialized if the function fails to allocate resource1, so resource2 will be null when the code enters the deallocation condition.

A way to fix it would be to return different failures for every allocation and to use a switch without a break to clean up, something like:

    switch (ret) {
      case EXIT_SUCCESS:
        free(resource2);
      case EXIT_FAILURE2: 
        free(resource1);
      case EXIT_FAILURE1:
        break;
    }
EDIT: added potential fix to the code. Sorry if there is any bug in the fix, my C is a little bit ... rusty.

EDIT2: fixed a bug in the fix. C is hard, let's go shopping.

EDIT3: the explanation for the bug was also wrong. It is not a memory leak but freeing a null pointer. Fixed too.


Passing a null pointer to free() is perfectly safe, and in this case simplifies the cleanup code.

The concern of a memory leak is valid, though. I would have expected the resources to be freed regardless of the result.


Of course, you are right. I seem to have carried the belief that freeing a null pointer was as bad as freeing one twice. And now I know.

Seems to be a common misconception [1], thanks for pointing out, that makes the GGP code correct in all counts and my switch redundant.

[1] https://news.ycombinator.com/item?id=8844031


Or you could simply free resource1 and resource2 even if the allocation failed. free(NULL) is defined to do nothing.


Except of course that many languages seem to be trying to remove fallthroughs from switches too...


I often find myself writing code that digs through what can be, and often is, garbage. At some point you find it's pointless to continue. You are done. Fin. "Goto error;" in that case makes absolutely perfect sense.

Two things about goto and jmp instructions. Most people have no idea how badly they were abused back in the old days, for instance to jump into the middle of a subroutine. Yay, just saved 9!!! words of core memory!!! And most people forget that old computer scientists were obsessed with creating grammars that you could write formal proofs for.


I'm pretty sure Dijkstra would also have been against using 'return' instead of a goto.


Lua introduced goto in a recent release, to some surprise, but it is useful, especially for code generation and interpreters.


I've used goto in very specific cases, which basically boil down to breaking from inner loops to the outside of the outer loop. Thankfully Rust actually has labeled break and continue statements, so this use of goto is taken care of.
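For reference, the labeled form looks like this (the `find` function is a made-up example):

```rust
// Breaking out of nested loops with a labeled break instead of goto.
fn find(haystack: &[&[i32]], needle: i32) -> Option<(usize, usize)> {
    let mut found = None;
    'outer: for (i, row) in haystack.iter().enumerate() {
        for (j, &x) in row.iter().enumerate() {
            if x == needle {
                found = Some((i, j));
                break 'outer; // exits both loops at once
            }
        }
    }
    found
}

fn main() {
    let data: &[&[i32]] = &[&[1, 2], &[3, 4]];
    assert_eq!(find(data, 3), Some((1, 0)));
    assert_eq!(find(data, 9), None);
}
```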


> Since then, I have always thought languages should have a "goto" statement that human-written code is not allowed to use.

It's an interesting idea.

The problem I see is that users can get around it by wrapping goto in the simplest possible macro. Then the idea has backfired: we now have goto under as many names as there are goto-using programmers. :)


All control flow is ultimately derived from goto (or JMP).


I only used goto in ZX Spectrum Basic, GW Basic and several Assembly flavours.

All its use cases are better served by other language constructs.

Then again, C is a portable macro assembler.


What about things that Rust shipped without that should have been included?


Tail call optimization

https://mail.mozilla.org/pipermail/rust-dev/2013-April/00355...

https://github.com/rust-lang/rust/issues/217

> I'm sorry to be saying all this, and it is with a heavy heart, but we tried and did not find a way to make the tradeoffs associated with them sum up to an argument for inclusion in rust.

> -Graydon


Note that a lot has changed since April of 2013. Rust does have TCO, we just can't guarantee it. And, LLVM has come a long way, so we probably can support guaranteed TCO now, and have 'become' as a reserved keyword for this purpose.

https://github.com/rust-lang/rfcs/issues/271 is a better link today.


That's really good news! Thank you for pointing that out.


I praise the idea of making TCO explicit and guaranteed, but wonder if a whole new keyword is necessary? It could be avoided by modifying `return` with another keyword that is currently impossible in that position, for example

    return as foobar(x, n-1)
    
    return in foobar(x, n-1)

    override return foobar(x, n-1)

    final return foobar(x, n-1)
It's a little surprising to see an entirely new construct invented for something that is just a different implementation of `return`; after all, the end result of a computation with a tail return is the same as with a normal return, only the performance differs.


In theory, we could have done that too, and I guess we still could.


I really wish Rust/stdlib had been built around a good async io story. Right now it just feels neglected "because the crates ecosystem can deal with it".

I might be wrong here but it always seemed to me that Rust was built to replace C/C++ in critical infrastructure like Firefox, Nginx, Redis, etc. Basically critical network dependent infrastructure.

With respect (because I understand the difficulties that come with time constraints/resourcing/building an efficient async API), currently I'm not sure how Rust expects itself to be a viable replacement (let alone the best replacement) language for any of those types of applications.


Well, and again, Servo doesn't implement the entire web platform, but it is already showing significant speed gains, even without AIO. AIO is really more useful for servers than clients anyway.

It's important to remember that IO is truly a library concern in Rust. 1.0 means the language is stable, but there's still tons of libraries to build on top of that language. Holding back the language itself for a library that, while important, is only needed for certain applications wouldn't make a whole lot of sense.


In this particular context I believe using Servo as an argument is misleading. And actually it's not just about "doesn't implement the entire web platform": Servo gains all of its performance wins from parallel rendering, a very effective but entirely orthogonal optimization. Just because one optimization far outweighs the performance wins of another doesn't mean the other should be neglected entirely. Also, Servo's benchmarks run relatively light workloads, i.e. one page at a time.

The argument might be similar to using benchmarks for a parallel web rendering engine (a Servo-like) written in Go. It would probably have median latencies similar to Servo's, and Go might even scale out better if tabs were sandboxed in goroutines instead of processes. However, of course, unlike Servo it would have higher variance/p90/p99 latencies because of the GC. P.S. I'm not a big fan of Go; I was just using it to try to bolster my argument that Servo's current benchmarks shouldn't be used as an argument against async IO prioritization. Actually, side note: I would love to hear more about the flaws of building a Servo-like in Go.


The absence of async IO in the standard library in Rust 1.0 is probably an artefact of Rust's original intention to use M:N green threads and blocking IO based on libuv:

https://www.reddit.com/r/rust/comments/1v2ptr/is_nonblocking...

The timing of the move away from green threads didn't really offer enough time to implement a stable async IO option before 1.0



A requirement to stick #version 1.0 at the top of code files so we know which version they were intended to be compiled with, such that in future we can introduce breaking changes into version 2.0 and still have support for compiling legacy 1.0 applications.

Seriously, nearly all data storage formats we use today have some kind of version number in them - why are we treating code as dumb text rather than interesting data?


That probably fits better in Cargo.toml, where it's also a lot less work to add later.


The ability to provide a default implementation of a function in a trait that can be inherited and used by "classes" that extend that trait.

In C++, this would look like:

  #include <iostream>
  class B { public: int get_id() { return id; }; int id; };
  class C : public B { };

  int main() { C c; std::cout << c.get_id() << "\n"; return 0;}


Maybe I'm ignorant, and there's an easy way to do this. I was screwing around with Rust a few months before 1.0 and didn't see any mention of it in the docs, though. (Everything I saw required me to provide an impl for functions defined in an interface for every class that implemented that interface.)


> The ability to provide a default implementation of a function in a trait that can be inherited and used by "classes" that extend that trait.

Rust provides this. E.g., the "talk" method in the "Animal" trait below: [0]

  trait Animal {
      // Static method signature; `Self` refers to the implementor type
      fn new(name: &'static str) -> Self;

      // Instance methods, only signatures
      fn name(&self) -> &'static str;
      fn noise(&self) -> &'static str;

      // A trait can provide default method definitions
      fn talk(&self) {
          // These definitions can access other methods declared in the same
          // trait
          println!("{} says {}", self.name(), self.noise());
      }
  }
[0] From Rust By Example, http://rustbyexample.com/trait.html


Man. The documentation did not make that clear at all. Was this added in 1.0?

A question (in C++ syntax, as I'm not a Rust programmer): how would one call Animal::talk inside Dog::talk? The obvious thing (commenting out the println and adding Animal::talk(self)) causes infinite recursion. The other vaguely obvious thing (Animal.talk(self);) is a syntax error, which makes sense.

Edit: I'm referring to the code at the linked Rust By Example page.


`Animal::talk(self)` is shorthand for `<Self as Animal>::talk(self)` - i.e. it calls the overridden method. If you want to reuse the supertrait method, you have to implement it as a generic method outside of the trait, and call it from the default implementation:

    pub fn super_talk<T:Animal>(this: &T) {
        println!("{} says {}", this.name(), this.noise());
    }
    trait Animal {
        //...
        fn talk(&self) {
          super_talk(self)
        }
    }


No, this feature has existed a long time. Here's a post from 2012 showing its use: http://pcwalton.github.io/blog/2012/08/08/a-gentle-introduct...


The feature was added way before 1.0, if you're talking about the documentation, then I have no idea.

There is no way to call the overridden `Animal::talk`.


I thought you could do this:

  Animal::talk(self);


There is no such thing as `Animal::talk`. It does not exist. `Animal::talk` is purely shorthand for `<_ as Animal>::talk`, meaning that it is a specific type’s implementation of the `talk` method. In the case of `Animal::talk(dog)` where `dog` is of type `&Dog`, the `_` can be inferred to be `Dog`, and so `Animal::talk(dog)` is equivalent to `Dog::talk(dog)` and `<Dog as Animal>::talk(dog)`.

The default implementation, if overridden, does not exist for the given type.
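A small self-contained example of what this means (the `Animal`/`Dog` types here are made up to mirror the discussion, not the Rust By Example code):

```rust
trait Animal {
    fn noise(&self) -> &'static str {
        "..." // default implementation
    }
}

struct Dog;
impl Animal for Dog {
    fn noise(&self) -> &'static str {
        "woof"
    }
}

fn main() {
    let dog = Dog;
    // All three forms resolve to the same (overriding) implementation;
    // the default no longer exists for Dog once it is overridden.
    assert_eq!(dog.noise(), "woof");
    assert_eq!(Animal::noise(&dog), "woof");
    assert_eq!(<Dog as Animal>::noise(&dog), "woof");
}
```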


Incremental compilation and a non-braindead module system. Breaking changes will be required to fix both, which is why Rust was released too early if you ask me.


Why does incremental compilation require a breaking change? We just sketched a fully compatible design for it last week.

I'd also question why the module system is "braindead", of course.


I haven't looked at Rust since I came to these conclusions a few months ago, so add a grain of salt.

I don't recall why incremental compilation requires a breaking change, maybe it doesn't. I shouldn't have looped it into that statement without being certain.

The module system _is_ braindead, though. Last time I brought this up, I made a proof of concept that compiled the exact same program two ways - one by using the module system, and one by running the code through the C preprocessor and literally #include-ing other rust files. The purpose of this PoC was to point out that Rust modules are functionally identical to #include-ing C files would be in C, which is a well known antipattern.

Rust is very close to C when you consider the guts of the toolchain. C has solved several problems with respect to linking and I feel like Rust could have taken several more hints from C. This is probably a consequence of the Rust devs inherently disliking C and wanting to distance themselves from it.


> The purpose of this PoC was to point out that Rust modules are functionally identical to #include-ing C files would be in C, which is a well known antipattern.

But they trivially aren't.

    lib.rs:
        mod foo;
        mod bar;
    foo.rs:
        fn f() {}
    bar.rs:
        fn f() {}
No name conflict. foo::f and bar::f happily coexist.

In C:

    lib.c:
        #include "foo.c"
        #include "bar.c"
    foo.c:
        void f() {}
    bar.c:
        void f() {}
Name conflict; fails to compile.

> Rust is very close to C when you consider the guts of the toolchain. C has solved several problems with respect to linking and I feel like Rust could have taken several more hints from C. This is probably a consequence of the Rust devs inherently disliking C and wanting to distance themselves from it.

No, it's that header files are a big problem in C (DRY violation, hostile to code inlining, slow compilation) and it was felt that a real module system would be an improvement.


The PoC did this:

    mod foo {
        #include "..."
    }
Which is barely enough to say that Rust is hugely different than just #include-ing C files.

I was talking more about linking objects incrementally and the consequences of that design, rather than singing the praises of headers (though I do rather like headers). I understand C from the compiler's perspective as well, having written my own linker and assembler from scratch myself, and I really appreciate the elegance of the design.

>slow compilation

That's objectively untrue. It's much faster to compile with something like headers.

As far as inlining and DRY are concerned, back before Rust shipped I spoke with many Rust maintainers about solutions to all of these problems, but they were dismissed because "we're trying to ship". Maybe you shouldn't sail a boat when you need to replace the hull later?


> Which is barely enough to say that Rust is hugely different than just #include-ing C files.

Well, sure, if you want to get fancy enough you can make a module system out of #include. (Your code snippet isn't enough because it doesn't replicate privacy or imports.) But replicating something approximating Rust's module system with "#include plus other stuff" doesn't show that Rust's module system is "just #include".

> That's objectively untrue. It's much faster to compile with something like headers.

I don't think that's true once you have the proper incremental build setup. With headers, you have to parse large source files over and over again. With a proper module system, the compiler can use a more efficient binary database format (with an index).

For example, consider math.h. If your program is using one function (say, sin) from math.h, you have to parse all the prototypes in math.h. (And if you have inlined functions or templates in the header files, you have to parse those too!) But with a module system, the compiler can serialize an index of all the signatures of functions inside libmath.so, so the compiler can do a direct, O(1) hash table lookup for "sin".

We aren't there today, of course, since we don't have incremental compilation, and C is certainly simpler, but I think doing it right from the start will pay dividends down the road.

> As far as inlining and DRY are concerned, back before Rust shipped I spoke with many Rust maintainers about solutions to all of these problems, but it was dismissed because "we're trying to ship". Maybe you shouldn't sail a boat when you need to replace the hull later?

I don't see anything backwards-incompatible about incremental compilation, and I believe where we're going will end up better than header files when all is said and done.


> For example, consider math.h. If your program is using one function (say, sin) from math.h, you have to parse all the prototypes in math.h. (And if you have inlined functions or templates in the header files, you have to parse those too!)

Isn't this pretty much a solved problem with precompiled / pre tokenized headers? PTH are language / arch / compiler agnostic.

http://clang.llvm.org/docs/PTHInternals.html


Precompiled headers inch C and C++ closer to a module system, by discarding the traditional notion of what a header is: it becomes a sort of binary metadata instead of a textually included file. But why not just start with the right thing in the first place?


Precompiled headers are a big hack. As pcwalton mentioned, the problem was actually solved when people invented real module systems.


Which, notably, happened decades before the first implementation of the precompiled headers hack.


Yes. Well. And just a few short decades later, modules will most likely make it into C++17.


In my understanding, they won't: https://botondballo.wordpress.com/2015/06/05/trip-report-c-s...

    > Modules making it into C++17 is less likely.
However, they do say

    > That said, from a user’s perspective, I don’t think this is any reason to
    > despair: the feature is still likely to become available to users in the
    > 2017 timeframe, even if it’s in the form of a TS.


Note that Clang has been working on a true module system for C and C++:

http://clang.llvm.org/docs/Modules.html


This is Rust's approach to splitting a crate between files - not for splitting to different isolated separately-compiled modules. I must say it works great for that purpose, being much better than dealing with build tools.


That concerns us less. Rust ships again every 6 weeks.


They do have a constraint of not being able to break backwards compatibility between releases.


This is true. I guess you could say it shipped, and is still shipping, with many unstable APIs because they're so cautious about shipping things they can't take back.

I don't see that as a bad thing though. It's not great if you are trying to write Rust programs/libraries -right- now, as sometimes you will have to jump through some hoops to only use stable Rust APIs, but it will be worth it in the long term.


Or to continue the train of thought, what about things that it did ship with that could have been excluded? :)

Personally I'm starting to dislike index operations, since they can panic (and are the shortest way to access); I'd rather use the explicit Option-based APIs. Though I'm not too worried about those, since a lint disallowing them shouldn't be too hard.
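Concretely, the two styles being compared:

```rust
// Panicking index vs. the explicit Option-returning accessor.
fn main() {
    let v = vec![10, 20, 30];
    assert_eq!(v[1], 20);            // out-of-bounds here would panic at runtime
    assert_eq!(v.get(1), Some(&20)); // out-of-bounds yields None instead
    assert_eq!(v.get(9), None);
}
```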


Coroutines, Go style.


M:N threads didn't work for Rust, and they don't have that many advantages anyway even in languages where they do work. There has been a lot of discussion on this over the years and this has been the conclusion everyone came to.


> they don't have that many advantages anyway even in languages where they do work.

Are you sure about that? One of the main advantages of coroutines/greenlets IMO is writing simple and straightforward blocking code (e.g. an echo server); without them, you either need to use threads (which are slower and much more heavyweight) or callbacks and related constructs (async/await, futures, ...).


Threads aren't that slow on Linux. The main advantage of M:N threading as implemented in Go over 1:1 is that spawning is fast and doesn't use much memory, because you can avoid the syscall and only a small (initial) stack is required. Rust can't do the latter because it's not GC'd.

Even if it could, many real-world servers actually do non-trivial work in their threads, so the cost of spawning a thread is dwarfed by the actual work the thread ends up doing. There are serious drawbacks to M:N: complexity, fairness, problems in interoperability with the 1:1 world (including essentially unavoidable performance problems with the FFI), etc.


For servers that primarily speak RPC or HTTP, do you foresee Rust going thread-per-request or something more callback-y?


Neither. Existing successful solutions use tight event loops (the "Reactor pattern": https://stackoverflow.com/questions/3436808/how-does-nginx-h...).

There is a Rust library called "mio" which provides a lot of the plumbing for such systems: https://github.com/carllerche/mio.

The path forward is likely going to involve adding a way to build cheap state machines (call them generators or async/await) with a clean syntax and giving mio hundreds of thousands of reusable instances.
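To make the "cheap state machine" idea concrete: each connection can be an enum advanced by readiness events, so many thousands of them fit in a flat Vec. Everything here is made up for illustration; real code would sit on top of mio's event loop rather than call step() by hand:

```rust
// A hand-rolled per-connection state machine.
enum Conn {
    ReadingHeader { buf: Vec<u8> },
    WritingReply { remaining: usize },
    Closed,
}

impl Conn {
    // Advance by one "readiness" event.
    fn step(&mut self) {
        *self = match std::mem::replace(self, Conn::Closed) {
            Conn::ReadingHeader { buf } => Conn::WritingReply { remaining: buf.len() },
            Conn::WritingReply { remaining: 0 } => Conn::Closed,
            Conn::WritingReply { remaining } => Conn::WritingReply { remaining: remaining - 1 },
            Conn::Closed => Conn::Closed,
        };
    }
}

fn main() {
    let mut c = Conn::ReadingHeader { buf: vec![0; 2] };
    for _ in 0..4 {
        c.step();
    }
    assert!(matches!(c, Conn::Closed));
}
```

Generators or async/await would let the compiler build this enum for you from straight-line code.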


> ... Reactor pattern ...

I don't understand, and the link seems unclear. Perhaps a more direct question: I get a request X, and I need to consult a backend service to answer the request. Do I write synchronous code calling that backend? Or do I have some callback mechanism?

> ... generators or async/await

Ah. This perhaps answers my question. Both of these are essentially compiler-written callbacks.

If this is going to be like C#, then I presume there will be a thread-pool where user code will execute. It seems like a non-ideal story for concurrency. Users will have to take inordinate care not to call any blocking code; otherwise they will prevent one of the threads in the pool from doing useful work.


> It seems like a non-ideal story for concurrency. Users will have to take inordinate care not to call any blocking code; otherwise they will prevent one of the threads in the pool from doing useful work.

The downsides of going M:N are worse. The cgo-like FFI performance problems, for example, are killer for Rust's use case.


Most applications right now should do thread-per-request. Thread spawning is very optimized in both Rust and the Linux kernel, and you can adjust stack sizes if you need to. If you're hitting limits caused by this, you can use mio.
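A sketch of what "adjust stack sizes if you need to" looks like with std::thread::Builder; the per-thread workload here is just a stand-in for handling one request:

```rust
use std::thread;

fn main() {
    let handles: Vec<_> = (0..4)
        .map(|i| {
            thread::Builder::new()
                .stack_size(64 * 1024) // small stack for a cheap "request" thread
                .spawn(move || i * 2)  // stand-in for handling one request
                .unwrap()
        })
        .collect();

    let results: Vec<i32> = handles.into_iter().map(|h| h.join().unwrap()).collect();
    assert_eq!(results, [0, 2, 4, 6]);
}
```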


What about systems other than Linux?


Mac OS X is rarely used for servers, so I'm not particularly concerned about it. On Windows you can use user-mode scheduling—I would like to see a library for this—which is effectively 1:1.


Funny, co-routines have been recently added to C++ and they work wonders. D has fibers which are used extensively and once again, it's a highly desired feature. Go is kind of the poster boy for coroutines and I doubt anyone claims that it doesn't provide many advantages.

To be honest it seems to me like your explanation is an attempt to downplay just how nice fibers/coroutines are rather than acknowledge their utility in many existing languages.


No, his explanation is the few-line summary of years of failed experiments in userspace M:N threading.

Userspace is not equipped to make reasonable scheduling decisions that provide any significant performance advantage, and library/language runtime control of M thread register/stack contexts on top of N kernel threads plays absolute havoc with most operating systems' standard libraries.

Go works around this by explicitly not calling into libc et al -- all system calls are issued directly. One big problem with that: directly invoking syscalls is supported on Linux, but NOT supported on OS X.

End result is that Go literally must rely on undefined behavior on any system that does not support direct issuing of syscalls.

From my brief review just now, what MS appears to be proposing for C++17 isn't coroutines in the traditional M:N threading sense, but rather, an explicit mechanism (with syntactical sugar) for capturing reachable variables in a lambda (without preserving the stack), and issuing a call to that magicked-up lambda later via promises.

This is interesting if you love the idea of imperative mutable promise-based concurrency, but it's not likely to win you any performance gains, and it's useless in the extreme if imperative mutable promises aren't your cup of tea.


You are correct: the currently proposed C++ coroutines are vastly different from M:N userspace threading. The allocation differences are drastic, and unlike Go's they play nicely with system libraries.


IMO, if you cannot directly do a syscall in an OS, it's useless.


M:N threading may make sense for a higher-level language implementation like Erlang or GHC where you have a runtime in charge of things anyway and can reap benefits with initial stack sizes substantially smaller than a single VM page, allowing creation of millions of threads.

But since Rust is intended for runtime-less lower-level programming it doesn't make any sense here.


There is https://github.com/rustcc/coroutine-rs, though I haven't personally used it yet.


Abstract Return Types.


Non-lexical lifetimes/borrows. This is a dealbreaker IMO.


Dealbreaker? This just means you need to play with let bindings a little bit until they improve it at some point. (Same goes for SEME regions.)


It makes array use really awkward in my experience.


Non-lexical scope won't help with that. Remember that Rust does not allow mutable aliasing, so this code can never be allowed:

    let mut arr = [1, 2];
    let a = &mut arr[0];
    let b = &mut arr[0];
This code could be allowed:

    let mut arr = [1, 2];
    let a = &mut arr[0];
    let b = &mut arr[1];
And it gets Hard once variable indexes are involved. And since the whole point of using arrays is to get variable indexing, the Rust developers chose to only implement reborrowing for structs and tuples.
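For completeness: one escape hatch that does work today for two simultaneous mutable borrows into the same array, even with a variable split point, is split_at_mut:

```rust
fn main() {
    let mut arr = [1, 2, 3, 4];
    let i = 1; // variable index
    {
        // The two halves are provably disjoint, so the borrow checker
        // accepts one &mut into each.
        let (left, right) = arr.split_at_mut(i + 1);
        let a = &mut left[i];  // arr[1]
        let b = &mut right[0]; // arr[2]
        *a += 10;
        *b += 20;
    }
    assert_eq!(arr, [1, 12, 23, 4]);
}
```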


For those who are not so Rust inclined, can we have an example of what this means?


Currently, lifetimes are based entirely on lexical scope. So for example, this doesn't work:

    fn main() {
        let mut x = 10;
        let y = &mut x;
        *y = 11;
    
        println!("{}", x);
    }
This will complain

    error: cannot borrow `x` as immutable because it is also borrowed as mutable
This is because an `&mut` borrow is exclusive: while `y` is alive, we cannot use `x`. We can fix this by making a new scope for `y`:

    fn main() {
        let mut x = 10;

        {
            let y = &mut x;
            *y = 11;
        }
    
        println!("{}", x);
    }
This works, and will print `11`.

Non-lexical lifetimes would allow the compiler to demonstrate that these two things are the same, and allow the first one to compile with the behavior of the second.

It's an interesting tradeoff, because right now, the rules are very simple and conservative. Scope is fairly easy to reason about. Non-lexical lifetimes would make certain things easier, but also a bit harder to reason about, because the rules are more complex.


Could you elaborate on or link to a more detailed tradeoff?

I want memory safety with as little hassle as possible, and Rust not understanding the safety of the first example means hassle. I expect the compiler to try to understand even if it's 'hard'. It doesn't need to understand everything, but the mentioned situation should be doable.


> Could you elaborate on or link to a more detailed tradeoff?

I'm not sure what you mean by a 'more detailed tradeoff.' You mean a more complicated example?

> I expect the compiler to try to understand even if it's ’hard’.

It's not a matter of difficulty, exactly, it's a matter of how easy it is to understand what the compiler is doing. Figuring out non-lexical scopes means that my mental model of what the compiler is doing is more difficult than it is right now, which may or may not be the right tradeoff. I would say that most people want non-lexical lifetimes/SEME regions to be implemented, though.


An effects system.

A safe stdlib - aborting on malloc failure is not safe.


Then you are using a different meaning for safe than what Rust usually does.

Abort occurs on OOM, that's right, and also on double panic (most frequently seen when a destructor panics during unwinding).


I'd need to recheck my vocab, but I'm pretty sure aborting is always safe.


Not if you want to write an OS.


Then you won't use the stdlib anyway.


Rust's usage of 'unsafe' refers to a very specific thing: memory safety. Aborts are always safe, though they may not be what you want, of course.


libcore and no_std is what you want to use if you want finer-grained malloc-free control over things.


" Next time you're in a conversation about language design and someone sighs, shakes their head and tells you that sad legacy design choices are just the burden of the past and we're helpless to avoid repeating them, try to remember that this is not so."

I think rust is neat, but this is somewhat arrogant.

Rust is amazingly young. 10-20 years from now, when rust hopefully has bajillions of users, if this is still true, then you can say it. I mean, do you really believe that Rust won't have things that turn out to be warts from 1.0 it can't remove 10-20 years from now?


I think you're misreading this. I read it as saying that these things are sad design choices in many similar languages, but that Rust has avoided them. Nothing follows about them not having included other sad design choices that they're unaware of.

The point is that you can at least try to avoid the things you view as sad design choices--you're not necessarily stuck with them.


I know that Rust has warts. That doesn't mean it has to have the same warts.


On the whole, it's that arrogance that keeps me away from Rust - for now.


Exactly. It's not like we don't already know plenty of mistakes that will bite them a few years down the road.


I used Common Lisp's goto (tagbody with go) to implement portable tail recursion: a "tlet" construct that looks like the "labels" syntax for defining a local function, but is just a stackless goto thunk with parameters.

http://www.kylheku.com/cgit/lisp-snippets/tree/tail-recursio...

This also provides some facilities for doing cross-module tail recursion among top-level functions. Here, continuation to the next function is provided by wrapping the function call in a closure, performing a non-local exit which abandons stack frames up to a dispatch loop, which then invokes the closure.


I'd like Rust to be shipped without counterintuitive standard library function names and that book with all its style 'recommendations'. And I'd like the Rust compiler to be shipped without that non-snake case warning enabled by default.

Language creators won't endear themselves to me by ranting. The problem I have with Rust is _not_ the language itself.


This is the first I've heard complaints that Rust is too strict with formatting. If anything, the popularity of gofmt (which is far more opinionated than rustc's default warnings are!) is a testament to the fact that people want languages to be ruthlessly opinionated regarding style these days.


It's so much better to let the computer worry about formatting such that the programmer can worry about the logic.

fmt-tools offer the amazing possibility that one programmer writes and reads the same code with different formatting than another programmer. I'd love to be able to set my formatting in my editor so that I see it how it's best for me but on saving or sharing code the formatting is reverted to the standard.

So, I think there should be a default style for rustfmt, but also support for other styles.


rustfmt (currently in development) follows this principle.


Please don't make it an option. Use one format to rule them all, as with "gofmt". I don't care what it is, but pick something and standardize.


I'm not working on rustfmt but my understanding is that its output is going to be configurable.

Every language carries with it a culture. The culture of Go is one which allows for gofmt to define one true style and refuse to deviate, just as it allows the language designers to refrain from adding generics.

The culture of Rust is not like that, for several reasons. For one, the Rust community loves a good bikeshed. For two, the syntax of Rust is more complicated than Go's, and includes situations (match statements & where clauses come to mind) in which people are just going to want to do different things.

I know the advantages of one true style - everyone's heard the arguments - and there's a sane, median default as the official style guide, which will be rustfmt's default output. That seems like a good compromise.


It seems you are misunderstanding the situation because you are talking about not making “it” an option and you are then reiterating what I also said.

Even if I repeat myself: I think there should be one default formatting that is standardized and there should be the option to emit in other formats such that everyone can read in the individually preferred format.

With fmt you don't need to establish formatting rules on a project basis, anymore. Everybody can just configure their editor to format the code how they want it to look. That is why I think rustfmt should be compilable as a library, too.


That's absolutely OK and I do understand that people want that. I actually like snake case and only slightly prefer camel case. I'd write snake case in large open source projects, if it's actually preferred. But it's very awful to type snake case code on my keyboard and I'm not ready yet to switch to a US keyboard. The worst thing about that warning is that I get some kind of a bad conscience by disabling it.


Even on a US keyboard snake case is annoying to type (for me at least.)


have you thought about binding underscores to some other key combination at the operating system level?


SHIFT+SPACE could be a good keyboard shortcut for underscore because snake_case's underscores represent spaces between words.


Oooh, I hadn't thought about this. I might want to try this even on my (US) keyboard!


> If anything, the popularity of gofmt (which is far more opinionated than rustc's default warnings are!) is a testament to the fact that people want languages to be ruthlessly opinionated regarding style these days.

It's a testament to the fact that some people want that; that's very different from it being what people in general want.


I'm in the camp that would love for rustfmt to become both very strict/opinionated and widely used.

It makes everything so much easier to read.


The idea that "[u]sing a return as the last line of a function works, but is considered poor style" irks me so much. A lot of what I find appealing about Rust is that it makes so much explicit through its type system, so I don't understand the philosophy behind preferring implicit returns, especially since you could have a scenario where someone hasn't finished writing a function but it still compiles without error.


Implicit returns encourage functional style; foo.map(|x| x + 1) is so much nicer than foo.map(|x| { return x + 1; }). Once you have implicit returns in closures, you might as well have them everywhere for consistency.


I think that the "expression-style" return looks good for short and "expression-like" function.

Good

    fn inc(a: u32) -> u32 { a + 1 }
    fn foo(a: u32, b: u32) -> u32 { let x = a + b; a * x }
Bad

    fn bar(...) -> bool {
        let mut success = false;
        let conn = get_connection();
        ...
        if x > y {
            return false;
        } else if z < q {
            success = false;
        }
        foo.barify(x, y);
        ...
        success
    }
It looks especially bad when the function has multiple early returns, and then the final return looks different.


This could easily be added as a lint to the codebase.


Sheesh, it's such a little thing. "success" vs "return success;"


Doesn't work when you're not returning from a function (ie, EVERY OTHER PLACE a block can appear)


Why "might as well"? Things can be optimal in some places and suboptimal elsewhere.


Consistency is valuable. If some blocks have different rules to other blocks that's a real downside.


That's true, but they're so syntactically different that I can't imagine a scenario where someone is asking whether it's closures or functions that you're supposed to use return statements in.


A foolish consistency is the hobgoblin of little minds. Sometimes a variation in rules allows for more clarity -- one could set their editor to make return a different color, for example, making it easier to glance at program flow. Then again, I'd argue that closures should allow returns too, like they do in C#. I'm an adamant supporter of the explicit camp -- when you have to debug things at 3 in the morning, sometimes that extra little bit of context can make things blindingly obvious.


I find the way it works in scala very nice - a block is just an expression that evaluates to the last statement in the block, and idiomatic code never uses "return" anywhere. Even e.g. a function definition, you can replace the block with a single expression if it's more convenient. I agree with being explicit but I don't think return actually makes things any more explicit (if you're talking about an editor highlighting, the editor knows which line is the value of the block, with or without a "return") - rather it's just syntactic ceremony. It's surprising how much difference having very low-overhead closures makes - you can make so much of your program simpler, because it's not a problem if a caller needs to make a slight modification to something.


Closures do allow returns in Rust.
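For example (a quick sketch; note that `return` inside a closure exits the closure, not the enclosing function):

```rust
// `return` inside a closure returns from the closure itself,
// not from the enclosing function.
fn main() {
    let v: Vec<i32> = vec![1, 2, 3]
        .into_iter()
        .map(|x| { return x + 1; }) // equivalent to |x| x + 1
        .collect();
    assert_eq!(v, vec![2, 3, 4]);
    println!("{:?}", v);
}
```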


> foo.map(|x| x + 1) is so much nicer than foo.map(|x| { return x + 1; })

"Nicer" is a subjective thing. BTW in the trivial case above one may judge this or that to be nicer, but in a large method, 'return func(a,b,c)' is obvious, whereas looking at 'func(a,b,c)' it is super non-obvious that the value is being returned.


It's not at all non-obvious if you learn the language. It's also not non-obvious if you learn any of the myriad other languages with the same semantics (including popular web languages like Ruby and CoffeeScript).


Compromise: make return very short, like a unary colon ":".

Then "foo.map(|x| :x+1)" is nice. If that doesn't stand out enough for some people on a line of its own, make your editor render the unary colon line in a very bold color for you.


That doesn't solve the fact that blocks and conditional control flow (if/else and match) are expressions and need a way to produce values.


The most compelling argument I've seen is that you don't explicitly `break` at the final run of a loop, either. An explicit `break` or `return` in the middle of a body can be thought as "abnormal" control flow. Otherwise, normal control flow will take place. As for a function, it would be returning the last expression.


That's what I mean. Why should I even care about stuff like this being "considered poor style" by the creators of Rust? "Poor style" is what doesn't work well for my team and me.


> Why should I even care about stuff like this being "considered poor style" by the creators of Rust?

Maybe you are writing code for the Rust standard library, where following the core projects style recommendations would be important for consistency.

Maybe you don't want to start from scratch coming up with your own style, and want a decent starting point from which you can vary as your team figures out what does/doesn't work for them.

Lots of reasons to have a language project also provide default style recommendations.


Style recommendations are great and if I was writing code for the standard library, I would certainly follow them. But they don't simply recommend their style. "We recommend doing this like that" is very different from "Doing this that way is considered poor style". That sounds like their recommendations were objective facts and you shouldn't do anything else, even if that worked better for you and the majority of other programmers.


> That sounds like their recommendations were objective facts

On the one hand, it is an objective fact that it is considered, by the authors, poor style.

On the other hand, that it is "considered" anything is an explicit (not merely implicit) statement that it is subjective (and "poor style" -- or good style, for that matter is inherently subjective in any case.) So, characterizing that language as making it sound "like their recommendations are objective facts" seems quite bizarre.


You can say that it can be an objective fact, though, that the majority of 'good developers' considers all that stuff poor style. Which is what I get from reading that book. If they don't intend to imply that, they should write it in a different way. The authors of various other books on various other programming languages are able to do that.


Yeah, that's one thing that's really putting me off looking at Rust - the possibility of it enforcing style in the future.

Hanging braces style for C/C++ is rarely allowed in the coding standards I've had to use in the past for embedded and real-time stuff in the defence industry, because it can be a source of errors.

Aligned opening and closing braces are much more common (in line with ADA's style).


Rust doesn't enforce style anywhere. You can turn off all style warnings at once too with `#![allow(bad_style)]` or a command line flag.

By default it just warns. That's not enforcing.


Come on, even that flag's name is ridiculous. They basically say, for example, that writing camel case, which is the preferred style in most of the projects I've seen, is bad style. Do they like being unnecessarily antipathetic? People think about their personal preferred style for years, and they have very good reasons for choosing them. And in _many_ cases, it's exactly the style that's called bad by Rust's creators.

How about making it the opposite, '#![allow(default_style)]'? Or at least '#![allow(non_default_style)]'?


In some cases skirting style can be harmful (e.g. in match statements there's an ambiguity with enum variants and variable bindings that is not an issue because of the style lints).
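A quick sketch of that ambiguity (hypothetical names; it's exactly the mistake the casing lints make easy to spot):

```rust
// A lowercase name in a pattern is a new binding, not a reference to an
// existing variant, so the first arm below matches *everything*.
#[allow(dead_code)]
enum Color { Red, Green }

fn describe(c: Color) -> &'static str {
    match c {
        // Intended to match Color::Red, but `red` just binds `c`.
        // The camel-case convention for variants makes this stand out.
        red => "matched the catch-all binding `red`",
    }
}

fn main() {
    // Even Color::Green falls into the `red` arm.
    println!("{}", describe(Color::Green));
}
```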

But I get your point. I think people would be open to that naming change if you file an issue.


Yes, you're right, of course.

Maybe I could at least try, yes. I mean... It's kind of silly, I know, but that would already change the way I look at Rust.


The compiler will not enforce more style in the future, that will be done by Rustfmt, a separate and optional tool. In fact, it is likely that the compiler lints which enforce style will move out of the compiler and in to Rustfmt at some point in the future.


I like the warnings about style and naming conventions. I kinda wish there were more of them. These warnings can help teams avoid arguments about things that don't really matter very much.


I think the enforcing of style (snake_case, CamelCase) is great, because it makes code more readable.


I would add Ropes: https://github.com/rust-lang/rfcs/issues/653

Given the frequency of manipulating large bodies of text in contemporary programming I do not agree it is best left to a library implementation.

Without ropes as core std average rust developers will do what they did in java, c++, objC and simply use and abuse std::strings in all cases including those where it will perform poorly.

https://en.wikipedia.org/wiki/Rope_(data_structure)

[pdf] http://www.cs.rit.edu/usr/local/pub/jeh/courses/QUARTERS/FP/...


I think some progress will be seen in libraries first, namely things like tendril.

https://github.com/servo/tendril


What does "unions that allow access to the wrong field" mean?


    #include <stdio.h>

    union foo {
        int x;
        float y;
    };

    int main(void) {
        union foo bad;
        bad.x = 100;
        printf("%f\n", bad.y); // undefined behavior (though usually works)
        return 0;
    }


No undefined behavior. Perfectly legal since C99 standard.


Can we please get a quote on this one? If reinterpreting memory via a union is valid in C99, including data vs. function pointers, then reinterpreting that memory via a cast would be too, which would seem to violate one of the most elementary aspects of the standard (e.g. such a rule would be difficult or impossible to implement on a Harvard-architecture machine, which the standard previously made plenty of allowance for).


Page 83, §6.5.2.3 of the C11 standard, footnote 95, says

> If the member used to read the contents of a union object is not the same as the member last used to store a value in the object, the appropriate part of the object representation of the value is reinterpreted as an object representation in the new type as described in 6.2.6 (a process sometimes called ‘‘type punning’’). This might be a trap representation


The C standard does not allow casting directly between function pointers and other pointers; they might not even be the same size. Using a union or memcpy will allow you to standards-compatibly reinterpret the bit pattern of a function pointer as a data pointer or vice versa (modulo size differences), but creating the resulting pointer, even without dereferencing it, might cause a crash if the bit pattern is a "trap representation", and in any case dereferencing it isn't guaranteed to do anything useful. ...Not that a portable program has any business trying to read instruction opcodes in the first place.


Apparently that is correct and I was wrong above (though not in C++) [1]. However, according to the standard, the actual value is unspecified, because float bit representation is undefined.

[1]: http://dbp-consulting.com/tutorials/StrictAliasing.html


Going through a union is the correct way of viewing one type as another. Casting pointers to accomplish this is undefined behaviour (except for char *), and compilers do TBAA (type-based alias analysis) based on the assumption that you don't do it. Famously, the Linux kernel does this a lot and needs to be compiled with -fno-strict-aliasing.


Which is actually a useful feature in some obscure optimizations, like this fast approximate inverse square root:

    #include <stdint.h>

    float approx_inv_sqrt(float r)
    {
        float half_r = r * 0.5F;

        union
        {
            float y;
            int32_t i;
        } u;

        u.y = r;
        u.i = 0x5f375a86 - (u.i >> 1);
        u.y = u.y * (1.5F - (half_r * u.y * u.y));
        return u.y;
    }


That hasn't been an optimization for 15 years now. Fast approximate inverse square root is a builtin operation on most FPUs-- on x86, it's RSQRTSS, which tends to be the same cost as a floating point multiply. (for Haswell, it has a 5 cycle latency, and 1 cycle reciprocal throughput)


Interesting, though not useful unless there's a good way to access it from portable C/C++. An __asm__ section or equivalent would work, but would only be useful on a particular CPU -- and different compilers use different syntax for embedding assembly, so that's not ideal either.

But where I'd more likely use that optimization at this point is on Arm or ATmega processors. ARM doesn't seem to have an approximate inverse square root, based on a quick check, and ATmega are frequently still stuck with software floating point, so I'd hardly say that the optimization is dead.


If you want to reinterpret a float as an integer or vice versa, you can do that easily enough with Rust's unsafe functions:

    fn approx_invsqrt(r : f32) -> f32
    {
        let y : f32 = unsafe {
            let i : i32 = std::mem::transmute(r);
            std::mem::transmute(0x5f375a86 - (i>>1))
        };
        return y*(1.5-(0.5*r*y*y));
    }

    fn main()
    {
        println!("approx_invsqrt(2.0) = {}", approx_invsqrt(2.0));
    }
Result:

    approx_invsqrt(2.0) = 0.70693


And Rust specifies their floats to be IEEE 754, so it always works (endianness may apply though).


Seems reasonably safe to assume that i32 and f32 have the same endianness.


In C you can store a value to a field of a union and then read from a different field.

This is usually done to "get the byte representation of a float" or things like that.

Except... it's undefined behavior.


> Except... it's undefined behavior.

No longer true since C99.


Source please? Or do I need money to get a copy of the standard?


Latest standard drafts are usually free

for C99 see http://www.open-std.org/jtc1/sc22/wg14/www/docs/n1256.pdf


Get publicly available draft 1570 of the C11 standard from the committee: http://www.open-std.org/jtc1/sc22/wg14/www/standards.html



I don't get why those are bad?

- random-access strings

- auto-increment operators


The first is bad because of UTF-8, basically. It's unclear what an index should even return: bytes, codepoints, grapheme clusters? Furthermore, because it's a variable-length encoding, it's not O(1) access, which is what [] implies, and since Rust tries to surface the cost of operations, it would be inappropriate for Rust.

(Note that Rust _does_ have a ranged syntax here, which returns bytes, and is O(1))
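A small sketch of the difference (byte length vs. codepoint access):

```rust
fn main() {
    let s = "héllo";               // 'é' takes two bytes in UTF-8
    assert_eq!(s.len(), 6);        // len() counts bytes, not characters
    assert_eq!(&s[0..1], "h");     // O(1) byte-range slicing
    // `s[1]` doesn't compile: str has no direct indexing.
    // Codepoint access means iterating:
    assert_eq!(s.chars().nth(1), Some('é'));
    assert_eq!(s.chars().count(), 5);
    println!("{} bytes, {} chars", s.len(), s.chars().count());
}
```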

Pre vs post increment just leads to a lot of confusion, and doesn't give you much over `x += 1`, in my opinion. Not sure what Graydon's reasoning is here.


Just as additional context, since this seems to come up a lot - that does not mean that UTF-8 is bad. The main alternative is UTF-32, which provides random access to codepoints rather than bytes, but since Unicode graphemes consist of multiple codepoints, UTF-32 is effectively a variable width encoding as well for most purposes, just one that uses much more memory than UTF-8 most of the time and never uses less.


Oh yeah, I don't think any of this makes UTF-8 bad, in fact, I think it's the best choice.

It just reveals the inherent complexity in handling string data.


Ok, thanks for the details, I suspected as much. My intuition says string indices should be accessible as grapheme clusters (and, additionally, codepoints), bytes are meaningless anyway. Whenever I want to cut and slice a string, I usually want it to be at the boundary of a letter. Yet most languages don't seem to do it this way, strangely enough.


Well, that's about efficiency.

If you want to be able to slice at grapheme boundary n, you have to iterate through the string counting up to n to find out where it is. Might as well just provide an iterator over graphemes in the first place.

You can optimize slicing on codepoint boundaries by storing strings internally in a fixed-width representation, like Python does. But now you have to convert encodings all the time just to use UTF-8 for your I/O. And that still only gets you codepoints.

It's not a great tradeoff to do it that way. It wastes memory and CPU time, and randomly accessing characters of strings is just not something you actually need to do often enough to optimize for it.


And UTF-16 random-access strings (like in Java and many other things designed in the 90s) make the problem even worse, because with UTF-8 you will notice the bugs as soon as the first non-English-speaker uses your system, but with UTF-16 the affected languages are a lot more obscure.


Go has increment operators, but they have to be in their own statements, you can't intermingle them in an expression. The reason is, putting them in the same line as other stuff makes for a higher chance of bugs. For example, Tarsnap's CTR bug happened because of this. The removal of a nonce increment would have been very obvious if all increments had to be on a separate line, either by language fiat or properly risk-averse C coding practices.


i += ++i;

What should the result be? What is the result in C++? (It's undefined.)


If you're making a new language, that example could go a couple of ways:

1. It could be a compiler error. The existence of certain operators doesn't mean they can be combined arbitrarily.

2. Sequence points could be defined differently, allowing for such a convoluted line. This would likely hurt performance, as the compiler would have fewer operations to reorder in each sequence point.

Typically, increment and decrement are just syntactic sugar. I don't mind if a language lacks them, but I can see why people like them. A convoluted example of UB in C++ is not a good argument against having them.

Also: In C++, overloaded operators are function calls, and function calls are sequence points. So if i is an object, the behavior is defined. That said, it would still be bad code. :)


I am not sure if 2 would actually hurt performance in the real world. Remember the whole "must appear to be in order" thing: if the compiler knows that reordering doesn't change observable behavior, it's free to rearrange things.

Note: things like this are part of the potential advantages of static linking.


This could actually work fine in Rust, because it has deterministic evaluation ordering for everything (or almost everything, though I can't think of anything that isn't deterministic).

Knowing that `++i` is `{ i += 1; i }`, we have `i += { i += 1; i };` (which should compile already).

The nested assignment can be hoisted to obtain `i += 1; i += i;`.

That means Rust would compile `i += ++i;` as `i = (i + 1) * 2;`.
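As a sanity check, the block form really does compile and behave that way (for primitive types, Rust evaluates the right-hand side of `+=` first):

```rust
fn main() {
    let mut i = 3;
    i += { i += 1; i };   // the `i += ++i;` desugaring
    assert_eq!(i, 8);     // == (3 + 1) * 2

    // the hoisted equivalent:
    let mut j = 3;
    j += 1;
    j += j;
    assert_eq!(i, j);
    println!("{}", i);
}
```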


How about stdlib-blessed async IO? Or even async/await keywords (simple CPS transformation) a la C#?


Well, it did, but these are things that are good _not_ to have, not things that would be nice to have.

If those are things you do want to have, well, we didn't have a design for AIO that we were happy with yet, as even the most advanced Rust library, mio, is still a work in progress. https://github.com/rust-lang/rfcs/issues/1081

As for async/await, https://github.com/rust-lang/rfcs/issues/388


Rust also shipped without reflection, which was available in the beginning. I was a bit disappointed as it was something I would have made use of for serialization.


Serialization is usually done better with macros, for performance reasons. We now have Erick Tryzelaar's fantastic serde library for high-quality serialization, competitive with rapidjson.


Oh, serde is interesting. I was specifically thinking about compile-time reflection but apparently rust's macros are more powerful than I realized. I'm inspired to look into this more, thanks!


Serde (https://github.com/serde-rs/serde) is my baby, so let me know if you need any help with it. In a sense serde is a compile time reflection library. It's designed to recursively walk through a structure while calling a function on each component. While the primary purpose is to do serialization, it could be used to do other reflection-y things, like build a UI widget for arbitrary structures. To really support full reflection though serde would have to be modified to support in-place modification. It wouldn't trivial, but it's certainly not impossible.
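A toy sketch of that "walk the structure, call a function per component" shape (nothing like serde's real API; `ToySerialize` and `Point` are made-up names):

```rust
// A hand-rolled stand-in for what a serialization macro would generate:
// each type knows how to visit its own components.
trait ToySerialize {
    fn serialize(&self, out: &mut String);
}

impl ToySerialize for u32 {
    fn serialize(&self, out: &mut String) {
        out.push_str(&self.to_string());
    }
}

struct Point { x: u32, y: u32 }

// What a macro/derive would write for you: walk each field in order.
impl ToySerialize for Point {
    fn serialize(&self, out: &mut String) {
        out.push('{');
        self.x.serialize(out);
        out.push(',');
        self.y.serialize(out);
        out.push('}');
    }
}

fn main() {
    let mut out = String::new();
    Point { x: 1, y: 2 }.serialize(&mut out);
    assert_eq!(out, "{1,2}");
    println!("{}", out);
}
```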


Wouldn't it be better to create serialization functionality at compile-time?

In the Java space some libraries are moving to compile-time code generation instead of relying on reflection. It is a huge win, since a lot more can be checked beforehand. Dagger 2 is a good example how it can be beneficial. It provides dependency injection at compile-time, which will in turn check whether all dependencies are satisfied. I haven't seen this being done at compile-time before, but it is definitely a step up from reflection-based DI.

I'm not sure whether macros of Rust can provide such functionality, but the developers seem conservative when it comes to adding functionality. That seems like a good thing.


I was referring to compile time, yes. How does Dagger generate the code? Does it run an executable first?

Yes, in C++ there is a similar library called ROOT which generates C++ files called "dictionaries" storing class information by running an executable over the files. I don't see (or understand) the downside of providing the functionality to perform those steps at compile time, though. The developers of ROOT are currently pushing for it to be included in C++17 (or beyond).


Sometimes I wish no runtime reflection was available to the programmer in Java; that would prevent people from being tempted to use brittle runtime magic. Of course, you would need a solid macro system instead.

And thanks for the pointer to dagger, this looks like an interesting alternative.


How do you imagine that reflection would be useful for serialization in Rust?


In the way I asked here, http://stackoverflow.com/questions/29109967/why-dont-many-co...

I am not very familiar with rust, is what I was describing currently possible?


It's not possible, but there's also no good reason to do it. You might as well just make an e.g. Serializable trait that requires you to manually implement the serialize function. You're not saving a whole lot of work by looping through the object's fields and saving them. Most objects only have a few fields to serialize anyway.


Perfection is achieved, not when there is nothing more to add, but when there is nothing left to take away. -- Antoine de Saint Exupéry


How does one accomplish loop-unrolling as in Duff's Device in rust, if case statements do not fall through?


Well, you generally don't, because Duff's Device isn't generally regarded as a good idea anymore, in my understanding:

  > It turns out that with branch predictions and the relative speed of CPU
  > vs. memory changing over the past decade, loop unrolling is pretty much
  > pointless.
http://lkml.iu.edu/hypermail/linux/kernel/0008.2/0171.html


Still, there are cases where fall-through can be useful:

    int remaining = length % 4;
    switch (remaining) {
        case 3: h ^= (data[(length & ~3) + 2] & 0xff) << 16;
        case 2: h ^= (data[(length & ~3) + 1] & 0xff) << 8;
        case 1: h ^= (data[length & ~3] & 0xff);
                h *= m;
    }
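For what it's worth, the same tail handling translates to Rust without fall-through by making each step an independent condition (a sketch; `mix_tail` and its parameters are made-up names):

```rust
fn mix_tail(data: &[u8], h: u32, m: u32) -> u32 {
    let mut h = h;
    let base = data.len() & !3;      // start of the last partial word
    let remaining = data.len() % 4;
    // Each `if` plays the role of one fall-through case.
    if remaining >= 3 { h ^= (data[base + 2] as u32) << 16; }
    if remaining >= 2 { h ^= (data[base + 1] as u32) << 8; }
    if remaining >= 1 {
        h ^= data[base] as u32;
        h = h.wrapping_mul(m);
    }
    h
}

fn main() {
    // Three trailing bytes 0x01, 0x02, 0x03 give h = 0x030201 (with m = 1).
    assert_eq!(mix_tail(&[1, 2, 3], 0, 1), 0x0003_0201);
    println!("{:#x}", mix_tail(&[1, 2, 3], 0, 1));
}
```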


https://github.com/pythonesque/fallthrough can emulate the fallthrough style if you really, really want to have it. It seems like it hasn't been updated in a while, though.


In the post-1.0 world, a repository not having been updated for a few months doesn't automatically mean it no longer works :P


That's true but January 11 is in that weird grey area...


Let me be clearer: that repository works fine and doesn't rely on the standard library at all.


It's interesting, but a loop makes for smaller code, by a couple dozen bytes or so:

    int remaining = length % 4;

    if (remaining)
    {
        do
        {
            remaining--;
            h ^= (data[(length & ~3) + remaining] & 0xff) << (remaining * 8);
        }
        while (remaining);

        h *= m;
    }
Fall-through's interesting, but at the same time, as architectures have changed, it has become less useful. Self-modifying code was at one time nearly vital but has fallen by the wayside; fall-through is going much the same way.


Usually I would just factor out the guts of the expression into a little closure in that case. (let f = |x| h ^= ... x ...) LLVM should inline it just fine, and it'll save you typing.


If loops can have specialized control-flow keywords (break; continue), why not let match have one (fallthrough)?

    match x {
        0 => { /* do some stuff */; fall_through; },
        1 => true,
        _ => false
    }
However, I don't know yet how useful it would be. I can't remember ever really needing it, so it would probably need a few practical examples before it became a reality, but it's an idea.


If we really need it I'd rather have C#-style goto-label instead of explicit fall-through, which is strictly less general. (But I'd almost rather it be a tail-duplicating macro to begin with, since the feature is so rarely needed.)


I don't see how this helps here, unless you intend to call the closure several times per case.


I wonder if this is really faster than just writing it as a loop.


Generally doing anything crafty like this in real-world or production code is just going to cause the next guy who shows up to be very, very confused. Loop unrolling optimizations and the like should really be left to the compiler.

If you're truly sure you're better than the compiler, I'd imagine an assembly language implementation would be easier to understand than nested switch/while fall through madness.

That said, there's an implementation of Duff's device in the Wikipedia entry which (IMO) is much easier to understand and maintain, doesn't use fall-through, and should be just fine to write in Rust.

https://en.wikipedia.org/wiki/Duff's_device


Duff's Device is now considered an anti-pattern... the X server developers found their code actually got faster when they dropped it. <http://lkml.iu.edu/hypermail/linux/kernel/0008.2/0171.html>.


Lack of `goto` is a huge disadvantage for a language with macros. Pity they did not want to provide something like `unsafe` keyword which would enable all the dirty, dangerous, high-performance stuff.


Rust has an `unsafe` keyword, but goto isn't one of the things it enables.


One of the things Servo (the web browser engine written in Rust) has decided not to write from scratch is a (high-performance) JS engine. If they did, they might have more feedback on this kind of thing.


What is wrong with UTF-16 support?


It's an encoding that isn't good at anything: it's neither ASCII-compatible (like UTF-8), nor fixed-length (like UTF-32), but because most characters require only 2 bytes, developers frequently assume that none require more, leading to bugs when a character eventually is represented by 4 bytes.
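A quick Rust illustration of that trap: most characters need one UTF-16 code unit, but some need two.

```rust
fn main() {
    let mut buf = [0u16; 2];
    // U+00E9 fits in a single UTF-16 code unit...
    assert_eq!('é'.encode_utf16(&mut buf).len(), 1);
    // ...but U+1F600 needs a surrogate pair (two code units).
    assert_eq!('😀'.encode_utf16(&mut buf).len(), 2);
    println!("ok");
}
```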


> fixed-length (like UTF-32),

UTF-32 is only fixed-length if you don't care about diacritics, variation selectors, RTL languages, and others. Unicode is not one code point or one char/wchar/uint32 per glyph.


You've changed topic from code points to grapheme clusters. Rust's character/string support is strictly for code points (the documentation is fairly clear about the distinction).

Few string libraries actually deal with grapheme clusters as the native underlying representation (Swift being a notable exception).


The broader point I'm making is that unicode is hard and attempts to simplify it by choosing a different encoding (i.e. switching to utf-32 to save yourself from all problems) are a bit misguided.


My life is a lie.


UTF-32 is not good for anything either, easy access to codepoints is just as useless as access to UTF-8 bytes. Any meaningful operation on text (even counting number of characters) requires parsing grapheme clusters, which have variable length regardless of what encoding is used.
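For instance, in Rust (stdlib only; the function name is made up, and proper grapheme segmentation would need something like the unicode-segmentation crate):

```rust
// Sketch: counting Unicode scalar values (code points) is not counting
// user-perceived characters. "é" written as e + U+0301 COMBINING ACUTE
// ACCENT is one grapheme cluster but two code points, in any encoding.
fn code_point_count(s: &str) -> usize {
    s.chars().count()
}
```

Here `code_point_count("e\u{301}")` is 2, while the precomposed `code_point_count("\u{e9}")` is 1, even though both render as the same single glyph.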


I don't know much about Rust and its library, so I have a question: if I want to develop Windows-only software in Rust, will I need to convert back and forth between UTF-16 and UTF-8 (or whatever Rust uses in other parts of the library)?


Yes.

The Rust std library had to pick a string encoding, and it picked UTF-8 (which is really the best Unicode encoding). The String type is platform neutral and always UTF-8.

However, it does provide an OsString type, which on windows is UTF-16. Maybe there is a library - and if not, one could be written - targeting Windows only, and implementing stronger UTF-16 string processing on the OsString type.

EDIT: To be clear, Rust's trait system makes this very easy to do. You just define all the methods you want OsString to have in a trait WindowsString, and implement it for OsString, even though OsString is a std library type. One of the great things about Rust is that it's trivial to use the std library as a shared "pivot" which various third-party libraries extend according to your use case.
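A hedged sketch of that extension-trait pattern (the trait and method names are hypothetical; a real Windows-only version could build on `std::os::windows::ffi::OsStrExt::encode_wide`, while this portable stand-in goes through a lossy UTF-8 conversion instead):

```rust
use std::ffi::OsString;

// Hypothetical extension trait in the spirit described above: a local
// trait gives the std OsString type extra, platform-flavored methods.
trait WindowsString {
    fn utf16_len_lossy(&self) -> usize;
}

impl WindowsString for OsString {
    // Portable approximation: count UTF-16 code units after a lossy
    // UTF-8 conversion; on Windows one would use encode_wide directly.
    fn utf16_len_lossy(&self) -> usize {
        self.to_string_lossy().encode_utf16().count()
    }
}
```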


I believe Rust uses WTF-8 as an intermediate format for windowsy things (cheaper), but I'm not sure.


What is... oh... UTF-16, the gift that keeps on giving... this is, at the same time, utterly hilarious and horribly depressing:

https://simonsapin.github.io/wtf-8/

But there is actually prior art here - Java's contribution to perverse Unicode encodings is called "Modified UTF-8" and encodes every UTF-16 surrogate code unit separately.

http://docs.oracle.com/javase/6/docs/api/java/io/DataInput.h...


We have http://doc.rust-lang.org/stable/std/ffi/struct.OsString.html to abstract over a native string in whatever encoding your platform has. Generally, things that interact with the OS use these, and they can convert to a UTF-8 String.


Suppose I'm on Linux, but I want to interact with Windows stuff. (CIFS protocol, NTFS on-disk format, disassembler for Windows executables, Wine-like program, cross-compiler, etc.)

I'll be wanting UTF-16 support. Going the other way matters too; if I'm on Windows I may need UTF-32 support.


Sure. That's not a problem. You can write any kind of string type you want, as a library, and convert between them. One of the nice things about Rust is that it's low-level enough that almost everything is a library anyway, so the language won't get in your way if you need SomeNicheString.


Since the full bullet point was “UTF-16 or UCS-2 support anywhere outside windows API compatibility routines” I'm assuming you'd get UTF-8 out of any high-level interface.


I'm sorry, but I have a hard time being impressed by any of those. They're all just fixing things that were busted in C.

I mean good for Rust, but C is a pretty low bar. Does any language created in the last 20 years make those same mistakes?


Rust is the first language that fixes all of these things, can be used as a replacement for all of C's use cases (including ABI-stable libraries, kernels, etc.), and has a sufficient community / mindshare that you can expect libraries to exist in the language for functionality that you generally expect to find in libraries.

Maybe we should be embarrassed as an industry that it took us 20 years to get there, yes.


Many of the things Rust fixes aren't an issue in Ada, Modula-2, Modula-3, Turbo Pascal, Delphi, Oberon, Oberon-2, Component Pascal, Eiffel, Dylan...

However, in those 20 years we saw the rise of UNIX in the industry, and with it C and C++, to the point that those other languages lost mindshare and are only known to those who experienced the IT world before C and C++ got widespread.

So new generations have to re-learn that system programming without the C and C++ way of doing things is possible.


If Rust lacks 'goto', it is not well suited as a target for translating SSA into, which most compilers naturally produce (LLVM included, obviously). I.e., it is not suitable for use in a compiler backend. Here, it can never fully replace C.

'goto' should not be demonized; we know better today.


I don't follow, why would SSA need to be translated into an input language?

LLVM used to have a C backend, yes, but that hasn't been maintained for a while now.


> I don't follow, why would SSA need to be translated into an input language?

Rust is a meta-language with very powerful macros in it.

What is a macro? A macro, essentially, is a compiler. You can have petty macros implementing tiny bits of syntax sugar on top of your language, and that's totally fine, but that's not their real purpose.

The real metaprogramming kicks in when you implement very high-level eDSLs on top of your macros. And for this sort of thing, having a proper compilation pipeline is a must. Many eDSLs may end up being represented as SSA. E.g., I'm doing this with a Packrat eDSL: it pays to represent its intermediate language as SSA for doing some high-level optimisations before spitting out the host-language code.
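As a toy illustration of "a macro is a small compiler", here is a `macro_rules!` sketch that translates a postfix (RPN) arithmetic DSL into ordinary Rust expressions at expansion time; a real eDSL pipeline (a Packrat parser generator, say) would of course be far more elaborate:

```rust
// Toy compiler-in-a-macro: the internal @stack rules keep a value
// stack of parenthesized sub-expressions while consuming the postfix
// input token by token.
macro_rules! rpn {
    // Done: a single expression remains on the stack.
    (@stack ($r:expr)) => { $r };
    // Operators pop two operands and push the combined expression.
    (@stack ($a:expr, $b:expr $(, $rest:expr)*) + $($tail:tt)*) => {
        rpn!(@stack (($b + $a) $(, $rest)*) $($tail)*)
    };
    (@stack ($a:expr, $b:expr $(, $rest:expr)*) * $($tail:tt)*) => {
        rpn!(@stack (($b * $a) $(, $rest)*) $($tail)*)
    };
    // Literals are pushed onto the stack.
    (@stack ($($stack:expr),*) $num:literal $($tail:tt)*) => {
        rpn!(@stack ($num $(, $stack)*) $($tail)*)
    };
    // Entry point: start with an empty stack.
    ($($tokens:tt)*) => { rpn!(@stack () $($tokens)*) };
}
```

So `rpn!(2 3 + 4 *)` expands, entirely at compile time, to the expression `((2 + 3) * 4)`.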


The compiler implementing the macros (a form of compile-time meta-programming) should be the same compiler as that for the rest of the language.


> should be the same compiler as that for the rest of the language

Sorry, I do not understand what you are talking about.

Of course it is the same host compiler. But every macro itself is a small compiler. Sometimes, when your DSL is an elaborate, complex thing, the macro itself is a complicated, big compiler.

With all the bells and whistles of a big compiler: multiple stages, multiple intermediate representations, all that stuff. And SSA is one of the most efficient intermediate representations ever, suitable for a huge number of semantic classes of DSLs. So making it harder to generate host-language code out of an intermediate, in-macro representation serves no purpose; it's plainly counterproductive.


    But, every macro itself is a small compiler.
I'm not following you here. What does that mean? The macro gets expanded at compile-time into 'normal' source code, by the compiler.


A compiler takes text in one language and translates it into text in another, executable language.


> I'm not following you here.

Any marginally non-trivial macro is a compiler, by definition.

> The macro gets expanded at compile-time into 'normal' source code, by the compiler.

A macro expands the DSL inside it into the underlying host meta-language code. In other words, it compiles that DSL into the host language.

For example, consider a macro which defines a parser. Inside it is a BNF (or PEG) grammar, a very high-level language. The macro must compile this language into Rust, and, since it is a very high-level language, there are tons of optimisation opportunities that would be totally missed by the underlying Rust and LLVM, because they lack this domain-specific knowledge.

Your macro will compile this source DSL in multiple stages (because this is the only sane approach to compilation anyway; read about the Nanopass framework for details). First it will operate at the AST level: do some analysis, error reporting, and inlining, and perhaps annotate the detected left-recursive nodes and binary-expression nodes. Then you lower it down into the trivial Packrat building blocks, still a tree. But now you notice that there are a lot of redundant reads from the input stream, and if you flatten this tree into SSA you can optimise them all away.

Alas, after such an optimisation you'd have to promote it back to a tree in order to generate Rust, because there is no `goto`.

And this is only one trivial example. In my practice there were dozens of such DSLs. For example, the same story goes for an optimising WAM-based embedded Prolog DSL, multiple query DSLs, and tree-walking DSLs (which are essential for implementing macros efficiently).
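(One could instead emit the CFG as a `loop`/`match` state machine, one arm per basic block, though that obscures structure the optimiser might want. A sketch, with a made-up three-block graph computing a running sum purely to show the shape of the generated code:)

```rust
// Emitting a CFG without goto: one enum variant per basic block and a
// loop/match dispatcher. Block names and the graph are illustrative.
fn sum_to(mut n: u32) -> u32 {
    enum Block { Entry, Body, Exit }
    let mut acc = 0;
    let mut block = Block::Entry;
    loop {
        block = match block {
            Block::Entry => Block::Body,      // unconditional jump
            Block::Body => {
                if n == 0 {
                    Block::Exit               // conditional jump
                } else {
                    acc += n;
                    n -= 1;
                    Block::Body               // back edge
                }
            }
            Block::Exit => return acc,
        };
    }
}
```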


Given that the reference Rust implementation currently outputs LLVM IR, and LLVM IR is well-suited as a thing to translate compiler output to, I sort of think that you could solve that by just having new compilers use LLVM as the backend, directly. The way to make Rust support a new platform is to make LLVM support it, after all.

(But you're right, "replacement for all of C's use cases" was strictly incorrect. I'd argue that this is a bad use of C, unlike e.g. kernels or bootloaders, but that's a matter of opinion.)


Rust only targets LLVM IR, so its chances of fully replacing C are nil anyway.


Nothing about the language itself requires targeting LLVM IR, that's just what the (currently only) implementation targets.

But either way, what about targeting LLVM IR makes it unable to replace C? Portability?



