While this is fascinating, and Willy is brilliant as always, I always wondered why HAProxy couldn't just, you know, reload the config.
Surely you don't need to fork: Just parse the new config, create the necessary internal data structures, and let traffic flow into the new ruleset while keeping all the sockets (except for those that are superfluous, and of course let in-flight requests finish). Is it because HAProxy's internals weren't designed to do that and that it would be too big of a rewrite?
I always found Varnish's design very cool: It compiles the configuration (which is a DSL called VCL) to C and loads it as a dynamically loaded library. I don't know how it does hot reloads, but I believe it does do them seamlessly.
The post kinda touches on this, and makes it clear that config changes aren't the only situation where this would come in handy -- software updates are a big reason too.
"Service upgrades are even more important because, while some products are designed from the ground up to support config updates without having to be reloaded, they cannot be upgraded at all without a stop/start sequence. Consequently, admins leave bogus versions of those software components running in production because it is never the right time to do it."
And if you have restarts without downtime, there is no need for configuration reloads anymore. Why solve the same problem in two ways? It would just increase the chance of bugs.
All in all, HAProxy is a brilliant piece of software and Willy is running it in an exemplary way. Kudos!
I mean, "apachectl graceful" has existed for 20+ years. Sending a graceful-restart signal (USR1) to Apache will cause it to re-read its config, close and re-open log files, and signal all children to exit after their current work is finished. If they have no work, they die immediately. New children are created under a master process that has the new configuration.
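For the record, the mechanism above boils down to one signal to the master process. A minimal sketch (the pid-file path is an assumption and varies by distro):

```shell
# Trigger Apache's graceful restart by signalling the master directly.
graceful() {
    pidfile=${1:-/var/run/httpd.pid}   # assumed path; Debian uses /var/run/apache2/apache2.pid
    if [ -r "$pidfile" ]; then
        kill -USR1 "$(cat "$pidfile")"   # equivalent to: apachectl graceful
    else
        echo "no running httpd (no pid file at $pidfile)"
        return 1
    fi
}
```

The master re-reads the config and replaces children as they drain, so listening sockets stay open throughout.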
I know that HAProxy is not an apples to apples comparison with something like apache, but I don't see how something similar would be a huge burden to add.
A multi-protocol (way more than http) load balancer with health checking just has more state than a web server. Check bug history for apache, and you'll see that modules that deal with state, like mod_security, have had issues with graceful reload in the past.
It's not that surprising to me, especially given that haproxy is 17 years old. Expectations of a load balancer were pretty light when it was invented, so the internals weren't built for hot reload.
If you can restart without downtime you don't need a hot config reload. Suddenly you can treat config as immutable and discard an entire category of bugs as well.
> Is it because HAProxy's internals weren't designed to do that and that it would be too big of a rewrite?
Bingo. The internals will eventually be rewritten to support things like this (and I think that's what Willy was hinting at towards the end of the article) in time. But it's a big project, and it's a complicated piece of software, and there are a lot of conflicting demands on dev time.
That said: it's a welcoming community, and if you want to help... do it!
>>> I always wondered why HAProxy couldn't just, you know, reload the config.
It reloads just fine, same as all software.
Or if you prefer the pessimistic version: HAProxy, nginx, Apache and Varnish all suck at reloading configurations.
The difference is that HAProxy 1) tests it and 2) has a TCP mode.
To quote the issue: they manage to hit 1 error per 40,000 connections... but only when pinning to specific CPUs (typical in high-performance environments to achieve 100% usage of all cores) while doing 10 reloads per second and handling 80k new connections per second.
Do you think any of Apache/nginx/Varnish would do better than that in those circumstances? If you do, you are not being very realistic ;)
I love HAProxy so much. In our architecture, it started with simple frontend load balancing, but it ended up mediating almost every inter-server communication, which gave us a great amount of flexibility in swapping machines in and out, by giving each service load balanced virtual ip addresses. Thanks for your great work, Willy.
I wonder, how do nginx and HAProxy handle long-lived/persistent connections? The connection itself can't be terminated, since there is an actual client with an established connection to a backend underneath. Will the reload fail? (Something like "connection termination timeout; can't reload; try later".) For web workers we probably won't see this; most of the time the connection is terminated once the request is done.
Both NGINX and HAProxy will hang around for as long as the connection is open (up to the timeout). It's actually quite an issue when you're rapidly reloading either proxy (you can run out of memory reasonably easily), but most services that have long lived TCP connections also handle resets reasonably well so you can typically just kill the old proxies and it'll be ok.
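On the HAProxy side there is a global directive for bounding exactly this pile-up of draining processes; a minimal fragment (the 30s value is just an example):

```
global
    # Force old, draining processes to exit at most 30s after a reload,
    # killing whatever connections are still open at that point.
    hard-stop-after 30s
```

This trades a hard reset of long-idle connections for a cap on how many old processes (and how much memory) can accumulate under rapid reloads.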
I've used HAProxy on and off throughout a lot of my career. I'm currently using it at my company as a way for services to talk to each other without specifically knowing who is where. I wouldn't call it "microservices" but probably similar: each server has HAProxy on it, and Ansible creates the HAProxy config/hosts file so that, say, a worker server can grab http://lb-api:6666/some/resource. lb-api is a host that routes to 127.0.0.1 and HAProxy runs on port 6666 locally, parses the "lb-api" host, and routes the request to one of the servers in the "api" group. Any time we change any servers, we just run our haproxy playbook and everything just flows.
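A hedged sketch of the per-host sidecar described above (the "lb-api" name and port 6666 come from the comment; the backend addresses are made up, and in practice Ansible would template the server lines):

```
listen lb-api
    bind 127.0.0.1:6666
    mode http
    balance roundrobin
    # One line per machine in the "api" group, generated by the playbook.
    server api1 10.0.0.11:8080 check
    server api2 10.0.0.12:8080 check
```

Callers only ever see lb-api:6666; swapping machines is just regenerating this section and reloading.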
As always, HAProxy is one of the few pieces of our infrastructure that "just works" day after day.
I highly recommend this tool. Yelp has used it in production for years to manage a fairly large PaaS (hundreds of services, thousands of containers, constant churn); it's proven quite flexible and resilient.
Synapse is available on github [0], and we've open sourced our automation used to create a highly available service router using Synapse as well [1][2].
Really appreciate not only the work put into Synapse, but also the release and maintenance of it. It's been very helpful for projects where I've used it.
This is great. We use haproxy at my work and I like it, it does its job, but quirks like DNS resolution only at startup, having to reload on config changes, and the lack of seamless reloading stop me from loving it.
It still requires explicit action. However, the old way had a little dance between the old process and the new process: the new process tells the old process to start shutting down, the old process stops listening for new connections, then the new process starts listening for new connections. That left a gap where connections got rejected.
The new technique is for the old process to use a Unix socket to seamlessly transfer ownership of the listening sockets to the new process. At no point are the listening sockets closed, so no connections are rejected.
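The underlying mechanism is ordinary SCM_RIGHTS file-descriptor passing. A minimal sketch (not HAProxy's actual code) in one process standing in for both sides, using a socketpair where HAProxy uses its stats socket:

```python
import socket

# "Old process": owns a listening TCP socket on an arbitrary free port.
listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
listener.bind(("127.0.0.1", 0))
listener.listen()

# A connected Unix socket pair stands in for the stats socket here.
old_end, new_end = socket.socketpair(socket.AF_UNIX, socket.SOCK_STREAM)

# Old process sends the fd; the kernel duplicates it for the receiver.
socket.send_fds(old_end, [b"take it"], [listener.fileno()])

# "New process": receives a fresh fd referring to the SAME open socket,
# so the listen queue is never closed and no connection is rejected.
msg, fds, _flags, _addr = socket.recv_fds(new_end, 1024, 1)
inherited = socket.socket(fileno=fds[0])
print(inherited.getsockname() == listener.getsockname())
```

Because both fds reference one kernel socket object, connections queued while the handover happens are simply accepted by whichever process reads them next.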
It's still a (potentially) new haproxy binary starting up and parsing the (potentially) changed haproxy config because the user requested a graceful restart.
The new process listens for connections before the old process stops listening. The problem is that the old process can still have new connections queued up; they are lost when its sockets are closed.
I, too, am wondering about that. The only alternative to explicit reloading I can see is reloading automatically on every file change, which would mean everything breaks if I save before everything is ready. I am perplexed.
It certainly does not automatically reload on configuration file change.
This simply means you can have hitless reloads - change your configuration, reload HAProxy, and you will drop zero incoming connections during the reload time. Other methods previously existed to do this without having to first drain traffic, but they were both unwieldy and still tended to have a performance impact.
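For reference, the seamless mechanism comes down to one config directive plus one flag on the reload command; the paths below are conventional, adjust to your setup:

```
global
    # Let a newly started process fetch the listening fds over this socket.
    stats socket /run/haproxy/admin.sock mode 600 level admin expose-fd listeners

# Reload: the new process grabs the fds via -x, then tells the old
# workers (-sf) to finish their in-flight connections and exit.
#   haproxy -f /etc/haproxy/haproxy.cfg -x /run/haproxy/admin.sock \
#           -sf $(cat /run/haproxy.pid)
```

Without "expose-fd listeners" and "-x", "-sf" alone gives the old close/re-bind dance that could drop queued connections.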
I'm so excited about this. We just finished rolling out a new seamless strategy involving pairing NGINX with HAProxy which I am almost done with the blog post for, but I envision this making our solution even simpler in the future when it hits stable branches.
All of your suggestions are more complicated than what you're suggesting replacing. People chain simple commands together because pipelines are a language, and that matches how they think about the problem. They're solving it with simple commands and pipes; you're trying to solve it with regexes and as few commands as possible. All ways are valid, but specific commands tend to be easier to remember on the fly than doing it all with sed and regexes. I use sed when I want to edit streams, not when I want to filter them. tr is a simpler replace than a sed regex.
It's not silly at all, it's simple; there's many ways to accomplish something and knowing a shorter more precise way to do something doesn't make the longer simple ways silly. The author didn't know about pgrep, so he used what he did know about, ps and grep, nothing remotely silly about that; it's pragmatic.
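A toy illustration of the tr-vs-sed point: both do the same substitution, one with a purpose-built tool, one with a regex.

```shell
# Replace spaces with underscores, two ways.
echo "error: disk full" | tr ' ' '_'       # character-for-character translate
echo "error: disk full" | sed 's/ /_/g'    # same result via a regex substitution
```

Both print `error:_disk_full`; which one you reach for is mostly about which vocabulary you think in.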
As a comparison, nginx -- which is a quite different bit of software but is sometimes used in a similar fashion to haproxy -- can be gracefully restarted several times per second with no issues.
What do you mean by "with no issues"? Do you want to say that it doesn't drop connections? If that is the case, I would be curious to know more - but cursory search doesn't support this claim. [0]
If you mean something else then HAproxy also supports "graceful restarts several times per second with no issues". But the article is not talking about that.
Actually, the link I posted dealt with reloading config, not restarting (which is something completely different - you can't upgrade the binary that way). But broken clients are everywhere, so you can't discount them. And persistent connections too, while we're at it.
I did however find instructions how to properly restart Nginx without dropping connections: https://www.digitalocean.com/community/tutorials/how-to-upgr...
Apparently it can be done (and could even be automated), but the procedure looks very generic to me (not nginx-specific). Is this what you did?
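The nginx side of that procedure is its documented signal sequence for a binary upgrade. A dry-run sketch (the run() wrapper only echoes the commands; the pid-file path is an assumption):

```shell
# Echo the nginx binary-upgrade signal dance instead of executing it.
run() { echo "+ $*"; }
OLD_PID=$(cat /run/nginx.pid 2>/dev/null || echo 12345)  # placeholder pid if no nginx

run kill -USR2 "$OLD_PID"   # old master starts a second master from the new binary
run kill -WINCH "$OLD_PID"  # old master asks its workers to finish and exit
run kill -QUIT "$OLD_PID"   # old master exits; the new one keeps the sockets
```

It is generic in spirit (master/worker handoff), but the USR2/WINCH/QUIT semantics are nginx-specific.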
"which is a quite different bit of software but is sometimes used in a similar fashion to haproxy"
You put your finger on it right there - it's quite different. For example, can nginx tunnel an RDP session? Of course not - nginx is a web server (proxy), etc. They have overlapping use cases, but there are quite a few bits outside the intersection of their capabilities. Even now I am mentally writing a haproxy.conf to serve web content. The static bit is easy, but I'll stick to using nginx or Apache for what they are good at.
Obviously I wouldn't dream of letting IIS accept external inbound connections unless mediated via HAProxy ...
Nginx supports L4 (TCP/UDP) proxying just fine, just like HAProxy [1]. Nginx's HTTP(S) proxy capabilities are extensive. Not as extensive as HAProxy, but definitely close. (I don't know anything about RDP, I guess it's a binary protocol that HAProxy doesn't know anything about?)