High availability load balancing with HAProxy and CARP on FreeBSD (opens in new tab)

(techbar.me)

44 pointskomljen13y ago21 comments

21 comments

17 comments · 4 top-level

smallegan13y ago· 4 in thread

Is this load balancing or just a redundant online backup? The way I read this was that all traffic goes to one server until it fails and then it flows to the backup. If this is the case and you got a huge influx of traffic it seems like it would crash server 1 and then 2. Or am I misunderstanding it?

stevekemp13y ago

You're slightly misunderstanding things.

The article describes two things:

1. Having a virtual IP which can move around between two physical hosts, such that it is "always available". (It will clearly go away if both hosts crash).

2. Using HAProxy to route incoming requests, from the single virtual IP, to both back-end webservers.

This means in the expected & typical scenario where both hosts are online both webservers will handle half the load. When one host fails the other will handle all traffic.

komljenOP13y ago

Ok, if both load balancers go away it will crash. When master or backup is up, both webservers in background can take all load (100%), because just one loadbalancer will work in one time.

druiid13y ago

This is both load balancing and a redundant load balancer.

Traffic goes to all the backend web-servers under this scenario, through a single load-balancer device/server.

If that device goes down the secondary takes over.

While if you have millions of requests/sec this scenario wouldn't be quite enough for you, using haproxy with carp/ucarp/keepalived will get you at least 100k connections or more. If you need millions, then you have other problems to worry about as well :).

komljenOP13y ago

Basically this will not increase performance, just availability as the main problem with load balancing is that it becomes a SPOF, if it goes down everything is down no matter how powerful web servers you have in background. What will happen if one of those gets huge amount of traffic... I think that it will not go to the backup as CARP should work. But yes it would be nice to test that.

zaphoyd13y ago· 4 in thread

Does anyone know how CARP on FreeBSD compares to keepalived on Linux?

stevekemp13y ago

CARP compares well to UCARP on Linux, as I recently documented here:

* http://www.debian-administration.org/article/678/Virtual_IP_...

komljenOP13y ago

Thanks, I didn't know about UCARP

komljenOP13y ago

I didn't try keepalived on Linux (I tried it on FreeBSD) so I'm not right person to say is it better or not. Only I can say is that CARP performed really well from my perspective. I'm also interested in comparison of those two...

justincormack13y ago

Keepalived implements vrrp, which seems to have patent issues according to Wikipedia https://en.wikipedia.org/wiki/Common_Address_Redundancy_Prot...

rcoder13y ago· 3 in thread

A number of years ago, I set up a similar configuration with two OpenBSD boxes (which support PF and CARP out of the box) running Apache + mod_proxy_balancer for really granular load balancing and routing. It was a super-flexible and cheap way to route traffic to a mid-sized app server cluster, and generally worked really well.

We did have some network issues after the new topology went live that we unexpectedly tracked down to the LB pair, for one simple reason: CARP generates a lot of multicast traffic. Depending on how your hosts and network are configured, this can easily get routed out into a fairly large portion of your local network, and use a lot of bandwidth/router capacity with no benefit.

So, if you're setting up a CARP pair/cluster of your own, pay close attention to your multicast setup. Ideally, put the CARP multicast traffic on a dedicated subnet and watch your router and switch stats to make sure you aren't flooding the rest of your network with multicast spam.

INTPenis13y ago

Also if you're setting up in a virtual environment keep in mind that it requires promiscuous mode enabled.

This stopped the show for me once when setting up two OpenBSD load balancers in a shared virtual environment. I was told that to enable promiscuous mode on a single port group they would also have to enable it on the physical ESX host adapter for each ESX host since the pair was separated on different physical hosts.

If that is true then I would never enable it. However networking isn't my strong side so I can't verify this.

druiid13y ago

Well generally you're going to be putting multicast traffic on a default subnet of something like 224.0.0.0/4.

It's pretty easy to filter that out using iptables as well. I know for keepalived, you're generally only sending out notifications every second or two. Anything higher than that and yeah, you could end up generating some crazy traffic.

Here's a typical tcpdump of an advertisement for the curious: 172.20.1.xx > 224.0.0.18: AH(xxxxx): VRRPv2, Advertisement, vrid 51, prio 104, authtype ah, intvl 1s, length 20, addrs: 172.20.1.xx

komljenOP13y ago

Thanks, you are right about multicast traffic which I totally forgot to mention, I will have that in mind...

druiid13y ago· 2 in thread

Carp/ucarp are pretty fun. Additionally keepalived in linux implements vrrp and some additional nice features such as scripting what to do on failover of resources.

An addendum to this guide might be to add in connection tracking across the master/slave nodes. In *BSD this is implemented with pfsync. In linux it would be iptables connection tracking.

More on pfsync: http://www.openbsd.org/faq/pf/carp.html#pfsyncop Linux conntrackd: http://conntrack-tools.netfilter.org/

Edit: Oh, and everyone always forgets about using LVS for load-balancing and failover. There's endless documentation on the web about that, and it's not a proxy service like haproxy (which is both good and bad).

komljenOP13y ago

You are right, connection tracking is must have in production environments, I will try both pfsync and iptables. Thanks

druiid13y ago

Well pfsync is only for freebsd pf. If you use linux instead then you'll be using iptables + conntrackd.

They're both powerful systems with a different way of thinking about the idea of packet filtering and mangling!

1 more reply

j / k navigate · click thread line to collapse

21 comments

17 comments · 4 top-level

smallegan13y ago· 4 in thread

stevekemp13y ago

You're slightly misunderstanding things.

The article describes two things:

1. Having a virtual IP which can move around between two physical hosts, such that it is "always available". (It will clearly go away if both hosts crash).

2. Using HAProxy to route incoming requests, from the single virtual IP, to both back-end webservers.

This means in the expected & typical scenario where both hosts are online both webservers will handle half the load. When one host fails the other will handle all traffic.

komljenOP13y ago

Ok, if both load balancers go away it will crash. When master or backup is up, both webservers in background can take all load (100%), because just one loadbalancer will work in one time.

druiid13y ago

This is both load balancing and a redundant load balancer.

Traffic goes to all the backend web-servers under this scenario, through a single load-balancer device/server.

If that device goes down the secondary takes over.

komljenOP13y ago

zaphoyd13y ago· 4 in thread

Does anyone know how CARP on FreeBSD compares to keepalived on Linux?

stevekemp13y ago

CARP compares well to UCARP on Linux, as I recently documented here:

* http://www.debian-administration.org/article/678/Virtual_IP_...

komljenOP13y ago

Thanks, I didn't know about UCARP

komljenOP13y ago

justincormack13y ago

Keepalived implements vrrp, which seems to have patent issues according to Wikipedia https://en.wikipedia.org/wiki/Common_Address_Redundancy_Prot...

rcoder13y ago· 3 in thread

INTPenis13y ago

Also if you're setting up in a virtual environment keep in mind that it requires promiscuous mode enabled.

If that is true then I would never enable it. However networking isn't my strong side so I can't verify this.

druiid13y ago

Well generally you're going to be putting multicast traffic on a default subnet of something like 224.0.0.0/4.

Here's a typical tcpdump of an advertisement for the curious: 172.20.1.xx > 224.0.0.18: AH(xxxxx): VRRPv2, Advertisement, vrid 51, prio 104, authtype ah, intvl 1s, length 20, addrs: 172.20.1.xx

komljenOP13y ago

Thanks, you are right about multicast traffic which I totally forgot to mention, I will have that in mind...

druiid13y ago· 2 in thread

Carp/ucarp are pretty fun. Additionally keepalived in linux implements vrrp and some additional nice features such as scripting what to do on failover of resources.

An addendum to this guide might be to add in connection tracking across the master/slave nodes. In *BSD this is implemented with pfsync. In linux it would be iptables connection tracking.

More on pfsync: http://www.openbsd.org/faq/pf/carp.html#pfsyncop Linux conntrackd: http://conntrack-tools.netfilter.org/

komljenOP13y ago

You are right, connection tracking is must have in production environments, I will try both pfsync and iptables. Thanks

druiid13y ago

Well pfsync is only for freebsd pf. If you use linux instead then you'll be using iptables + conntrackd.

They're both powerful systems with a different way of thinking about the idea of packet filtering and mangling!

1 more reply

j / k navigate · click thread line to collapse