How Discord Resizes 150M Images Every Day with Go and C++

(blog.discordapp.com)

303 points · b1naryth1ef · 8y ago · 145 comments

85 comments · 14 top-level

Xeoncross8y ago· 28 in thread

Why don't more companies resize images client-side first using <canvas> and then save the server some work by only asking it to verify the result by

- resizing to the same size

- removing metadata

This results in much faster transfer (10x less bandwidth used often for mobile uploads) and reduces server load by "farming" out the work to the clients.

https://developer.mozilla.org/en-US/docs/Web/API/CanvasRende...

# Edit: On Keeping Full Resolution Images

Some people mention having original highest-resolution images are important. I don't think that is true for most applications.

Most apps don't need hi-resolution history as much as current, live engagement so older photos being smaller isn't a big deal. As technology moves on you simply start allowing higher-res uploads. Youtube, facebook, and others have done this fine as the older stuff is replaced with the new/current/now() content.

In fact, even our highest resolution images are still low-quality for the future. Pick a good max size for your site (4k?) and resize everything down to that. In a year, bump it up to 6k, then 10k, etc...

Keeping costs low has its benefits, especially for us startups. Now if you have massive collateral, then knock yourself out.
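
A minimal sketch of the "pick a max size and resize everything down to that" rule from the comment above. The function name `fit_within` and the choice of 3840 pixels as "4k" are assumptions made for illustration, not something the thread specifies. It only computes target dimensions; the actual pixel resampling is left to whatever image library is in use.

```python
from typing import Tuple


def fit_within(width: int, height: int, max_side: int) -> Tuple[int, int]:
    """Scale (width, height) down so the longer side is at most max_side.

    Images already within the limit are returned unchanged (never upscaled).
    """
    if width <= 0 or height <= 0 or max_side <= 0:
        raise ValueError("dimensions must be positive")
    longest = max(width, height)
    if longest <= max_side:
        return width, height
    scale = max_side / longest
    # Round to the nearest pixel, but never collapse a side to zero.
    return max(1, round(width * scale)), max(1, round(height * scale))


# Hypothetical 8000x6000 upload capped at an assumed "4k" limit of 3840 px.
print(fit_within(8000, 6000, 3840))
```

Bumping the limit later, as the comment suggests, only means changing `max_side`; images stored under the old limit stay at their old size.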

AndrewStephens8y ago

A few reasons:

1) Although the site serves up images at 1024 pixels (or whatever) today, in the future they may want larger images. When everyone is rocking 10K monitors and 6K phone displays, those small images are going to look pretty bad.

2) The original image has some metadata that they want to keep (geolocation, etc).

3) They think they can do a better and more consistent job resizing than the various browsers, which is probably true.

malux858y ago

Agree on 3): most browsers just use linear interpolation when resizing images, which makes sense from a performance point of view but looks terrible. Better to use a bi-linear or cubic resize: more computing up front, but better images. This is probably the reason they do it.

4 more replies

SaltyBackendGuy8y ago

> 2) The original image has some metadata that they want to keep (geolocation, etc).

Isn't exif data something you should strip out?

1 more reply

Moru8y ago

Our site would have been happy with full res images from the start. As it is now we are stuck with 80x80 images that need replacing with higher res images, since the originals were not kept in any sorted order.

1 more reply

Lutin8y ago

This is for proxying images that users link in chat, not for when users upload images to the service. It doesn't make sense to talk about doing this resize on the client, as the client doesn't have the image.

TrinaryWorksToo8y ago

That is a great point. It could still be feasible to cache the image on the client, and have it do the resize. Although I probably wouldn't accept it as a client, especially if my internet cuts out halfway through.

Xeoncross8y ago

Yes, the use-case of proxying images is a different matter. I was talking about client uploads since so many companies seem determined to waste my bandwidth and time uploading without resizing first.

1 more reply

b1naryth1efOP8y ago

As mentioned in the post, one of our core product features is preventing your IP from being shared. Given that requirement, images shared in chat have to be proxied through our infrastructure. When doing this we save a lot of money and improve client performance by reducing image sizes.

Xeoncross8y ago

So why can't the image first be resized/compressed before being sent through your infrastructure...?

2 more replies

pmelendez8y ago

From the notes of your link:

"drawImage() will ignore all EXIF metadata in images, including the Orientation. This behavior is espacially troublesome on iOS devices. You should detect the Orientation yourself and use rotate() to make it right. "

If the origin of the image is the client and you got the client-side resize wrong, then you might introduce artifacts when trying to fix it on the server because of the data loss. Also, if clients are mobile, you might prefer to optimize for the clients' battery instead of computing time on the server.
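
To make the orientation issue concrete, here is a small sketch of what "detect the Orientation yourself" amounts to for the four non-mirrored values of the EXIF Orientation tag. The names `ROTATION_CW_DEGREES` and `upright_size` are made up for this example, and reading the tag out of the file is assumed to happen elsewhere; the mirrored values (2, 4, 5, 7) are deliberately left out.

```python
# EXIF Orientation tag values for the four non-mirrored cases, mapped to the
# clockwise rotation needed to display the stored pixels upright.
ROTATION_CW_DEGREES = {1: 0, 3: 180, 6: 90, 8: 270}


def upright_size(width, height, orientation):
    """Return (width, height) of the image once the EXIF rotation is applied."""
    degrees = ROTATION_CW_DEGREES.get(orientation)
    if degrees is None:
        raise ValueError(f"unsupported or mirrored orientation: {orientation}")
    # A quarter turn in either direction swaps width and height.
    return (height, width) if degrees in (90, 270) else (width, height)


# A phone photo stored as 4032x3024 with Orientation 6 displays as portrait.
print(upright_size(4032, 3024, 6))
```

A resize step that ignores the tag would compute its target dimensions from the stored (unrotated) size, which is one way a client-side resize can end up "wrong" in the sense described above.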

Matt3o12_8y ago

The question now is what drains more power: sending the image as-is, or resizing it before sending. If the user has a good WiFi or 4G connection, sending the file as it is should be quicker and more energy efficient. With a 2G or 3G connection, uploading a photograph can take significantly longer (1min on poor/average 3G, which means the antenna is working for that duration and draws a lot of battery). Converting it should not take more than a second. Furthermore, I would prefer using less data over saving a tiny amount of battery.

1 more reply

Xeoncross8y ago

Even several years ago there were libraries on github that accounted for iOS defects. However, that aside, just skip the resize on iOS and send as-is. The server still has to verify the result anyway.

jlg238y ago

First: Please don't use "Edit" for responding to responses to your comment; it makes following threads much, much harder.

On Topic:

> Some people mention having original highest-resolution images are important. I don't think that is true for most applications.

It is true for every application when the next generation of displays hits the market. The question is not the long term usability of our current low-res images but just the migration to the next step. At the moment Acorn announces their new APhone and has a million handsets sold by tomorrow, you want your service to deliver at least viewable images. It's not always the app that sets the bar, sometimes it is the device.

Edit: As someone who regularly travels to rather remote places of this planet, I'm grateful for every app that does not put the burden on the client. My battery packs only last so long.

antisthenes8y ago

The last thing I want is client-side image resizing when my browser is already choking on heavy javascript.

timdorr8y ago

These images are often links from the web or posted by a bot, so they're not on a client until after they've been served.

codedokode8y ago

As I understand they needed to download images from any external servers using an URL. I am not sure if that is possible even with CORS.

Also they don't save preview images and generate them when needed as I understood. So what you are suggesting requires a lot of disk space to keep thumbnails that might be never needed later.

And if you don't have millions of uploads per day then it makes no sense trying to save some seconds of CPU time by unnecessarily complicating the system. In most languages there already are libraries for resizing images.

megy8y ago

I have no idea how what you are saying relates to compressing images before they are sent?

jasonlotito8y ago

Many do resizing initially, but even when resizing, you still need to resize images for different reasons, such as thumbnails. So what you need to do is resize on client as low as you are willing to go, and then upload that. But you still need to resize for different needs. You don't want the client doing multiple resizes and uploads for that.

CamTin8y ago

Because then you don't get to build out fun infrastructure like this and write it up in your company blog.

fleitz8y ago

Agreed. I wish the posts contained a "it cost X developer hours to recreate thumbor or $$$ total, and we saved Y dollars per month" meaning in approximately 15 years we'll have broken even on this investment. Oh yeah and we don't even do intelligent resizing like thumbor does.

1 more reply

Xeoncross8y ago

Sure you do! Don't you know that adding "Free" to a title increases the ROI by 40%?

"How Discord Resizes 150M Images Every Day for Free"

batmansmk8y ago

It may work in certain cases.

But it also means you need to know how you will display when you save it. Layout changes, screens change, how do you anticipate the future dimensions / resolution you will need out of the original?

user59944618y ago

Historically, resizing images client side doesn't work because most clients are not able to render the images, let alone resize them.

The image file formats are very very complicated, many are platform specific, some are covered by patents.

For example of a common issue, another comment mentioned the rotation parameter, it's set by many cameras but the support is inconsistent.

sgk2848y ago

You often want the original because different clients may display different sizes.

dajohnson898y ago

isn't it a meme at this point, pushing computational work to the client side? I have a laptop or mobile device, please don't hog my limited cpu and battery life by forcing my device to resize images.

_Tev8y ago

Are you sure your wifi connection will not eat more power with huge upload than processor/gpu doing the scaling?

1 more reply

gsich8y ago

Mandatory XKCD: https://xkcd.com/1683/

greenleafjacob8y ago

Imgur does this.

throwthisawayt8y ago· 10 in thread

Did it seem to anyone else that sticking to Python would have been way easier? It didn’t seem like any of the performance gains were through Golang.

smaili8y ago

I believe this little piece answers your question:

> We likely could have addressed this behavior in Image Proxy, but we had been experimenting with using more Go, and it seemed like a good place to try Go out.

At the heart of it, they were looking for opportunities to use more Go in their stack and they deemed this situation a fit.

prophesi8y ago

And they ended up open-sourcing the library they built, so it's a win on all sides.

fleitz8y ago

The age old solution in search of a problem.

1 more reply

deepnotderp8y ago

Yeah, either a good Python JIT or Cython would have been fine honestly. I never understood the obsession with "python is slow" when you can recover almost all of the performance with a good JIT or Cython (in many/most cases).

Drdrdrq8y ago

Yes. Or simply profiling the app and optimizing sore spots would have helped too. It seems to me there was no real reason to move from Python to Go, apart from preference.

victor1068y ago

What are some good Python JIT's that are worth trying out?

1 more reply

detaro8y ago

I don't think the article gives us the data to know this. Where did the latency spikes in the original implementation come from? Would fixing them have required a complete rewrite of the Python parts anyways?

harikb8y ago

I understand this is a personal preference, but having spent a good amount of time with both Python and Go, FWIW I would also choose Go if I were solving the same problem.

stmw8y ago

From reading this, seems HTTP handling speed was important to them? which Go is probably better for. Also, interfacing Python to C/C++ is pretty unpleasant.

jononor8y ago

In Python they already had an extremely fast library with bindings available.

caltrops8y ago· 7 in thread

I’d be very worried about a security issue with the unsafe C++ code.

You really have to run this kind of complex parsing in a disposable containerized environment to do it safely. Or do everything carefully and in a memory safe language.

bri3d8y ago

I'm not sure why this is being downvoted - image processing is one of the most dangerous parts of a common consumer-facing web software stack. By and large this is because image container formats are poorly documented, overly broad, and rely on a lot of tricky binary parsing that's easy to mess up in an unsafe programming language. It's also one of the most obvious ingress points for untrusted binary data uploaded by an end-user, which is always going to be dangerous.

See the persistent, years-long trend where mobile devices and game consoles get exploited via some combination of libtiff and libpng.

Impossible8y ago

The downvotes are also because it's a somewhat cliche comment on HN now. Anytime anyone is doing anything with C or C++ that is even indirectly web facing, "this could be unsafe!!!" is an obligatory comment, even though all major tech companies have core components written in C++, and there are big web apps that have been running for years that are mostly written in C or C++. Security is definitely a concern, but these kinds of comments can derail interesting discussion, in the same way complaining about font readability or template choice in an otherwise interesting article can.

1 more reply

pmelendez8y ago

True (and I didn't downvote, by the way), but a "memory safe" language might not be as helpful as people might think. Most memory-managed languages still rely on native libraries to perform image processing. If at the end you are using libpng and there is an exploit in it, it doesn't matter whether you are using Python or C++: both code bases would have the same exploit if it is not explicitly mitigated in the logic.

1 more reply

GuB-428y ago

The downvote is probably because the comment implied that the issue is that the image processing is done in "unsafe" C++ and that another language should have been used.

However, there isn't much choice. Performance is very important in image processing, so much that many libraries contain hand-written assembly. In the article, it says that 90% of processing power is dedicated to it. Using a safer language in a safe way could completely kill performance and significantly increase the costs.

1 more reply

abiox8y ago

> I'm not sure why this is being downvoted

if i was a betting person, i'd wager that it may seem somewhat like "rewrite it in rust" cargo culting.

1 more reply

hemancuso8y ago

I'd love to be pointed at any resource where somebody who has spent the time walks through the best way to do this safely. Is the only way to do it safely inside a container via some networked connection? Are there other ways to lock down ImageMagick etc such that you can resize safely?

searealist8y ago

This has nothing to do with parsing.

Also, your life must be very stressful.

devwastaken8y ago· 5 in thread

How is the security? Any sort of image processing is a potential exploitation point. I see it says it uses the 'mature' libjpeg-turbo and libpng libraries, along with giflib for .gifs, but even with full trust of those, the C code, patches, and changes on top could be more exploitation points. You can look through ImageMagick alone to see all the fun things possible when seemingly basic processing turns into exploits. https://www.cvedetails.com/vulnerability-list/vendor_id-1749...

mark-r8y ago

They specifically addressed this by throwing a fuzzer at it. Of course that's to find crashes rather than exploits, but it's a good start.

Buttons8408y ago

Wow really? Is there room for another image processing library? Is ImageMagick poorly written or is image manipulation inherently risky?

bri3d8y ago

ImageMagick is notoriously questionable. It was originally written, I believe, as a local command-line tool for users to work with their own images, so security and untrusted input were not primary concerns.

Additionally, image manipulation is inherently challenging - not even due to the actual manipulation of image pixel data, but due to the proliferation of complex image container formats which require binary data manipulation and byte copying in performance-critical code. This is a minefield for secure programming practices because it puts at direct odds performance and sanity checking, as well as encouraging pointer and memory arithmetic and unsafe access.

abiox8y ago

> Is there room for another (...)

seems to me that there is no limit to available room. well, i suppose we're capped by the collective capacity of local storage and storage service providers.

tedunangst8y ago

ImageMagick is a particularly poor choice because it will try parsing a thousand formats your users will never upload. That's a lot of code to leave exposed to the internet.

manigandham8y ago· 5 in thread

Nice, but why? https://cloudinary.com, https://www.imgix.com, or https://www.filestack.com already exist and are well worth it for 99% of apps. Even at scale, it really doesn't cost that much to have someone else do it. You can use a thin proxy through your existing CDN if you want to save on their bandwidth fees.

Also http://thumbor.org and https://imageresizing.net if you want a library to host yourself which are already very fast and well tested. Put them in a docker container on a kubernetes cluster and it's all done in an hour.

zitterbewegung8y ago

Maybe it’s because they don’t want a dependency on an external service that could go down?

StreamBright8y ago

So you have an internal dependency that could go down?

1 more reply

manigandham8y ago

It's images... seems like a very low risk situation, especially when they are served from a CDN.

2 more replies

mercwear8y ago

I agree. Offloading this type of work to a third party who does it really well is a smart move. Why manage additional code when it's not even core to what you do?

bpicolo8y ago

In this case, it was perhaps cheaper for them to do in-house, and it's not rocket science? They wrote a bleeding edge library for it - sounds like they have the expertise just fine. Minimizing external dependencies can be a big deal if you have the developers to manage it.

Also, it is totally core to what they do. Images are a huge part of the Discord UX.

2 more replies

0xbear8y ago· 4 in thread

That’s 1700 images per second. Doable on one (beefy) box. 3 to account for the diurnal cycle. Am I supposed to be impressed?
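
For reference, the per-second figure follows directly from the headline number. This is only the average rate over a day; the thread's "3 boxes for the diurnal cycle" peak factor is the commenter's own estimate and is not modeled here.

```python
IMAGES_PER_DAY = 150_000_000      # from the article title
SECONDS_PER_DAY = 24 * 60 * 60    # 86,400

average_rate = IMAGES_PER_DAY / SECONDS_PER_DAY
print(f"{average_rate:.0f} images/second on average")
```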

brian-armstrong8y ago

Can you link to which resize library you're using? We'd love to see a 90% further reduction in instances

mbrumlow8y ago

Sorry to be confusing, I am not resizing images. Just working with data sets as large as what I imagine 150M images would be. The software I am working on takes point-in-time backups of computers and uploads them to "the cloud", I mean servers in a data center. There they can be virtualized with a click of a button, en masse or one at a time, and near instantly.

This involves transferring, encrypting, compressing and checksumming terabytes of data an hour (per node). While not exactly resizing images, I would imagine the computational power was on par with the service described. The entire system has about 4 PB or 8 PB in it right now, as backups are pruned (based on what people will pay for storage).

My software has a ton of space to grow and become better, but I think a better story would have been how Discord handles 150M images an hour. If anything, bandwidth for acquiring the source image would be what I would consider the largest problem, not the CPU time to resize. In fact, as long as your resize code is slightly faster than the download, streaming it in and out would put your bottleneck entirely on bandwidth.

I will also note I am not a fan of libraries :p but that is not what this is about.

EDIT:

Also kudos to you, somebody criticized your post and you had the best response one could have. Inquiring minds are awesome.

1 more reply

mbrumlow8y ago

I don't get why you are being downvoted. This is almost exactly what I thought. It's just not that much data given the state of computer hardware.

Where I work we have single nodes processing near that much data an hour -- these are beefy systems though.

0xbear8y ago

People have just drunk so much “cheap commodity hardware” kool aid by now, they don’t realize there are cheaper and easier ways of doing things now, assuming you have devs who can code and tune for performance. Same with “big data”. Most people have sub-1T datasets. You simply don’t need Spark or anything custom for that.

JepZ8y ago· 3 in thread

Does anybody know how well libvips https://github.com/DAddYE/vips compares to lilliput performance-wise?

b1naryth1efOP8y ago

vips (Go binding) is included in the benchmarks mentioned in the post, but at the time of running them (~10 months ago) vips pulled 51482954 ns/op on a 1024x1024 test image, whereas pillow-simd managed 3324135.3035 ns/op.

CapacitorSet8y ago

For ease of reading, that's respectively 51 ms and 3 ms.
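
The conversion, spelled out with the two ns/op figures quoted above (variable names are just for this example):

```python
NS_PER_MS = 1_000_000

vips_ns_per_op = 51_482_954            # vips (Go binding), 1024x1024 test image
pillow_simd_ns_per_op = 3_324_135.3035  # pillow-simd, same benchmark

vips_ms = vips_ns_per_op / NS_PER_MS
pillow_simd_ms = pillow_simd_ns_per_op / NS_PER_MS
ratio = vips_ns_per_op / pillow_simd_ns_per_op

print(f"vips: {vips_ms:.2f} ms/op, pillow-simd: {pillow_simd_ms:.2f} ms/op")
print(f"vips took roughly {ratio:.1f}x as long per operation")
```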

JepZ8y ago

Thanks :-)

Looks like I didn't scroll properly when I looked at that file. My bad :-/

kylehotchkiss8y ago· 3 in thread

I wish Cloudfront supported resize parameters so we wouldn't have to keep building these or paying a lot for Imgix.

abeach2228y ago

You can use lambda edge functions for this. They recently announced support for query string parameters.

https://aws.amazon.com/about-aws/whats-new/2017/10/lambda-at...

I have built an image resizing service around this with go and libvips. With go libvips, s3gof3r, you can load s3 images directly into a buffer, pass to libvips, and serve without writing to disk. Basically, you can use edge functions with your origin as the above go service.

fleitz8y ago

How much would you pay for an image resizing service? I'd been thinking for a while of putting a fleet of autoscaled thumbor boxes behind cloudfront and making a billing API for it.

kylehotchkiss8y ago

Imgix's $10 minimum is so much for a personal site with maybe 500 uniques a month. If you're going for a service like that, think of people like me who host on s3/cloudfront for $.20/month. But let people scale up to millions of pageviews a month.

Don't need anything fancy. Just w=? h=? would be great, developers can handle the DPI stuff with srcset attributes.

1 more reply

Const-me8y ago· 3 in thread

I wonder why people implement such things on CPU?

PCI express is ~100 gbit/sec, much faster than any network interface. Internally, a GPU can resize these images by an order of magnitude faster than that, see the fillrate columns in the GPU spec.

acdha8y ago

This isn't just resampling an image: decoding a variety of image (and even video) formats, decompressing the selected frame, performing the actual resize, and then compressing the result. If the resample doesn't save more than the setup overhead, it'd be an immediate loss. Even if it does, there's an engineering cost since you now need to make sure that all of your servers have GPUs available, your chosen implementation code supports all of them with acceptable quality and error handling, etc.

Since the GPU hardware has become commonplace, there's definitely a lot more attention on using it in the server space and I think it'll become common in the next few years but that has a migration cost for early adopters since you're hitting less mature projects for critical functions. Internet-facing image processing has a bunch of tedious but important work handling format variations and errors (it'll be reported as a bug in your software if the image opens in a browser and/or photoshop), making sure that you handle gamma/colorspace consistently, etc.

If you're trying to get production-ready server out the door, it's really tempting not to deal with any of that once you hit the point where it's fast enough that engineering time costs more than the server savings.

Const-me8y ago

> This isn't just resampling an image

GPUs can do that, too: http://fastcompression.com/products/jpeg/cuda-jpeg.htm

> you now need to make sure that all of your servers have GPUs available

OP is running on google’s cloud: “n1-standard-16 host type, peaking at 12 instances on a typical day.” That instance costs $0.76/hour. Adding NVIDIA Tesla K80 is $0.7 extra.

> it's really tempting not to deal with any of that

Yeah, that’s understandable. But the original article dealt with a lot of strange technologies to get the performance they want. And ended up doing much slower, performance wise, than what’s possible with a GPU.

2 more replies

malikNF8y ago

Most probably it's because of the time it takes to push the image onto the GPU and then back to the CPU.

sgk2848y ago· 2 in thread

There is already an (unofficial Google) image proxy written in Go that is quite fast, does caching (local or backed by S3/GCS), and does other nice things like smart cropping: https://github.com/willnorris/imageproxy

Seemed like a lot of unnecessary work for them to reimplement a service from scratch without gaining any major perf benefits over their existing one and without leaning on an existing well-known and well-built foundation.

brian-armstrong8y ago

Author of the blog post here - it looks like what you linked does its image resizing in pure Go. In our testing we found these libraries are significantly slower than the C++ resize libraries. I would guess we would need at least 10x as many instances if we used that resizer, though probably a lot more

bpicolo8y ago

https://github.com/thoas/picfit is another golang lib for this, and it's pretty mature at this point.

The one thing these don't support though is smarter cropping that takes into account image contents, which takes enough cpu power to require preprocessing

ymse8y ago· 1 in thread

This post reminded me of a very old article from Yahoo/Tumblr explaining how they were (ab)using Ceph to generate thumbnails on the fly as pictures were uploaded using the Ceph OSD plugin interface.

Unfortunately the post seems to have disappeared from the internet (it was probably around 6 years ago), so here are some other teasers:

https://yahooeng.tumblr.com/post/116391291701/yahoo-cloud-ob...

https://ceph.com/geen-categorie/dynamic-object-interfaces-wi...

Disclaimer: not affiliated with Ceph apart from being a happy sysadmin.

noahdesu8y ago

Here is a link to a talk I gave last month describing how to use Lua to generate thumbnails remotely in the Ceph/RADOS OSD servers.

Talk is from Lua workshop 2017. Relevant content begins at 15m40s.

https://youtu.be/bGQc-PpJAyk?t=15m40s

gourou8y ago

Link to the resulting open-source project:

https://github.com/discordapp/lilliput

linkmotif8y ago

> Today, Media Proxy operates with a median per-image resize of 25ms and a median total response latency of 85ms. It resizes more than 150 million images every day. Media Proxy runs on an autoscaled GCE group of n1-standard-16 host type, peaking at 12 instances on a typical day.

Awesome! <3
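
A rough back-of-the-envelope reading of the quoted figures, under loud assumptions: the 25 ms median is treated as if it were the mean, each resize is assumed to occupy one core for its full duration, and fetching, decoding outside the resize step, and HTTP handling are ignored. The 16 vCPUs per instance is the published size of the n1-standard-16 machine type.

```python
IMAGES_PER_DAY = 150_000_000
MEDIAN_RESIZE_SECONDS = 0.025   # assumption: median used as a stand-in for the mean
SECONDS_PER_DAY = 86_400
VCPUS_PER_INSTANCE = 16         # n1-standard-16
PEAK_INSTANCES = 12

resize_core_seconds_per_day = IMAGES_PER_DAY * MEDIAN_RESIZE_SECONDS
average_busy_cores = resize_core_seconds_per_day / SECONDS_PER_DAY
peak_vcpus = VCPUS_PER_INSTANCE * PEAK_INSTANCES

print(f"~{average_busy_cores:.1f} cores busy resizing on average")
print(f"{peak_vcpus} vCPUs provisioned at peak")
```

Under those assumptions the resize step alone keeps a few dozen cores busy on average, against a peak fleet of 192 vCPUs; the gap is where traffic peaks and all the non-resize work would have to fit.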

tuananh8y ago

is there any open source project img proxy that can do this?

eg: instead of this

http://localhost:8080/https://octodex.github.com/images/code...

we can create alias like octo and url will become this

http://localhost:8080/octo/images/codercat.jpg
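
A minimal sketch of the alias idea being asked about, not tied to any particular proxy project. The `ALIASES` table and the `expand_alias` function are hypothetical names for illustration; a real proxy would do this lookup before fetching the upstream image.

```python
# Hypothetical alias table: short name -> upstream base URL.
ALIASES = {"octo": "https://octodex.github.com"}


def expand_alias(request_path: str) -> str:
    """Turn '/octo/images/codercat.jpg' into the full upstream URL."""
    alias, _, rest = request_path.lstrip("/").partition("/")
    base = ALIASES.get(alias)
    if base is None:
        raise KeyError(f"unknown alias: {alias!r}")
    return f"{base}/{rest}"


print(expand_alias("/octo/images/codercat.jpg"))
```

Restricting requests to a fixed alias table also means the proxy only fetches from hosts that were configured in advance, rather than from any URL a client supplies.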

j / k navigate · click thread line to collapse

145 comments

85 comments · 14 top-level

Xeoncross8y ago· 28 in thread

Why don't more companies resize images client-side first using <canvas> and then save the server some work by only asking it to verify the result by

- resizing to the same size

- removing metadata

This results in much faster transfer (10x less bandwidth used often for mobile uploads) and reduces server load by "farming" out the work to the clients.

https://developer.mozilla.org/en-US/docs/Web/API/CanvasRende...

# Edit: On Keeping Full Resolution Images

Some people mention having original highest-resolution images are important. I don't think that is true for most applications.

Keeping costs low has it's benefits, especially for us startups. Now if you have massive collateral, then knock yourself out.

AndrewStephens8y ago

A few reasons:

2) The original image has some metadata that they want to keep (geolocation, etc).

3) They think they can do a better and more consistent job resizing than the various browsers, which is probably true.

malux858y ago

4 more replies

SaltyBackendGuy8y ago

> 2) The original image has some metadata that they want to keep (geolocation, etc).

Isn't exif data something you should strip out?

1 more reply

Moru8y ago

1 more reply

Lutin8y ago

TrinaryWorksToo8y ago

Xeoncross8y ago

Yes, the use-case of proxying images is a different mater. I was talking about client uploads since so many companies seem determined to waste my bandwidth and time uploading without resizing first.

1 more reply

b1naryth1efOP8y ago

Xeoncross8y ago

So why the image can't first be resized/compressed before being sent through your infrastructure...?

2 more replies

pmelendez8y ago

From the notes of your link:

Matt3o12_8y ago

1 more reply

Xeoncross8y ago

jlg238y ago

First: Please don't use "Edit" for responding to responses to your comment; it make following threads much, much harder.

On Topic:

> Some people mention having original highest-resolution images are important. I don't think that is true for most applications.

Edit: As someone who travels rather remote places of this planet regularly I'm grateful for every app that does not put the burden on the client. My battery packs only last so long.

antisthenes8y ago

The last thing I want is client-side image resizing when my browser is already choking on heavy javascript.

timdorr8y ago

These images are often links from the web or posted by a bot, so they're not on a client until after they've been served.

codedokode8y ago

As I understand they needed to download images from any external servers using an URL. I am not sure if that is possible even with CORS.

Also they don't save preview images and generate them when needed as I understood. So what you are suggesting requires a lot of disk space to keep thumbnails that might be never needed later.

megy8y ago

I have no idea how what you are saying related to compressing images before they are sent?

jasonlotito8y ago

CamTin8y ago

Because then you don't get to build out fun infrastructure like this and write it up in your company blog.

fleitz8y ago

1 more reply

Xeoncross8y ago

Sure you do! Don't you know that adding "Free" to a title increases the ROI by 40%?

"How Discord Resizes 150M Images Every Day for Free"

batmansmk8y ago

It may work in certain cases.

But it also means you need to know how you will display when you save it. Layout changes, screens change, how do you anticipate the future dimensions / resolution you will need out of the original?

user59944618y ago

Historically, resizing images client side doesn't work because most clients are not able to render the images, let alone resize them.

The image file formats are very very complicated, many are platform specific, some are covered by patents.

For example of a common issue, another comment mentioned the rotation parameter, it's set by many cameras but the support is inconsistent.

sgk2848y ago

You often want the original because different clients may display different sizes.

dajohnson898y ago

isn't it a meme at this point, pushing computational work to the client side? I have a laptop or mobile device, please don't hog my limited cpu and battery life by forcing my device to resize images.

_Tev8y ago

Are you sure your wifi connection will not eat more power with huge upload than processor/gpu doing the scaling?

1 more reply

gsich8y ago

Mandatory XKCD: https://xkcd.com/1683/

greenleafjacob8y ago

Imgur does this.

throwthisawayt8y ago· 10 in thread

Did it seem to anyone else that sticking to Python would have been way easier? It didn’t seem like any of the performance gains were through Golang.

smaili8y ago

I believe this little piece answers your question:

> We likely could have addressed this behavior in Image Proxy, but we had been experimenting with using more Go, and it seemed like a good place to try Go out.

At the heart of if, they were looking for opportunities to use more Go in their stack and they deemed this situation as a fit.

prophesi8y ago

And they ended up open-sourcing the library they built, so it's a win on all sides.

fleitz8y ago

The age old solution in search of a problem.

1 more reply

deepnotderp8y ago

Drdrdrq8y ago

Yes. Or simply profiling the app and optimizing sore spots would have helped too. It seems to me there was no real reason to move from Python to Go, apart from preference.

victor1068y ago

What are some good Python JIT's that are worth trying out?

1 more reply

detaro8y ago

harikb8y ago

I understand this is a personal preference, but having spent a good amount time with both Python and Go, FWIW I would also choose Go if I were solving the same problem.

stmw8y ago

From reading this, it seems HTTP handling speed was important to them, which Go is probably better for. Also, interfacing Python to C/C++ is pretty unpleasant.

jononor8y ago

In Python they already had an extremely fast library with bindings available.

caltrops8y ago· 7 in thread

I’d be very worried about a security issue with the unsafe C++ code.

You really have to run this kind of complex parsing in a disposable containerized environment to do it safely. Or do everything carefully and in a memory-safe language.

bri3d8y ago

See the persistent, years-long trend where mobile devices and game consoles get exploited via some combination of libtiff and libpng.

Impossible8y ago

pmelendez8y ago

GuB-428y ago

The downvote is probably because the comment implied that the issue is that the image processing is done in "unsafe" C++ and that another language should have been used.

abiox8y ago

> I'm not sure why this is being downvoted

if i was a betting person, i'd wager that it may seem somewhat like "rewrite it in rust" cargo culting.

hemancuso8y ago

searealist8y ago

This has nothing to do with parsing.

Also, your life must be very stressful.

devwastaken8y ago· 5 in thread

mark-r8y ago

They specifically addressed this by throwing a fuzzer at it. Of course that's to find crashes rather than exploits, but it's a good start.

Buttons8408y ago

Wow, really? Is there room for another image processing library? Is ImageMagick poorly written or is image manipulation inherently risky?

bri3d8y ago

abiox8y ago

> Is there room for another (...)

seems to me that there is no limit to available room. well, i suppose we're capped by the collective capacity of local storage and storage service providers.

tedunangst8y ago

ImageMagick is a particularly poor choice because it will try parsing a thousand formats your users will never upload. That's a lot of code to leave exposed to the internet.

manigandham8y ago· 5 in thread

zitterbewegung8y ago

Maybe it’s because they don’t want a dependency on an external service that could go down?

StreamBright8y ago

So you have an internal dependency that could go down?

manigandham8y ago

It's images... seems like a very low risk situation, especially when they are served from a CDN.

mercwear8y ago

I agree. Offloading this type of work to a third party who does it really well is a smart move. Why manage additional code when it's not even core to what you do?

bpicolo8y ago

Also, it is totally core to what they do. Images are a huge part of the Discord UX.

0xbear8y ago· 4 in thread

That’s 1700 images per second. Doable on one (beefy) box. 3 to account for the diurnal cycle. Am I supposed to be impressed?
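
The per-second figure follows from the 150M images/day in the article title. A quick sketch of the arithmetic in Go (the helper name `imagesPerSecond` is made up for illustration, and this gives only the daily average, not the peak):

```go
package main

import "fmt"

// imagesPerSecond converts a daily image count into an average per-second rate.
func imagesPerSecond(perDay float64) float64 {
	const secondsPerDay = 24 * 60 * 60 // 86,400 seconds
	return perDay / secondsPerDay
}

func main() {
	// 150M images/day is the figure from the article title.
	avg := imagesPerSecond(150e6)
	fmt.Printf("average: %.0f images/s\n", avg)
}
```

The average comes out a little above 1,700 per second; real traffic peaks higher than the average, which is what the allowance for the diurnal cycle is about.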

brian-armstrong8y ago

Can you link to which resize library you're using? We'd love to see a 90% further reduction in instances

mbrumlow8y ago

I will also note I am not a fan of libraries :p but that is not what this is about.

EDIT:

Also kudos to you, somebody criticized your post and you had the best response one could have. Inquiring minds are awesome.

mbrumlow8y ago

I don't get why you are being downvoted. This is almost exactly what I thought. It's just not that much data given the state of computer hardware.

Where I work we have single nodes processing nearly that much data an hour -- these are beefy systems though.

0xbear8y ago

JepZ8y ago· 3 in thread

Does anybody know how well libvips https://github.com/DAddYE/vips compares to lilliput performance-wise?

b1naryth1efOP8y ago

CapacitorSet8y ago

For ease of reading, that's respectively 51 ms and 3 ms.

JepZ8y ago

Thanks :-)

Looks like I didn't scroll properly when I looked at that file. My bad :-/

kylehotchkiss8y ago· 3 in thread

I wish CloudFront supported resize parameters so we wouldn't have to keep building these or paying a lot for Imgix.

abeach2228y ago

You can use Lambda@Edge functions for this. They recently announced support for query string parameters.

https://aws.amazon.com/about-aws/whats-new/2017/10/lambda-at...

fleitz8y ago

How much would you pay for an image resizing service? I'd been thinking for a while of putting a fleet of autoscaled thumbor boxes behind cloudfront and making a billing API for it.

kylehotchkiss8y ago

Don't need anything fancy. Just w=? h=? would be great; developers can handle the DPI stuff with srcset attributes.

Const-me8y ago· 3 in thread

I wonder why people implement such things on CPU?

PCI Express is ~100 Gbit/s, much faster than any network interface. Internally, a GPU can resize these images an order of magnitude faster than that; see the fillrate columns in the GPU spec.

acdha8y ago

Const-me8y ago

> This isn't just resampling an image

GPUs can do that, too: http://fastcompression.com/products/jpeg/cuda-jpeg.htm

> you now need to make sure that all of your servers have GPUs available

OP is running on Google’s cloud: “n1-standard-16 host type, peaking at 12 instances on a typical day.” That instance costs $0.76/hour. Adding an NVIDIA Tesla K80 is $0.7 extra.

> it's really tempting not to deal with any of that

malikNF8y ago

Most probably it's because of the time it takes to push the image onto the GPU and then back to the CPU.

sgk2848y ago· 2 in thread

brian-armstrong8y ago

bpicolo8y ago

https://github.com/thoas/picfit is another golang lib for this, and it's pretty mature at this point.

The one thing these don't support, though, is smarter cropping that takes image contents into account, which takes enough CPU power to require preprocessing.

ymse8y ago· 1 in thread

This post reminded me of a very old article from Yahoo/Tumblr explaining how they were (ab)using Ceph to generate thumbnails on the fly as pictures were uploaded using the Ceph OSD plugin interface.

Unfortunately the post seems to have disappeared from the internet (it was probably around 6 years ago), so here are some other teasers:

https://yahooeng.tumblr.com/post/116391291701/yahoo-cloud-ob...

https://ceph.com/geen-categorie/dynamic-object-interfaces-wi...

Disclaimer: not affiliated with Ceph apart from being a happy sysadmin.

noahdesu8y ago

Here is a link to a talk I gave last month describing how to use Lua to generate thumbnails remotely in the Ceph/RADOS OSD servers.

Talk is from Lua workshop 2017. Relevant content begins at 15m40s.

https://youtu.be/bGQc-PpJAyk?t=15m40s

gourou8y ago

Link to the resulting open-source project:

https://github.com/discordapp/lilliput

linkmotif8y ago