When Solid State Drives Are Not That Solid (opens in new tab)

(blog.algolia.com)

302 pointsShipow11y ago118 comments

118 comments

85 comments · 26 top-level

ploxiln11y ago· 8 in thread

Originally TRIM was an un-queued command; all writes had to be flushed, then TRIM executed, then writes could continue. This was bad for performance with automatic on-file-delete trim, so everyone wanted a trim command that could be put in the command queue along with writes. Many new drives have this.

It turns out that Samsung 8XX SSDs advertise they support queued trim but it's buggy. The old TRIM command works fine.

https://lkml.org/lkml/2015/6/10/642

There are in fact lots of "quirks lists" and "blacklists" in the kernel and virtually all computers require some workarounds in the linux kernel for some buggy hardware they have. Pretty amazing when you think about it.

EDIT: another closely related example is macbook pro SSDs and NCQ aka native command queuing. They claim they support it, but on many it's buggy. It gets better though; the linux kernel just starting trying to use such functionality by default relatively recently.

https://bugzilla.kernel.org/show_bug.cgi?id=60731

these sort of things are, as you can see, very confusing and frustrating to track down, identify, and find a general fix for

EDIT2: now that I actually read the kernel bugzilla entry further, it's more recently come to light the actual problem with recent macbook pro SSDs is MSI (efficient type of interrupts)

digi_owl11y ago

In essence the Linux kernel put on display what is on Windows hidden by proprietary device drivers.

_yosefk11y ago

The thing is, almost all hardware accessed through drivers has tons of bugs, at least it's nowhere near as close to "bug-free" as are things like CPUs or DRAMs which cannot hide their bugs behind drivers. The thing that one can hope to work reasonably is a piece of hardware plus an accompanying driver which knows to hide that hardware's issues.

So another way of putting what you said would be "on Linux there's no working driver for that piece of hardware, unlike on Windows where the 'proprietor' went to the trouble of supplying such a driver."

1 more reply

MichaelCrawford11y ago

that's one of the things drivers are for; to workaround hardware bugs.

Among the challenges faced by the AMCC 3ware RAID HBAs were faulty motherboards.

"But PCI is a standard!" you quite reasonably protest.

Yes, and the US Constitution guarantees us many inalienable rights.

sdalfakj11y ago

Since you seem to be the higher voted and showing on top, could you update your bit about queued stuff with this

https://news.ycombinator.com/item?id=9724192

ploxiln11y ago

I can no longer edit my comment.

I assumed that these drives had the same controller chip and the same firmware base as the consumer samsung SSDs, but with higher quality nand and some firmware tweaks. It's very hard to find technical details about these enterprise drives on the internet (compared to the consumer drives).

I guess the smartctl command proves it, these enterprise samsung SSDs do not have queued trim enabled.

It would make sense for enterprise drives to be more conservative and lag on feature set. But it's very surprising that enterprise drives are corrupted by original un-queued trim, they're supposed to have more validation, and that's a very common feature.

adamsurak11y ago

In this case the TRIM command was un-queued, which makes it worse.

ploxiln11y ago

It sounds to me like even when it's the fstrim utility, which uses some ioctl() to tell the kernel to trim free regions in a range on a filesystem, the kernel ends up causing the queued trim command to be used if available.

The "blacklist" does not appear to have any constant to blacklist old-style trim, only NCQ_TRIM (and other odd stuff, most notably all NCQ usage).

This makes sense, because if some SSD advertised old-style trim but was corrupted by it, then it would be found and fixed sooner by these vendors, because Windows 7 would exhibit the corruption.

1 more reply

MichaelCrawford11y ago

"workarounds in the kernel."

Please permit me to violate my NDA:

/* MacWrite needs this */

... in Mac OS System 7.5.2. I honestly don't know whether MacWrite still needed it but that code was there to work around a bug.

jlebar11y ago· 7 in thread

To me, this sort of thing brings home the value of not running your own machines. Sure, Amazon's/Google's clouds have quirks, but it's far less likely that you're going to have to debug faulty hardware in this way. It sounds like a team of more than one person worked on this at least part-time for weeks -- how much is that worth? It's not just the cost of hiring extra people to do the work; often small companies simply can't hire enough good people -- when you do find them, do you want to squander them twiddling servers?

will_hughes11y ago

If something similar happens to you on "cloud" infrastructure, you're very limited in what you can do to diagnose or work-around the problem.

At a place I used to work at we had a reasonably large cluster of Windows boxes on Amazon. Randomly, Windows machines on Amazon would suddenly stop accepting new TCP connections.

This means that machines would be running fine, and then half your cluster starts dropping offline. At the time when this happened to us, there were no other reports we could find of this happening.

Turns out, it's some bug in the Xen Virtual NIC driver that wasn't running the offloaded TCP cleanup, and so eventually the system couldn't accept any new connections. Once we figured out was happening we could pre-emptively reboot boxes, but that was a problem for us for about 6 months iirc.

There's probably dozens of these bugs affecting someone on these cloud platforms at any one time. But because you have no access to the hardware, you don't even have the option of saying "Screw it, lets just get different hardware". You're at the mercy of your cloud provider.

madez11y ago

There is no cloud - just other people's computers.

Many use-cases just require the job to be done on your computers due to security and privacy reasons. Yes, Amazon's and Google's services are in some ways less secure than your own computer, because they are hosted by companies which are subject to a government that doesn't value privacy, not even of it's own citizens. That means said government can, just to give a concrete example, NSL the companies to give up all they have about you, and you wouldn't even know notice.

When the government puts national security above fundamental human rights there is something dangerously wrong.

derefr11y ago

Thinking about individual computers will lead you astray. There are, rather, sets of machines (from single boxes to entire data-centers) that are managed by a given sysadmin staff. The more machines they manage, the more likely it is that problems will have institutionalized and operationalized solutions.

A cloud is just a sysadmin staff with a Sufficiently Large Deployment to have ironed out all the kinks in their hardware.

4 more replies

KaiserPro11y ago

Lets do some maths on that claim: AWS: c3.8xlarge with 32 "CPUS" and 60 gigs of ram.

For the machine alone its $1200 a month. Bear in mind its on a shared infrastructure, with noisy neighbours. You'll see about 10-30% CPU steal. In practice you'll see performance about half that of a real machine (from my comparisons)

Then you'll need to factor in disks as well. First things first EBS is dogshit slow. Yes ephemeral disks are fast, but then they die, so you're in the same situation. however you need 10gig networking to get low latency, avoid puncturing the cache etc,etc,etc,

for EBS the maximum IOPs you can guarantee to get is 20,000, and you need 1tb for that.

for the Iops, thats $1300 a month + $125 for the 1 TB of storage.

so a month, per machine it'll be $2625. $31500 per machine, per year.

Every 6 months, you could buy a new machine, which is faster than the fastest EC2 instance + EBS.

Now, the OP stated that they have more than one machine. Obviously one could use reserved instances. However similarly one could negotiate volume discounts.

There is of course the cost of internet and cooling, you're looking at around $500 a month for half a rack, depending on power consumption. (if you're colo'ing)

From a valuation point of view, having hardware counts towards your value, as its an asset you actually own. More importantly you can use it to lower your tax bill, and reduce your run rate, in exchange for an up front cost.

Now, if you have a lot of bursty traffic, that doesn't require much DB activity, then AWS is perfect, as the elastic IP load balancer allows you to spin up machine on demand. However thats not that helpful for Databases. Sure you can warm migrate from a EBS snapshot, but you'd best do it quick, otherwise you'll overload an already overloaded DB.

adamsurak11y ago

With our architecture, HW requirements, the price of HW and the price of the cloud VMs, even working on this for a week or two saves us significant amount of money both short-term and long-term. The side effect is that we now have tools to recover servers way faster and allows us to do things we have not thought about before.

jhead11y ago

Agreed. Additionally, some business models simply don't mesh with cloud infrastructure pricing no matter the volume. There are definitely advantages to using cloud services, but most of the time bare metal gets you more hardware/performance at a lower cost in the long run, even when you factor in everything else that it entails.

1 more reply

emodendroket11y ago

I agree with you. I don't think it makes sense except for very large companies.

madez11y ago· 6 in thread

It feels like Samsung used the Linux community here as a free testbed.

Samsung knew that only Linux supported queued trim, so releasing it without proper testing is just externalizing the disproportionately increased cost of testing to the Linux community.

adamsurak11y ago

In this case it was un-queued TRIM (I forgot to mention it in the blogpost). We have reached to Samsung and although it looked good at the beginning now they are silent for more than a month without any progress.

madez11y ago

I was refering to the Native Command Queued trim (hence "queued trim"), not the traditional trim command.

caf11y ago

Is the loss of reputation really worth less than the value of the externalised testing?

madez11y ago

That remains to be seen. I really hope not.

With Samsungs finished-forms walling the company already tells Linux users to not expect any support, at all. So, that is consisting with the testbed-theory.

1 more reply

pjc5011y ago

Loss of reputation isn't a real thing in this industry. Pretty much all hard drive manufacturers have had high-profile "bad" models, for example.

2 more replies

PythonicAlpha11y ago

I once was a huge fan of Samsung. But with the EVO disaster and this one, I really regret to have bought one of these.

1 more reply

bbcbasic11y ago· 6 in thread

I have a Samsung SSD 850 PRO 512GB in my Windows PC. And I have TRIM enabled in Windows:

     > fsutil.exe behaviour query DisableDeleteNotify
     DisableDeleteNotify = 0

Should I be worried?

ploxiln11y ago

Released versions of Windows do not use queued trim.

(That's why serious bugs like this can happen ;)

bad_user11y ago

The problem exposed in the article is about un-queued trim.

feelix11y ago

This issue is related to TRIM in the context of command queuing, not the relatively ancient straightforward TRIM which Windows supports.

malbs11y ago

Pretty sure this is an interaction issue with Samsung drives, trim support, and the Linux kernel, so no, you don't need to be worried.

sengork11y ago

Install the Samsung Magician Toolbox for best results on Windows platforms: www.samsung.com/samsungssd/

eveningcoffee11y ago

When you do this, read what they claim in ToS.

kbar1311y ago· 5 in thread

if one machine failed and failover kicked in correctly, why was the engineer paged?

jimrandomh11y ago

Because it's hard to make an automatic monitoring system that reliably distinguishes between "a failure occurred but everything is fine" and "a failure occurred and now everything is on fire".

InclinedPlane11y ago

Depends on how much spare capacity they had. Being one failure away from going down is an emergency situation at many places.

mentat11y ago

I wondered this as well. Valuing your engineers' sleep is important.

adamsurak11y ago

We have multiple different pages. In our cluster we have 3 machines and if one of them is unavailable because of broken network, we do not page. In this case the page came as an application error that the application was not able to cope with. When we have issue that we have seen before and the server can handle it on its own, we do not page.

Qantourisc11y ago

Also depends on how many machines you got running. If it's 2: do you really want to wait it out and risk the other one going to hell too ?

cabirum11y ago· 4 in thread

Strange, Samsung 840/850 evo/pro are considered [1][2] among the best consumer SSDs. The issues article mentions do not exist on Windows, the SSDs are very reliable there. I suspect it's not only Samsung fault. Are we sure Linux handling of TRIM operations is absolutely correct?

[1] http://techreport.com/review/27062/the-ssd-endurance-experim...

[2] http://www.anandtech.com/show/8216/samsung-ssd-850-pro-128gb...

notacoward11y ago

The problem is that "absolutely correct" is a slippery concept. Even the most tightly written standard is likely to have some areas of ambiguity through which bugs can creep. If the way that a particular device deals with that ambiguity is known only to those under NDA, then you can have two drivers that are both "absolutely correct" per the standard but only one actually works in all the edge conditions.

sfilipov11y ago

Windows doesn't do queued TRIM (yet).

drzaiusapelord11y ago

Personally, I find Samsung has a "it boots? Fine then ship," mentality to pretty much all things. Their buggy phones, buggy SSD's, buggy TV's, etc. I wouldn't recommend them, even though they do well on SSD speed tests (which are often gamed by on-board ram caching).

scott_karana11y ago

The 840 Pro exceeded 2.4PB of writes before failing in Anandtech's tests over 18 months: http://techreport.com/review/27909/the-ssd-endurance-experim...

Even if Samsung has some systemic problems, it's more subtle than just schlocky marketing, or targeted benchmarking.

douglasheriot11y ago· 4 in thread

Wow, that sucks. Another reason to use ZFS – you’d notice the corrupted files a lot sooner.

Freaky11y ago

Yup. I was seeing occasional corruption with my SanDisk Extreme Pro's and quite happy that ZFS was able to repair the damage each time.

The problem appears to have gone away following a firmware update, touch wood.

icebraining11y ago

Or run a verification layer on top of whatever FS you use (e.g. running git fsck would discover corruption in your git indexed files too).

ThatPlayer11y ago

Or Btrfs on Linux.

rleigh11y ago

In theory, yes. Unfortunately, every time my Btrfs filesystems have encountered a hardware glitch, it has happily trashed the filesystem beyond recovery (including both drives in a RAID1 mirror, one of which was perfectly OK). I use ZFS now, and while some features are compatable with Btrfs, the implementation quality, documentation, and feature completeness, and tool quality set it well above where Btrfs is at.

1 more reply

lvs11y ago· 3 in thread

Can someone clarify the article's claim that these Samsung drives are really "broken" as such? We have a few of these on 3.13 and 3.16 kernels and ext4 with no problems. It seems that there must be something unique to their application in order to expose these trim failures.

ploxiln11y ago

Do you have the "discard" mount option enabled? Do you have a cron job that runs the "fstrim" command? It's possible your systems are not running trim. Or maybe your ext4 filesystems have little activity and you haven't had enough corruption to notice yet :)

Also, some Samsung 800 series drives only gained this bug in a recent firmware update (840 EVO specifically).

guns11y ago

The 840 EVO joined the club with firmware EXT0DB6Q, which itself is a nasty little hack around a fundamental design problem with the tightly packed NAND cells.

Linux 4.0.5 ships with the patch linked above, but for a while you had to roll with a kernel built from source.

EDIT: The blatant file corruption issues only manifested after updating to firmware EXT0DB6Q.

1 more reply

kuschku11y ago

So, if I don’t update the firmware of my 840 EVO, I can continue using it with discard?

1 more reply

Aardwolf11y ago· 3 in thread

I'm so sick of this TRIM. Constant configurations needed because of it, constant care like "this thing you better don't do on SSDs". And then problems like this.

Do you think there'll ever be SSDs that don't need it?

userbinator11y ago

They never "needed" TRIM, it was mainly introduced as a performance optimisation.

I have an old Intel SSD that doesn't even support TRIM, and it still works fine. As do all the other USB flash drives I have...

yardie11y ago

I remember started incorporating SSDs into their computers and didn't support TRIM. Windows users were telling Mac users their Macs were practically obsolete because it couldn't do this one thing that was enabled for Windows. of course they sent that back to Apple and Apple replied, for years, you don't need it.

Eventually, they relented and enabled it on their SSDs. I'm pretty sure the marketing and engineering butted heads over this one stupid bullet point.

drzaiusapelord11y ago

Except without TRIM you'll fill all your blocks and kill performance of your fancy $1500 Apple when the SSD is performing a dozen operations to create a space to perform writes instead of one operation on a properly TRIM'd drive.

Apple didn't do this because of "windows users whining" but because they knew they didn't want an angry mob of customers wondering why their drive is 10x slower than it was on day one.

Arguably, idle GC was "good enough," for some use cases but probably not for drives that aren't sitting idle all the time and on many hours a day. Even then, Apple probably didn't want to tell its customers "let it sit out overnight" to regain performance when supporting plain-jane TRIM was a trivial addition.

On-board GC + OS-driven TRIM are considered the optimal solution for SSD's.

teraflop11y ago· 2 in thread

Here's an Ubuntu bug tracker entry for what sounds like the same problem: https://bugs.launchpad.net/ubuntu/+source/fstrim/+bug/144900...

Linux 4.0.5 includes a patch that blacklists queued TRIM for the buggy drives. Windows and OS X apparently don't support queued TRIM at all, so they're unaffected.

adamsurak11y ago

The drives we have detected the issue had still un-queued TRIM. I have reached to one of the kernel I/O developers for help and he confirmed that it is not related.

asayler11y ago

But isn't the blacklist you link to in the article specifically for queued TRIM? E.g. https://github.com/torvalds/linux/commit/9a9324d. SO either that blacklist has nothing to do with this issue (in which case it probably shouldn't be linked from the article), or it does, and we're talking about issues with queued TRIM.

1 more reply

sandGorgon11y ago· 2 in thread

I have this running on my Ubuntu Thinkpad with A Samsung 840 Pro as a weekly cron job. should I turn it off ?

  #!/bin/sh
  # call fstrim-all to trim all mounted file systems which support it
  set -e
  
  # This only runs on Intel and Samsung SSDs by default, as some SSDs with faulty
  # firmware may encounter data loss problems when running fstrim under high I/O
  # load (e. g.  https://launchpad.net/bugs/1259829). You can append the
  # --no-model-check option here to disable the vendor check and run fstrim on
  # all SSD drives.
  exec fstrim-all

teraflop11y ago

Probably, unless you're running a kernel that was released within the last couple of weeks and includes this patch: https://github.com/torvalds/linux/commit/9a9324d

sandGorgon11y ago

wow - thanks. did NOT know about this.

Aardwolf11y ago· 2 in thread

"Samsung SSD 850 PRO 512GB recently blacklisted as 850 Pro and later in 8-series blacklist"

That's what I have in my home computer, with ArchLinux.

Do you think this problem only is something particular in the servers of the author of that article, or should this be interpreted as:

linux + samsung 850 = you will lose your data?

Thanks...

adamsurak11y ago

Unless you run the latest kernel, I would disable TRIM.

stream_fusion11y ago

I have the same drive in a laptop. There were lots of trim errors in the kernel logs, with debian so I ended up disabling trim.

cft11y ago· 2 in thread

Using SAS SSD drives on a server is a bad idea for many reasons. One should use PCIe cards, that sit directly on the PCIe bus, such as FusionIO or SanDisk. They have been tested and retested (e.g. by Facebook), without the unnecessarily added complexity of SAS/SATA protocols. The I/O performance is also about 20x.

baruch11y ago

I don't think that testing by Facebook is going to help you unless you are using the exact same model as they are and are assured of using their exact firmware. At work we use SAS SSDs in large quantities and the firmware we use is customized to us (based on the mainline one). Do not assume that a bug that was fixed in our firmware was necessarily fixed in the normal one. One would think it would but it is possible that it wasn't ported to the mainline firmware.

adamsurak11y ago

I completely agree and we are going this direction.

mrmondo11y ago· 2 in thread

I've worked on some interesting SSD deployments / experiments a lot over the past 12 months. Quite honestly - I wouldn't go anywhere near Samsung products regardless of their 'PRO' labelling or otherwise.

We have had great success with both Sandisk Extreme Pro SATA and Intel DC NVMe series drives, we've also recently deployed a number of Crucial 'Micron' M600 1TB SATA drives that are performing very well and so far haven't given us any issues.

u02sgb11y ago

I've done similar over the last three years and had good luck with the Crucial drives. However if you take a look at the Linux Kernel patch they link to (search for "don't properly handle queued TRIM"): https://github.com/torvalds/linux/blob/e64f638483a21105c7ce3...

There are Crucial SSDs on the list. I'm going to be keeping a closer eye on them now.

mrmondo11y ago

Yeah I saw that - although that's the older, now discontinued series that has a different controller and doesn't show the same consistent performance as the newer M600 drives.

stream_fusion11y ago· 2 in thread

I have one of the affected drives mentioned in the article in my development laptop - the Samsung SSD 850 PRO 512GB.

As one of the most expensive SSD drives available on the market, it was disconcerting to find dmesg -T showing trim errors, when the drive was mounted with the discard option. Research on mailing lists, indicated that the driver devs, believe it's a Samsung firmware issue.

Disabling trim in fstab, stopped the error messages. However it's difficult to get good information about whether drive performance or longevity may be impacted without the trim support.

hvidgaard11y ago

Trim really is only a helpful message when the drive is near full so the GC can preemptively zero blocks and retain good write speed. Without trim, the firmware must wait until it gets a write for a particular block before it know it can be erased.

If your drive has reasonably with unprovisioned space, it can simply work around the missing trim commands - this however, is theory, I do not know if the firmware actually does this. This is the exact thing that makes some drives better than others when working without trim.

stream_fusion11y ago

Thanks. I'll probably end up creating an unprovisioned partition. It's frustrating, exactly because of the uncertainty re future performance. Especially given the price premium for pro/enterprise level hardware.

1 more reply

Supersaiyan_IV11y ago· 1 in thread

Undoubtedly the same issue happened to me on an 500GB 840 EVO with NTFS.

SSD zeroed out a part of the disk during runtime, as I watched this happen music was playing from this drive. It was mounted from Ubuntu MATE 15.04 and playing a music library through Audacious. Suddenly music glitched and IO errors began appearing. Rebooted to a DISK READ ERROR (MBR was on the EVO). Ran chkdsk from USB and it showed a ridiculous amount of orphaned files for ca. 1h. Once finished the most frequently accessed files had disappeared. Download folder, Documents folder, some system files. Of course, some of the files could've been recovered had I not ran chkdsk off the bat, bot nonetheless it's an approximate measure of failure impact.

I began being suspicious of 840 EVO when sorting old files by date became fantastically slow. If you have a feeling this has happened to you recently - buckle up for a shitstorm.

TL;DR Avoid 840 EVO.

Supersaiyan_IV11y ago

To the downvoters: this occurred a week after upgrading to Samsung's EXT0DB6Q firmware. Meaning that mentioned read delays should've been nonexistent.

Not to mention that this disk has only had 5TB written to it.

ChuckMcM11y ago

Nice debugging story. When I was at NetApp there were lots of times when drive firmware for the 'less used' options would fail. On the fiber channel drives the 'write zeros' command which was supposed to zero a drive was notorious in its in ability to achieve something that simple. When Google looked at (I don't know if they finally deployed it) the disk encryption technology it worked differently disk to disk and firmware rev to firmware rev. I think it was Brian Pawlowski at NetApp that said "You can count on two things working right in a hard drive, read, write, and seek." The joke being that you needed all three of them to work for reliable disk operation.

MrBuddyCasino11y ago

Not directly related to TRIM, but AeroSpike has a nice test suite for SSDs, probing for IOPS and latency: https://github.com/aerospike/act

They share their test results for both physical and cloud-based storage, I figured this would be of interest:

http://www.aerospike.com/docs/operations/plan/ssd/ssd_certif...

notacoward11y ago

Pretty disappointing to see some of those Samsung drives on the list, because in some of the other tests/surveys I've seen they seemed to be among the better choices. Sigh I guess Sturgeon's Law applies to SSDs too.

andmarios11y ago

Been there, done that. :|

Sometime around the end of 2013 I started getting frequently lost data and corrupted filesystems upon reboot. After much search and about 4-6 months into the issue, I found out that the culprit were the queued TRIM commands issued by the linux kernel to my Crucial M500 mSATA disk. The Linux kernel already had a quirks list with many drives, including some of the M500 variants, just not mine.

I added my model, compiled the kernel and the nightmare ended. I proceeded to submit a bug report and a patch. The patch got accepted (yay!) and the bug report turned to be very useful for other people with the same problem but different disk as I included the dmesg output that was specific to the issue. This meant that they could now google the errors and get a helpful result.

Such is the nature of free software; you are allowed to fix your computer yourself. :)

suprjami11y ago

What a wonderful story. I wish everyone was this diligent at troubleshooting. Then again, that would put me out of a job.

microcolonel11y ago

I've had issues with these samsung 8xx drives, unfortunately they all happened at once. I gave up on their RMA/warranty process because I was bounced back and forth between the same two numbers a few times. Either side said that the other was in charge of this process(samsung bought the SSD division from seagate... or was it seagate that bought the HDD division from Samsung? To this day I have no clue.).

anigbrowl11y ago

Interesting! I sometimes work with SSDs as storage media for cameras (where Sandisk is the most popular brand by a mile) and I seriously doubt any camera firmware is doing drive maintenance. From what I know of digital imaging technicians, neither are they - if a drive starts acting up in any way, the usual policy is to just take it out of service immediately, recover anything that was on it, dump it, and buy a replacement.

sengork11y ago

Given how many Samsung drives are listed in their findings, I can only attribute this to the fact Samsung make their own SSD controllers.

Figs11y ago

How do you disable TRIM on common distros? Under Ubuntu, is it just preventing /etc/cron.weekly/fstrim from running, or is there more to it? What about CentOS, etc?

frik11y ago

What SSD do cloud hoster like DigitalOcean, Linode, Rackspace, Vultr, etc use?

I would some sites trade storage speed for more space (HDDs instead of SSDs).

j / k navigate · click thread line to collapse

118 comments

85 comments · 26 top-level

ploxiln11y ago· 8 in thread

It turns out that Samsung 8XX SSDs advertise they support queued trim but it's buggy. The old TRIM command works fine.

https://lkml.org/lkml/2015/6/10/642

https://bugzilla.kernel.org/show_bug.cgi?id=60731

these sort of things are, as you can see, very confusing and frustrating to track down, identify, and find a general fix for

EDIT2: now that I actually read the kernel bugzilla entry further, it's more recently come to light the actual problem with recent macbook pro SSDs is MSI (efficient type of interrupts)

digi_owl11y ago

In essence the Linux kernel put on display what is on Windows hidden by proprietary device drivers.

_yosefk11y ago

1 more reply

MichaelCrawford11y ago

that's one of the things drivers are for; to workaround hardware bugs.

Among the challenges faced by the AMCC 3ware RAID HBAs were faulty motherboards.

"But PCI is a standard!" you quite reasonably protest.

Yes, and the US Constitution guarantees us many inalienable rights.

sdalfakj11y ago

Since you seem to be the higher voted and showing on top, could you update your bit about queued stuff with this

https://news.ycombinator.com/item?id=9724192

ploxiln11y ago

I can no longer edit my comment.

I guess the smartctl command proves it, these enterprise samsung SSDs do not have queued trim enabled.

adamsurak11y ago

In this case the TRIM command was un-queued, which makes it worse.

ploxiln11y ago

The "blacklist" does not appear to have any constant to blacklist old-style trim, only NCQ_TRIM (and other odd stuff, most notably all NCQ usage).

This makes sense, because if some SSD advertised old-style trim but was corrupted by it, then it would be found and fixed sooner by these vendors, because Windows 7 would exhibit the corruption.

1 more reply

MichaelCrawford11y ago

"workarounds in the kernel."

Please permit me to violate my NDA:

/* MacWrite needs this */

... in Mac OS System 7.5.2. I honestly don't know whether MacWrite still needed it but that code was there to work around a bug.

jlebar11y ago· 7 in thread

will_hughes11y ago

If something similar happens to you on "cloud" infrastructure, you're very limited in what you can do to diagnose or work-around the problem.

At a place I used to work at we had a reasonably large cluster of Windows boxes on Amazon. Randomly, Windows machines on Amazon would suddenly stop accepting new TCP connections.

This means that machines would be running fine, and then half your cluster starts dropping offline. At the time when this happened to us, there were no other reports we could find of this happening.

madez11y ago

There is no cloud - just other people's computers.

When the government puts national security above fundamental human rights there is something dangerously wrong.

derefr11y ago

A cloud is just a sysadmin staff with a Sufficiently Large Deployment to have ironed out all the kinks in their hardware.

4 more replies

KaiserPro11y ago

Lets do some maths on that claim: AWS: c3.8xlarge with 32 "CPUS" and 60 gigs of ram.

for EBS the maximum IOPs you can guarantee to get is 20,000, and you need 1tb for that.

for the Iops, thats $1300 a month + $125 for the 1 TB of storage.

so a month, per machine it'll be $2625. $31500 per machine, per year.

Every 6 months, you could buy a new machine, which is faster than the fastest EC2 instance + EBS.

Now, the OP stated that they have more than one machine. Obviously one could use reserved instances. However similarly one could negotiate volume discounts.

There is of course the cost of internet and cooling, you're looking at around $500 a month for half a rack, depending on power consumption. (if you're colo'ing)

adamsurak11y ago

jhead11y ago

1 more reply

emodendroket11y ago

I agree with you. I don't think it makes sense except for very large companies.

madez11y ago· 6 in thread

It feels like Samsung used the Linux community here as a free testbed.

Samsung knew that only Linux supported queued trim, so releasing it without proper testing is just externalizing the disproportionately increased cost of testing to the Linux community.

adamsurak11y ago

madez11y ago

I was refering to the Native Command Queued trim (hence "queued trim"), not the traditional trim command.

caf11y ago

Is the loss of reputation really worth less than the value of the externalised testing?

madez11y ago

That remains to be seen. I really hope not.

With Samsungs finished-forms walling the company already tells Linux users to not expect any support, at all. So, that is consisting with the testbed-theory.

1 more reply

pjc5011y ago

Loss of reputation isn't a real thing in this industry. Pretty much all hard drive manufacturers have had high-profile "bad" models, for example.

2 more replies

PythonicAlpha11y ago

I once was a huge fan of Samsung. But with the EVO disaster and this one, I really regret to have bought one of these.

1 more reply

bbcbasic11y ago· 6 in thread

I have a Samsung SSD 850 PRO 512GB in my Windows PC. And I have TRIM enabled in Windows:

     > fsutil.exe behaviour query DisableDeleteNotify
     DisableDeleteNotify = 0

Should I be worried?

ploxiln11y ago

Released versions of Windows do not use queued trim.

(That's why serious bugs like this can happen ;)

bad_user11y ago

The problem exposed in the article is about un-queued trim.

feelix11y ago

This issue is related to TRIM in the context of command queuing, not the relatively ancient straightforward TRIM which Windows supports.

malbs11y ago

Pretty sure this is an interaction issue with Samsung drives, trim support, and the Linux kernel, so no, you don't need to be worried.

sengork11y ago

Install the Samsung Magician Toolbox for best results on Windows platforms: www.samsung.com/samsungssd/

eveningcoffee11y ago

When you do this, read what they claim in ToS.

kbar1311y ago· 5 in thread

if one machine failed and failover kicked in correctly, why was the engineer paged?

jimrandomh11y ago

Because it's hard to make an automatic monitoring system that reliably distinguishes between "a failure occurred but everything is fine" and "a failure occurred and now everything is on fire".

InclinedPlane11y ago

Depends on how much spare capacity they had. Being one failure away from going down is an emergency situation at many places.

mentat11y ago

I wondered this as well. Valuing your engineers' sleep is important.

adamsurak11y ago

Qantourisc11y ago

Also depends on how many machines you got running. If it's 2: do you really want to wait it out and risk the other one going to hell too ?

cabirum11y ago· 4 in thread

[1] http://techreport.com/review/27062/the-ssd-endurance-experim...

[2] http://www.anandtech.com/show/8216/samsung-ssd-850-pro-128gb...

notacoward11y ago

sfilipov11y ago

Windows doesn't do queued TRIM (yet).

drzaiusapelord11y ago

scott_karana11y ago

The 840 Pro exceeded 2.4PB of writes before failing in Anandtech's tests over 18 months: http://techreport.com/review/27909/the-ssd-endurance-experim...

Even if Samsung has some systemic problems, it's more subtle than just schlocky marketing, or targeted benchmarking.

douglasheriot11y ago· 4 in thread

Wow, that sucks. Another reason to use ZFS – you’d notice the corrupted files a lot sooner.

Freaky11y ago

Yup. I was seeing occasional corruption with my SanDisk Extreme Pro's and quite happy that ZFS was able to repair the damage each time.

The problem appears to have gone away following a firmware update, touch wood.

icebraining11y ago

Or run a verification layer on top of whatever FS you use (e.g. running git fsck would discover corruption in your git indexed files too).

ThatPlayer11y ago

Or Btrfs on Linux.

rleigh11y ago

1 more reply

lvs11y ago· 3 in thread

ploxiln11y ago

Also, some Samsung 800 series drives only gained this bug in a recent firmware update (840 EVO specifically).

guns11y ago

The 840 EVO joined the club with firmware EXT0DB6Q, which itself is a nasty little hack around a fundamental design problem with the tightly packed NAND cells.

Linux 4.0.5 ships with the patch linked above, but for a while you had to roll with a kernel built from source.

EDIT: The blatant file corruption issues only manifested after updating to firmware EXT0DB6Q.

1 more reply

kuschku11y ago

So, if I don’t update the firmware of my 840 EVO, I can continue using it with discard?

1 more reply

Aardwolf11y ago· 3 in thread

I'm so sick of this TRIM. Constant configurations needed because of it, constant care like "this thing you better don't do on SSDs". And then problems like this.

Do you think there'll ever be SSDs that don't need it?

userbinator11y ago

They never "needed" TRIM, it was mainly introduced as a performance optimisation.

I have an old Intel SSD that doesn't even support TRIM, and it still works fine. As do all the other USB flash drives I have...

yardie11y ago

Eventually, they relented and enabled it on their SSDs. I'm pretty sure the marketing and engineering butted heads over this one stupid bullet point.

drzaiusapelord11y ago

Apple didn't do this because of "windows users whining" but because they knew they didn't want an angry mob of customers wondering why their drive is 10x slower than it was on day one.

On-board GC + OS-driven TRIM are considered the optimal solution for SSD's.

teraflop11y ago· 2 in thread

Here's an Ubuntu bug tracker entry for what sounds like the same problem: https://bugs.launchpad.net/ubuntu/+source/fstrim/+bug/144900...

Linux 4.0.5 includes a patch that blacklists queued TRIM for the buggy drives. Windows and OS X apparently don't support queued TRIM at all, so they're unaffected.

adamsurak11y ago

The drives we have detected the issue had still un-queued TRIM. I have reached to one of the kernel I/O developers for help and he confirmed that it is not related.

asayler11y ago

1 more reply

sandGorgon11y ago· 2 in thread

I have this running on my Ubuntu Thinkpad with A Samsung 840 Pro as a weekly cron job. should I turn it off ?

  #!/bin/sh
  # call fstrim-all to trim all mounted file systems which support it
  set -e
  
  # This only runs on Intel and Samsung SSDs by default, as some SSDs with faulty
  # firmware may encounter data loss problems when running fstrim under high I/O
  # load (e. g.  https://launchpad.net/bugs/1259829). You can append the
  # --no-model-check option here to disable the vendor check and run fstrim on
  # all SSD drives.
  exec fstrim-all

teraflop11y ago

Probably, unless you're running a kernel that was released within the last couple of weeks and includes this patch: https://github.com/torvalds/linux/commit/9a9324d

sandGorgon11y ago

wow - thanks. did NOT know about this.

Aardwolf11y ago· 2 in thread

"Samsung SSD 850 PRO 512GB recently blacklisted as 850 Pro and later in 8-series blacklist"

That's what I have in my home computer, with ArchLinux.

Do you think this problem only is something particular in the servers of the author of that article, or should this be interpreted as:

linux + samsung 850 = you will lose your data?

Thanks...

adamsurak11y ago

Unless you run the latest kernel, I would disable TRIM.

stream_fusion11y ago

I have the same drive in a laptop. There were lots of trim errors in the kernel logs, with debian so I ended up disabling trim.

cft11y ago· 2 in thread

baruch11y ago

adamsurak11y ago

I completely agree and we are going this direction.

mrmondo11y ago· 2 in thread

u02sgb11y ago

There are Crucial SSDs on the list. I'm going to be keeping a closer eye on them now.

mrmondo11y ago

Yeah I saw that - although that's the older, now discontinued series that has a different controller and doesn't show the same consistent performance as the newer M600 drives.

stream_fusion11y ago· 2 in thread

I have one of the affected drives mentioned in the article in my development laptop - the Samsung SSD 850 PRO 512GB.

Disabling trim in fstab, stopped the error messages. However it's difficult to get good information about whether drive performance or longevity may be impacted without the trim support.

hvidgaard11y ago

stream_fusion11y ago

1 more reply

Supersaiyan_IV11y ago· 1 in thread

Undoubtedly the same issue happened to me on an 500GB 840 EVO with NTFS.

I began being suspicious of 840 EVO when sorting old files by date became fantastically slow. If you have a feeling this has happened to you recently - buckle up for a shitstorm.

TL;DR Avoid 840 EVO.

Supersaiyan_IV11y ago

To the downvoters: this occurred a week after upgrading to Samsung's EXT0DB6Q firmware. Meaning that mentioned read delays should've been nonexistent.

Not to mention that this disk has only had 5TB written to it.

ChuckMcM11y ago

MrBuddyCasino11y ago

Not directly related to TRIM, but AeroSpike has a nice test suite for SSDs, probing for IOPS and latency: https://github.com/aerospike/act

They share their test results for both physical and cloud-based storage, I figured this would be of interest:

http://www.aerospike.com/docs/operations/plan/ssd/ssd_certif...

notacoward11y ago

andmarios11y ago

Been there, done that. :|

Such is the nature of free software; you are allowed to fix your computer yourself. :)

suprjami11y ago

What a wonderful story. I wish everyone was this diligent at troubleshooting. Then again, that would put me out of a job.

microcolonel11y ago

anigbrowl11y ago

sengork11y ago

Given how many Samsung drives are listed in their findings, I can only attribute this to the fact Samsung make their own SSD controllers.

Figs11y ago

How do you disable TRIM on common distros? Under Ubuntu, is it just preventing /etc/cron.weekly/fstrim from running, or is there more to it? What about CentOS, etc?

frik11y ago

What SSD do cloud hoster like DigitalOcean, Linode, Rackspace, Vultr, etc use?

I would some sites trade storage speed for more space (HDDs instead of SSDs).

j / k navigate · click thread line to collapse