24/192 music downloads make no sense (opens in new tab)

(people.xiph.org)

325 pointsLERobot10y ago228 comments

228 comments

133 comments · 30 top-level

hatsunearu10y ago· 22 in thread

The single worst thing that harms audio quality is excessive compression. It's ruining everything, I'd say. Heck, I'm sure everyone who knows their worth would agree that compression harms audio quality.

Recommended reading: https://en.wikipedia.org/wiki/Loudness_war

edit: compression as in dynamic range compression, not data compression like mp3 in audio

beat10y ago

I've played acoustic and electric instruments for over 30 years, and recorded numerous albums.

Compression is the best tool we have for accurately reproducing the musicality and emotion of a musical performance. Without compression, most recordings would be unlistenable.

Don't confuse the foolishness of the loudness wars for "compression is bad". That's like saying the internet is bad because there's porn on it.

chillingeffect10y ago

I respect your opinion, but it's only your opinion, not a global truth.

Compression is a style. There is far more to musicality and emotion than compression. The problem compression solves is that the environments where industrialized cultures now listen in are not dedicated listening areas, but alternately loud and quiet places, so compression makes all parts of the music almost equally loud so there are no drop outs where the quieter parts would be. There is no need to compress music in headphones, for example, to the extent that it is currently compressed.

I find compression and other techniques such as removing vocal breath sounds, makes most recordings unlistenable. They don't sound like humans anymore, but synthetic puppets animated by humans with conflicting values. Take the Foo Fighters, for example. They're popular, sure, but all of their songs sounds like one continuous din. Between the compression induced by the guitar distortion settings and the compression added to the recording, then the compression added by the radio station, it just sounds like a waterfall with a few bandpass filters changing between the verse and chorus.

Also their vocals have no dynamics. When he yells loud, the vocals don't get louder but the timbre changes. That changes it from cathartic to strained. The dynamics have all been flattened.

Why do you think the indie rock movement and bands and styles with wide dynamic range like the Pixies, Nirvana and dubstep got so popular? They eschewed the trend of hardline compression with alternating loud and quiet parts. They match the rhythm of human thought and motion which has fast and slow, detail and empty parts.

> That's like saying the internet is bad because there's porn on it.

Yes but on the internet you can go where there is no porn. Where can you find music with no compression?

6 more replies

bootload10y ago

"Compression is the best tool we have for accurately reproducing the musicality and emotion of a musical performance. Without compression, most recordings would be unlistenable."

That is your artistic choice, as it should be.

"The MP3 only has 5 percent of the data present in the original recording. … The convenience of the digital age has forced people to choose between quality and convenience, but they shouldn’t have to make that choice." -- Neil Young. [0]

Not every artist wants this to happen. They have no choice and listeners get a fraction of the sound recorded. This was not the case with vinyl.

@mborch, the exact compression method is of less importance than recognising that for all the compression being discussed, is a retrograde step from vinyl. Why?

[0] http://allthingsd.com/20120131/neil-young-and-the-sound-of-m...

2 more replies

hammock10y ago

Dynamic range compression is an important part of the aesthetic of pop music today, like it or not. Pop hits don't sound the same without it. For this reason (unaffected by normalization on itunes/youtube) it will be slow to go away.

Eiriksmal10y ago

Did some brief Googling to see if I could participate in a blind listening test and hear the difference between 24- and 16-bit recordings. I found this instead [0], an even more interesting gap to see if you can hear the difference between 8- and 16-bit!

The source song, PSY's Gagnam Style, is the epitome of modern pop. I got a 3/10 on the listening test on a decent pair of Sennheiser headphones in a quiet room.

Some people are commenting that modern pop pairs well with 16-bit because of the heavy-handed mastering techniques and that older music thrives under 24-bits. Well, Audio Check offers the same 16 vs 8 test, using a Neil Young track from 1989... I couldn't fool myself into hearing any differences between the source WAVs at all and didn't even attempt to score the 10 soundbites.

[0] http://www.audiocheck.net/blindtests_16vs8bit.php

3 more replies

copperx10y ago

This is true. HDTracks.com offers one of the Green Day records with no mastering compression.

Yes, it has much more dynamic range, but it sounds wrong. Compression basically emulates what our ears naturally do when hearing very loud material, so compression gives one the feeling that the music is LOUD.

1 more reply

nosuchthing10y ago

If you listen to pop recordings between 1950-1980, you'll notice there's more depth and character to every instrument.

Yes, dynamic range compression is currently used/abused extensively in pop music productions, but if mixes weren't compressed they would have a much wider range for the sounds to play around in.

  Not only is Justin Bieber's My World 2.0 louder than 
  Metallica's The Black Album, it's louder than The Sex 
  Pistols' Never Mind The Bollocks.

http://www.sonicstate.com/news/2011/02/21/why-is-justin-bieb...

1 more reply

kinghajj10y ago

I recall listening to a vinyl rip of Lady Gaga's first album, and found that "Pokerface" was much more enjoyable with the additional dynamic range. So I'm not sure if overcompression is necessary for newer songs, but rather that it's just what the mass market has become accustomed to because of the loudness war.

2 more replies

muraiki10y ago

Reading "compression" as both a programmer and audio-minded person made that first sentence difficult to parse at first. :)

But yeah, it's obvious when I have my car stereo nearly on max to listen to classical or jazz, and then if I turn the radio and get a pop music station my ears are about to explode.

JonnieCache10y ago

Don't worry, the loudness war is basically over. itunes and youtube normalize everything to -15dbRMS and -13dbRMS respectively. Spotify uses the ReplayGain algorithm. Broadcasters worldwide now use the EBU R128 standard, which is a more effective form of loudness normalization, hopefully to be adopted by online services too at some point. All that's required is for personal media players to widely implement loudness normalization (by default) for preexisting records in peoples collections.

http://productionadvice.co.uk/lufs-dbfs-rms/

4 more replies

bitwize10y ago

It's not just the acoustic properties of dynamic-compressed music. The compression forces music to be more simplistic, because there's less "room" for complexity. Michael Jackson's "Thriller" is a standard pop hit, engineered for chart performance and not artistic virtuosity, but it is still a rich, deep, satisfying piece of music to listen to in part because there's a whole lot of stuff going on. If you tried to cram that many instrument parts into a modern Katy Perry tune, it would clip like crazy and become unlistenable hash.

2 more replies

darkmighty10y ago

I have mixed feelings about it. But I fully agree all music should be uncompressed by default.

The good thing about compression is that it allows you to save your hearing quite a bit. Some music have dramatic parts that get super loud, which can have awesome emotional response; but it does take a toll on your hearing, unless you are in a very silent environment or have fantastic headphone insulation (I have none) -- so compression actually allows me to hear everything the music has to offer. I also use compression tool on my soundcard to play most games, specially FPSes that have incredibly loud bangs and yet you need to hear footsteps and quiet environmental noises -- with a compressor that's possible without blowing up your ears.

donatj10y ago

I'm truly curious how having good isolation would help your hearing? Is the room noise a large additive?

1 more reply

pseudosavant10y ago

The irony is that extreme dynamic compression only became popular after all of the mediums we listen to (CDs, MP3, digital radio, etc) went digital and we got >90 db dynamic range and signal-to-noise ratio. Tapes and records are typically less than 60 db. So our digital music is capable of being as dynamically expressive as our ears but we clamp/ruin the dynamic range down to <20db.

thrownaway242410y ago

There's a fundamental difference between analog and digital, though. On digital you have to stay well away from the limits or you will get extreme artifacts. On analog, you can stomp on the limits, and reconstruct the original signal with filters (analog or digital) after the fact, because tape doesn't simply stop responding, it just responds less as the inputs get more extreme.

sagawee10y ago

You are talking about (dynamic) compression applied to an entire song with the sole purpose of making it as loud as possible to the human ear but still complying to the max allowed peaks (or to the energy of the signal).

This does degrade the quality, sometimes heavily. Nevertheless it is done by mastering engineers (they rarely enjoy it) as well as by radio and tv stations extensively because of the psycho-acoustic fact that a songs appears to be better if it is played louder. This gives them an advantage over the competition: On average, people searching for a radio station are more likely to listen to your radio station if it is louder than the competition.

The main issue lies in the fact that the current peak measurement of audio signals does only marginally correlate with the perceived loudness and heavy compression is used to trick this system. The broadcasting industry is aware of this. An open and quite effective loudness measurement algorithm [0] has been introduced a few years ago and it gets slowly adapted all over the world by new broadcasting laws: AGCOM 219/09/CSP (Italy), ARIB TR-B32 (Japan), ATSC A/85 PRSS CALM Act (US), EBU R128 (Europe) and OP-59 (Australia). iTunes Soundcheck is also based on [0] and since this year Youtube applies this to newly uploaded videos as well [1]. Even games use [0] to keep their audio at a consistent loudness.

So slowly, the over-usage of compression does not give music producers and broadcasters any advantage anymore and beautiful dynamic music will be competitive again.

I have collected some links [2] about this topic. Because of the lack of any affordable implementation at the time I created one myself [3] with some additional notes [4].

[0] ITU-R BS.1770, http://www.itu.int/dms_pubrec/itu-r/rec/bs/R-REC-BS.1770-4-2... [1] http://productionadvice.co.uk/youtube-loudness/ [2] https://www.klangfreund.com/lufsmeter/manual/#about_loudness [3] https://github.com/klangfreund/LUFSMeter [4] https://github.com/klangfreund/LUFSMeter/tree/master/docs/de...

snissn10y ago

This actually helps me articulate what I don't like about Spotify and why I don't use it, they only seem to have modern remasterings off all of the albums from the 70s, 80s and 90s that I'd want to listen to, but they lose a lot of the feel of the originals that I'm used to.

bootload10y ago

This is a good point. Further recommended listening, "Neil Young on Why High-Resolution Music Matters" ~ https://www.youtube.com/watch?v=5oTtylYR76o (55min, 2015JAN17)

CamperBob210y ago

This. The reality is that we're arguing over whether it's best to use 16 bits or 24 bits to distribute 8-12 bits worth of information. The problem that really needs to be addressed is in the studio, not in the iTunes store. Compression-happy producers are doing more than their share of the work required to make music suck. </pet_peeve>

Edit: Some useful educational material to read before moderating: https://en.wikipedia.org/wiki/Loudness_war .

rudolf010y ago

Excessive data compression will also lower dynamic range, though, no?

klodolph10y ago

Not really, no.

Maybe if you really mangled your audio by encoding at extremely low bit rates.

But in general, no.

2 more replies

maloney10y ago

I think you are confusing limiting with compression.

earlz10y ago· 15 in thread

I can't hear a difference between 96khz/44khz in it's raw form. However, I can tell the difference from effects in audio mixing. The extra detail can really make a difference in how well an audio effect VST works.

I have a 96khz/24bit interface that I use and ATH-M30X headphones, and I can tell a difference between at least some 24bit FLAC files and 16bit highest-quality-possible MP3s. I was mixing my own music and the difference was quite obvious to me. The notable thing was that drum cymbals seemed to have a bit less sizzle and such.

Now that being said, if I hadn't heard the song a million times in it's lossless form from trying to mix it, I probably wouldn't have noticed, and even then it didn't actually affect my "experience".

I'm one of those guys that downloads vinyl rips as well, but I do that mostly just to experience the alternative mastering, not that I think it's higher quality or anything. (though I have heard a terrible loudness-war CD master that sounded great on vinyl with a different master)

ska10y ago

The article is pretty clear about this too - higher bitdepths and sampling rates can be quite useful in mixing and recording situations.

They're pointless for playback.

niels_olson10y ago

> higher bitdepths and sampling rates can be quite useful in mixing

That is really the central issue. It's much like imaging since the time of Ansel Adams: the sensor can capture more dynamic range than the human eye can experience. The producer may have use for that range when editing, but the audience will never know what was -- may have been -- missed. And we're not talking about limits of reproduction. We're talking about the human sensors both instantaneous and absolute upper and lower bounds.

2 more replies

alkonaut10y ago

Doesn't that depend on what type of playback you are doing? More and more playback these days is done via digital transfer with the volume set in software at the sending end, to amplifiers at fixed volume, such as many multi-room systems.

If I airplay a song from my iPhone and have the volume at 50% set in software, then a few extra bits can help. Not sure if it makes a noticable difference, but it's a digital mixing scenario occurring at playback. If you play at extremely low volume it should be noticable.

3 more replies

rcthompson10y ago

I think the point was that sometimes you do want to apply some effect to the sound at playback time, e.g. an equalizer, and in that case a higher bit depth could maybe conceivably become useful.

TheOtherHobbes10y ago

No they're not. And no matter how many times this gets linked to on the Internet, it's still wrong.

The basic problem: the quieter a sound or detail gets, the fewer bits of resolution are used to represent it.

In 16-bit recording, there simply aren't enough bits to represent very low level details without distorting them with a subtle but audible crunchy digital halo of quantisation noise.

In a 24-bit recording, there are.

Talking about dynamic range completely misses the point. It's the not the absolute difference between the loudest and quietest sounds that matters - it's the accuracy with which the quieter sounds are reproduced.

This is because in a studio, 0dB full-scale meter redline is calibrated to a standard voltage reference, and both consumer and professional audio has equivalent standard levels for the loudest level possible.

These levels don't change for different bit depths, and they're used on both analog and digital equipment. (In fact they've been standard for decades now.)

This is why using more bits does not mean you can "reproduce music with a bigger dynamic range" - not without turning the volume up, anyway.

What actually happens is that the maximum possible volume of a playback system stays the same, but quieter sounds are reproduced with more or less accuracy.

In a 16-bit recording quiet sounds below around 50Db have 1-8 bits of effective resolution, which is nowhere near enough for truly accurate reproduction. (Try listening to an 8-bit recording to hear what this means.)

You might think it doesn't matter because they're quiet. Not so. 50dB is a long way from being inaudible, ears can be incredibly good at spectral estimation, and your brain parses spectral content and volume as separate things.

There's a wide range between "loud enough to hear" and "too loud" and 24-bit covers that whole range accurately. 16-bit is fine for louder sounds, but the quieter details just above "loud enough to get hear" get audibly bit-crushed.

The effect isn't glaringly disturbing, and adding dither helps make it even less obvious. But it's still there.

24-bit doesn't need tricks like dither - because it does the job properly in the first place.

Now - whether or not commercial recordings have enough musical detail to take full advantage of 24-bits is a different question. For various reasons - compression, mastering, cheapness - many don't.

But if you have any kind of aural sensitivity, you really should be able to A/B the difference between a 24-bit uncompressed orchestral recording and a 16-bit recording using an otherwise identical studio-grade mixer/mike/recorder/speaker system without too much difficulty.

6 more replies

pizza23410y ago

Compressed formats are not really relevant to considerations about the bit depth itself.

Besides, mp3 [audio] compressions have difficulty in handling specific samples, or type of samples (eg. sharp attacks), and they may manifest artifacts independently of the bitrate; MP3, AFAIK, also has a ceiling of 320 kbps within the standard specification, which certainly doesn't help.

Secondly, I'm not sure if you process further the MP3s (when you refer to mixing), but if you do, you're definitely going to make noticeable, artifacts which weren't so in the unprocessed MP3 form.

earlz10y ago

I mean, I'm just some hobbyist, but my understanding is everyone renders to lossless and then converts that to the various MP3/AAC formats, never changing anything solely because of the final compressed format.

hatsunearu10y ago

Old thread, but I thought I made it clear that I wasn't talking about data compression, but rather dynamic range compression.

mmastrac10y ago

MP3's perceptual model can still throw away information at the highest quality settings. FLAC doesn't throw away anything.

It's possible you are just hearing the difference between codecs. You'd have a fairer comparison with 24-bit vs 16-bit FLAC.

radicalbyte10y ago

Cymbals are the biggest tell for compressed music. Even with crappy speakers they sound very strange.

copperx10y ago

SiruisXM satellite radio is probably the worst offender. Makes music unlistenable to me.

Even 128Kbps MP3s render cymbals better.

nullc10y ago

" can tell the difference from effects in audio mixing."

Yes, non-linear effects can be sample rate sensitive. However-- this really means that their internal model is aliasing and not faithfully simulating an infinite sample rate system.

In an ideal world, effect that needed more sample rate would internally upsample/downsample (or be constructed in a way that they didn't need to). Then they would behave consistently across rates; though doing this would waste cpu cycles.

In any case, the article is all about distribution. Having excess rate in mastering is cheap and harmless, and-- because of these reasons, can be practically pretty useful.

malandrew10y ago

For those interesting in different versions of an album where it was mastered by a different audio engineer, you should check out the Steve Hoffman forums: http://forums.stevehoffman.tv/

thrownaway242410y ago

Or even the same engineer years later.

shams9310y ago

That's your processing overhead, you can mess with the sound a lot more at 96k before you hear audio issues.

The difference you hear is the difference between flac's lossless format and mp3's lossy format it has nothing to do with 16 bit versus 24 bit.

andy_ppp10y ago· 10 in thread

Why do high quality DACs clearly sound better then? And they sound better with better files. Maybe it really is all in my head but I mean listening to a £20000 hifi the other day (vinyl) really just shocked me.

I was listening to Marvin Gaye on my friends system and I could hear that there were several different backing singers all moving and at different distances from the microphone.

Are there any double blind trials anywhere of Vinyl/CD/24-192khz with super high end hifi systems? Mostly I see people suggesting that these tests are performed from the phono output of a mac with a pair of average ear buds...

JonnieCache10y ago

>Why do high quality DACs clearly sound better then? And they sound better with better files. Maybe it really is all on my head but I mean listening to a £20000 hifi the other day...

You were listening to £20,000 worth of amps and speakers, and you were most likely in an acoustically treated room.

Also, novelty is almost always euphonic when it isn't overtly bad. This fact is often neglected. You hear something you didn't hear before and your brain immediately tells you that it sounds better, even if it doesn't actually represent higher fidelity. Actually making an objective judgement requires a careersworth of experience, or a test lab and the skills to use it.

For example: you were listening to vinyl, which is covered in delicious noise and warm harmonic distortion, and is mastered differently. Highly euphonic, very novel if youve only ever heard the CD version before, but definitely not higher fidelity.

BTW higher end DACs do sound better, but the rest of your signal chain needs to be really good for you to notice it. It's often to do with better phase accuracy between the left and right channels, which affects the soundstage, or stereo image. If your speakers/amp have loose timing however, you'll never be able to tell.

sanoli10y ago

> higher end DACs do sound better

This hasn't passed the blind tests either. A good, 100 dollar dac (a schiit or an odac) will sound just as good as a 1000 dollar dac.

1 more reply

lawnchair_larry10y ago

Vinyl actually has far less fidelity. You also physically change the recording every time you play it back. Even on the same equipment, no two plays of a vinyl LP sound exactly the same, unlike digital.

This fact alone should cause you to question your subjective experience. You have no idea what part of that system was contributing to what you found pleasant. Someone who knew what they were doing could probably build a $2000 system that would blow you away just the same.

And if you were playing vinyl, there wasn't even a DAC present in the signal chain :)

cthalupa10y ago

>Vinyl actually has far less fidelity.

Vinyl mastering is sometimes better than CD mastering though, due to the loudness war.

I would love to sell my turntable and vinyl collection and rely purely on digital formats. Takes up less space, technically superior format, etc.

But one thing keeps me buying vinyl:

AWFUL mastering on CDs. A significant portion of LPs are released with more normal mastering on the vinyl, while the CD will be brickwalled all to hell.

I listen to metal, and rock as a broader genre is particularly bad about it. One of my favorite albums of last year, Fallujah's The Flesh Prevails, had a dynamic range of 2 to 3 on almost every track on the CD. The vinyl master? 9 to 10. Still not great, but leaps and bounds better. The CD actually clips if you convert the songs into MP3.

Until they go back to not murdering CD mastering, I'll continue buying vinyl :(

(I know your comment isn't directly about vinyl being bad or anything - I just have a compulsion to bitch about the loudness war any chance I can)

1 more reply

criddell10y ago

> You also physically change the recording every time you play it back

Not on a laser stylus turntable.

http://diffuser.fm/laser-turntable/

1 more reply

oberstein10y ago

Here's a classic "OMG I can hear him moving around" recording that works with the cheapest of headphones: https://www.youtube.com/watch?v=IUDTlvagjJA Most of the audio experience quality comes out of the production phase, and artists can put as much or little effort into that as possible and consider several mediums of listening (headphones, TV, surround, concert, vinyl) and make tradeoffs for the medium's particular experience.

I don't know about double blind trials but people do tests on their own. It's further complicated though because the hardware you use could be optimized for certain types of music, e.g. have a read through http://arstechnica.com/gadgets/2014/07/some-of-the-worlds-mo...

codazoda10y ago

That is awesome. At first, I actually thought people around me were making the noise and speaking. I thought the audio sample had not yet started.

ianferrel10y ago

Wine tastes better when it's poured out of fancy bottles, too.

The article mentions just such a study performed with high end equipment.

aidenn010y ago

Better masters are the key difference. See for example [1]; everyone agreed that the DVD-A and SACDs sounded better when truncated to 16 bits then a printed CD.

1: http://drewdaniels.com/audible.pdf

justin6610y ago

See listening tests section of TFA:http://people.xiph.org/~xiphmont/demo/neil-young.html#toc_lt

rplst810y ago· 8 in thread

First, let me state that I believe that CD audio, played through a modern DAC and quality stereo equipment is pretty much the pinnacle of home audio listening. That is to say, I think 44.1kHz 16-bit PCM audio is plenty good and I'm in no rush to replace my CD collection, nor do I think significant investment in higher bandwidth audio (for playback, mixing and mastering are another story) buys you much.

That said, there's one thing the article does not address and that is "beating", or really inter-modulation distortion from instrumental overtones.

Instruments are not limited to 20-20kHz. They can have overtones well above this range. Additionally, note that short pulse-width signals, i.e. transients, like drum strikes, especially involving wooden percussion, can have infinite bandwidth. (Not really infinite, but pulse-width is inversely proportional to bandwidth.

In a real listening environment (i.e. live performance) these overtones have a chance to interact with one another in the air. It is possible that these overtones may beat with one another and cause inter-modulation products in the audible range. For an example of this, play a 1000 Hz tone through your left speaker, and a 1001 Hz through your right speaker. You will hear a distinct 1 Hz "beat". The audibility of these are largely dependent on listening position and amplitude, but it is possible to occur with instruments. Since most recordings are done using a "close mic" technique (placing the microphone very close to the source) the interactions such as this are never recorded.

However, if full bandwidth of the producing instruments is preserved, these interactions of the overtones can be reproduced in a playback environment given equipment having a wide enough bandwidth and degree of quality.

cynicalkane10y ago

Nope. Intermodulation distortion for out-of-range frequencies is inaudible. The 1hz beat you are hearing is not a 1hz sound wave, it's a 1000.5hz sound wave becoming louder and softer once per second.

The comparison of a 1hz beat to a 1hz sound should be absurd on its face: you need about 20-30hz to become audible, and it's a low rumble more felt than heard. Very low frequencies sound absolutely nothing like intermodulation beats.

hamiltonkibbe10y ago

Intermodulation creates both sum and difference frequencies, the latter can certainly fall into the audible spectrum, assuming the ultrasonic frequencies are within the passband of your recording medium. The sum component can also alias back into the passband as well...

thescriptkiddie10y ago

Is there a difference? Audio is one-dimensional, frequency is just the derivative of amplitude. An arbitrarily high frequency sound wave becoming louder and softer 440 times per second is just as much an A as a 440 Hz sound wave at constant volume. A lot of cheap audio gear even uses a "1-bit DAC" that is just very high frequency PWM.

1 more reply

tonecluster10y ago

"...and cause inter-modulation products in the audible range." AFAIK, this is true in acoustic environments under conditions as described in the original post.

1 more reply

darkmighty10y ago

First off, from the point of view of the linear theory of sound waves, you're plain wrong. Two waves of any frequencies will only interact additively in linear media -- so no low frequencies are created through their interaction (unless non-linear effects come into play, but those usually create higher, not lower, order harmonics afaik). Beating is merely an interpretation of a modulating wave, the reality is the Fourier spectrum.

Second, as far as I know our hearing is composed of linear excitation elements (they have a definite bandwidth), and this is confirmed pretty well by experiments with human hearing -- you can see the threshold of our hearing at about 20kHz and that we experience tones of different frequencies fairly independently. Those assumptions imply that two tones, one at e.g. 50kHz and another at 50.001kHz are inaudible, end of story.

You can actually do this experiment yourself if you have a signal generator that can do 1Hz amplitude modulation and drive a transducer with a non-negligible sensitivity in that range.

vadman10y ago

But the within-audible-range-"beat" at the recorded "listening position" (where the microphones are located) would be recorded anyway, no?[1] So how does hi-res audio help in this case?

[1] AFAIK most music is not recorded like that, instruments are recorded separately and then overlaid; but then adding realistic-sounding "beats" based on whatever positioning the sound engineer envisions should be possible in software?

xiphmont10y ago

>That said, there's one thing the article does not address and that is "beating", or really inter-modulation distortion from instrumental overtones.

Beating and intermodulation distortion are entirely different things. They look similar on an oscilloscope, but they're not and they don't sound the same.

>Instruments are not limited to 20-20kHz. They can have overtones well above this range.

Correct. You can't hear the overtones beyond the upper portion of the hearing range (many people believe you can).

>In a real listening environment (i.e. live performance) these overtones have a chance to interact with one another in the air.

In reality they do not unless you're driving the air so hard the trough rarification is approaching hard vacuum. (That's not actually impossible. It's how ultrasonic audio 'beaming' devices work). Some performances are powerful enough to get close, eg, if you're sitting six feet from the pipe organ.

Once you're driving air so hard it becomes nonlinear, thus introducing intermodulation distortion in the air, that distortion produces actual audible-range distortion products. And because the distortion you're hearing is in the audible range, a recording will sample and reproduce it accurately.

You're hearing the audible _result_ of IMD, you're not somehow listening to the distortion curve itself.

> It is possible that these overtones may beat with one another

You're continuing to confuse beats and IMD, but here you're talking about beat frequencies, so Yes. But beat frequencies are a sort of auditory illusion. If one of the frequencies that would produce a beat is inaudible--- there's no beat. Easy to test, go try it.

> and cause inter-modulation products in the audible range.

IMD is not a beat. Inaudible ultrasonics will produce audible artifacts when the underlying reproduction system is nonlinear (another way of saying 'there's intermodulation distortion'). However, that's a playback artifact. If the IMD products were audible in the original signal, audible range sampling would reproduce them.

If it wasn't audible in the original performance, it should not be part of the recording, and it should not be part of the playback.

Keyframe10y ago

Dumb question, but wouldn't you need to reproduce position of audio sources as well in order to replicate that?

splitdisk10y ago· 7 in thread

From my experience, what matters more than sample rate is 24 bit vs. 16 bit sampling in the recording/production process. Using heavy compression and EQ can mean that very quiet sounds can become louder, in this case 24 bit recording is ideal. Sample rate wise, anything above 40khz is fine for most ears (I've probably lost a few khz in the upper range anyways) Another note is that most converters operate at a multiple of 48K, so it makes sense to use 48/96khz if you are recording. It all comes down to how much disk space you have, and want to use up.

klodolph10y ago

I can still hear around the 20k range, so let's not exclude a few listeners just because we wear hearing protection when it matters. In practice, 20k audio content means 44k or higher sample rates, due to the fact that actual filters have finite transition bands. There's an unfortunate history of engineers with poor hearing who inflict pain on others, such as the horizontal retrace on NTSC TVs which still annoys me when I encounter one.

24bit also means we don't have to record at 0dBFS, which saves a lot of time.

splitdisk10y ago

I'm very jealous of your healthy hearing range... I now wear hearing protection, but the damage has been done. When it comes to file storage, I will take a 44.1/48K 24 bit FLAC happily, since it usually comes out to the same size as a CD-Q WAV file anyways. I see no reason why everything shouldn't be in this format, but CD's have made a serious dent in formatting standards/habits.

1 more reply

toast010y ago

Telephone audio is 8khz, so recording at a multiple of 8k helps with downsampling for IVR prompts or hold music. With dithering, it isn't terrible to downsampling from 44.1k to 8k, but it's nice to avoid it.

jchrisa10y ago

Have you tried listening to SACD? The high sample rate might not give you more reproduction of audible frequencies, but the difference in arrival times it can encode makes well recorded stereo stuff more interesting to listen to, in my limited experience.

itp10y ago

I know this seems counterintuitive, but there is literally no difference in arrival times (over audible frequencies) that can be encoded at higher sampling rates. Digital sampling does not quantize over the time domain for any frequency below the Nyquist frequency.

If you have the time, watch the two videos that xiph.org did a few years ago[0]. There's a great in-depth explanation, as well as a hands on demonstration to demonstrate this reality.

[0] https://xiph.org/video/

__david__10y ago

This is directly addressed by the article under the "Sampling fallacies and misconceptions". You don't lose "arrival time" (AKA phase) when you use a lower bitrate. They have a video that explains it very well: http://xiph.org/video/vid2.shtml

splitdisk10y ago

I would be very curious to listen to SACD on some good headphones in a quiet room. Not sure if I've ever even seen a SACD player aside from maybe in the Sony store 10 years ago. The trick would be to find something that would be mastered for the format.

1 more reply

yzhou10y ago· 6 in thread

There's a big difference in impulse response with different sample rates, any one can see it on a oscilloscope, I bet some one can hear the difference.

Those who don't have a oscilloscope can see the picture here: http://i.imgur.com/wY0wzcW.png

nullc10y ago

What you are showing is _precisely_ the effect of low-passing, nothing more, nothing less.

See the digital media primer 2 for more information on that: https://wiki.xiph.org/Videos/Digital_Show_and_Tell

If humans were able to hear audio above 22kHz (or what not) in any meaningful way, we'd expect to be be able to demonstrate that effect in carefully controlled studied and then that lack of low-passing may matter; but that isn't what the best evidence so far shows.

yzhou10y ago

The low-passing with a brick wall filter on 44.1KHz audio can be a bad thing sometimes, for example, pre-echo https://en.wikipedia.org/wiki/Pre-echo You won't hear the pre-echo on a 2.8MHz DSD audio.

yzhou10y ago

In the real world, it is almost impossible to make a voltage divider with 24bit resolution. So all the DAC makers have to convert 24bit audio into lower bits(6bits to 1bits), this step requires oversampling the original audio. It is a lot easier to oversample a 192KHz/24bit audio than a 44.1K/24bit audio, and the ringing is much less after oversampling the 192KHz/24bit audio.

fancyketchup10y ago

The two pictures don't have the same vertical scaling, and it's clear that the probe is ahead of the LPF in the signal chain.

yzhou10y ago

the probe is placed on the headphone jack. The difference in the vertical scaling is that 0dB DSD signal is 6dB below a 0dB PCM.

yzhou10y ago

The brick wall filters used on low sample rate sound cause ringing in the time domain, which can "blur" the neighboring impulse.

some-guy10y ago· 3 in thread

This article hits close to home: before I became a programmer I worked as an audio engineer at a fledgling studio in my hometown.

The amount of misinformation / junk-science in the audio world is preposterous. There's a religious-cult of an industry that feeds off the ignorance and placebos of its participants. I have many friends who swear by their What.cd 24/192 FLAC vinyl rips and spend hundreds of dollars on audiophile AC wall outlets. Not to say that there are no differences in high-end audio equipment, but so much of what's "good" is subjective.

lfam10y ago

In the case of sites like what.cd, I think that FLAC 16/44 rips of CDs and vinyl are useful for creating distributed backups of our cultural corpus. But I agree that 24/192 FLACs of vinyl are ridiculous.

some-guy10y ago

I agree, in fact I very much like the sound of vinyl, but to say it's more "accurate" or of higher fidelity and dynamic range than 16/44 is completely false.

SSLy10y ago

this is the first mention of what.cd outside of the tracker scene i ever saw. funny when you think about it.

pgrote10y ago· 3 in thread

So, what are the better settings for ripping songs?

LeoPanthera10y ago

I rip to FLAC, not because I think it sounds better, but simply because if some newer better codec comes along in the future that will let me compress my songs on my smartphone even smaller (Opus?) I don't want to have to get my CDs out again. I can just transcode from the FLAC files.

aidenn010y ago

when ripping songs you are probably starting out with either 44.1 (CD) or 48kHz (DVD) sound. Just keep whatever the native sampling rate is.

Synaesthesia10y ago

Compression wise I go for 256kbps AAC. It's quite superior to MP3 as a codec.

Retra10y ago· 3 in thread

They are useful if you're resampling them or editing them, but I doubt that's something consumer music services are overly concerned with.

saidajigumi10y ago

I'll note that's the entire point of Monty's (great) article, which has this near the top:

Unfortunately, there is no point to distributing music in 24-bit/192kHz format. Its playback fidelity is slightly inferior to 16/44.1 or 16/48, and it takes up 6 times the space.

This has all been known to anyone with actual signal processing and/or audio engineering knowledge for a long time now. As in, common knowledge to the kinds of folks attending the AES conference at least back to ~2001 or so. The high sample rate/bit depth stuff is useful for production process, but irrelevant for final distribution.

thwest10y ago

There's a reasonable argument that fits within DSP theory that frequencies sampled above audible range could have harmonics down in the audible range.

1 more reply

kuschku10y ago

Or if you apply an equilizer, like lots of people do in consumer applications.

sliverstorm10y ago· 3 in thread

To what can I attribute the consistently horrible quality of 64kHz streams ten or fifteen years ago? Would that fall under the "bad encoder" bucket?

Edit: christ, I mixed up bitrates (e.g. 192kbps) with sampling frequency (e.g. 192kHz) again. I was referring to 64kbps streams.

CamperBob210y ago

64 kHz isn't a standard sample rate -- you're probably thinking of the bit rate of an MP3 or AAC file. A 64-kbit MP3 does sound pretty awful.

sliverstorm10y ago

Yup. Further confusing me was the fact that (if memory serves) Apple did offer MP3's at 192kbps for a while, before upping to 320kbps.

Edit: apparently my memory is worse than I thought.

3 more replies

joosters10y ago

mp3 encoders have gotten better over time. As well as general improvements in fidelity, older encoders had bugs that would cause occasional terrible encoding for fragments of a sample.

gwbas1c10y ago· 3 in thread

This article really misses the facts of the Nyquest-Shannon theory.

In order to decimate a signal to 44.1 or 48khz, and preserve high-frequency content, high frequencies need to be phase-shifted.

This phase-shift is similar to how lossy codecs work.

For what it's worth: I'm a big fan of music in surround, and most of it comes in high sampling rates. When I investigated ripping my DVD-As and Blurays, I found that they never have music over 20khz. It's all filtered out. However, downsampling to 44.1 or 48khz isn't "lossless" because of the phase shift needed due to the Nyquist-Shannon theory.

I still rip my DVD-As at 48khz, though. There isn't a good lossless codec that can preserve phase at high frequencies, yet approach the bitrate of 12/48 flac.

nullc10y ago

> In order to decimate a signal to 44.1 or 48khz, and preserve high-frequency content, high frequencies need to be phase-shifted.

Your understanding of sampling theorem is incorrect. Sampling alone (not quantization, of course) is completely lossless under the critical frequency.

We demonstrated this in a very clear way near the end, at about 21 minutes in, on the primer two video: http://www.xiph.org/video/vid2.shtml where we show a square wave being phase shifted tiny fractions of the intersample length.

itp10y ago

When you say

> In order to decimate a signal to 44.1 or 48khz, and preserve high-frequency content, high frequencies need to be phase-shifted.

What do you mean by high frequency? If you mean frequencies below but near the Nyquist frequency then no, there is no phase shift. If you mean at or above...

I'm struggling to avoid a blatant appeal to authority here, but your position is that the author of the Ogg Vorbis coded doesn't understand digital sampling, which seems challenging to believe.

Hello7110y ago

no, it addresses it fairly clearly (albeit briefly):

> So the math is ideal, but what of real world complications? The most notorious is the band-limiting requirement. Signals with content over the Nyquist frequency must be lowpassed before sampling to avoid aliasing distortion; this analog lowpass is the infamous antialiasing filter. Antialiasing can't be ideal in practice, but modern techniques bring it very close. ...and with that we come to oversampling.

if you accept that the limit of hearing is around 20 kHz, then you must also accept that frequencies above that can freely be removed without loss of fidelity to the human ear.

the article notes that higher frequencies can be heard, but only in the form of ultrasonic intermodulation distortion. (i.e. not in fact the higher frequencies at all)

fla10y ago· 3 in thread

With nowdays bandwidths, why do we keep using destructive compression for songs?

icegreentea10y ago

Lossy vs non-lossy compression is orthogonal to the sampling rate and bit-depth (which is what this article is about). While the MP3 standard effectively means sampling more than 48kHz is useless, there's no reason you can't have a lossy comprssion scheme that attempts to capture higher frequencies.

joosters10y ago

But the article makes a compelling case for why > 48kHz is completely pointless.

Avenger4210y ago

A lot of listening today occurs through streaming services, and huge uncompressed songs will eat into data plans quickly.

weinzierl10y ago· 2 in thread

  Because digital filters have few of the practical  
  limitations of an analog filter, we can complete the 
  anti-aliasing process with greater efficiency and 
  precision digitally. The very high rate raw digital 
  signal passes through a digital anti-aliasing filter, 
  which has no trouble fitting a transition band into a 
  tight space.

I always thought digital anti-aliasing filters were creatures from a fairy-tale world. Much talked about but no one has ever seen one.

My understanding: If you have a an analog filter of a given steepness the only way to further reduce aliasing effects digitally is oversampling. Or less steep (cheaper) analog filter plus oversampling is the same as steeper (more) expensive analog filter. People tend to say digital anti-aliasing filters when they really mean oversampling.

"24/192 music downloads make no sense" seems to be a thoroughly researched and carefully written article. It explains oversampling very well, possible confusion with digital filtering (anti-aliasing or not) is out of question. But then it goes on to talk about digital anti-aliasing filters, which makes me afraid I could be wrong.

Do digital anti-aliasing filters exist?

squeaky-clean10y ago

The digital anti-aliasing filter can only ever work on a digital -> digital signal, but they're still useful in the analog->digital process.

> My understanding: If you have a an analog filter of a given steepness the only way to further reduce aliasing effects digitally is oversampling. Or less steep (cheaper) analog filter plus oversampling is the same as steeper (more) expensive analog filter. People tend to say digital anti-aliasing filters when they really mean oversampling

You're right, and it's actually both. The ADC can run at a much higher sample rate with a cheaper analog filter, and then that digital signal is again passed through a digital filter and downsampled.

hamiltonkibbe10y ago

Yes, but obviously they don't work if your signal is already aliased. The most common example would be when decimating a signal, e.g using a digital filter with a cutoff at 22khz before down sampling from 192kHz to 44.1kHz. This is often realized in a single step... Check out polyphase interpolation/decimation if you're interested in learning more

Johnny55510y ago· 2 in thread

Wouldn't this question be answered with a large-scale double blind trial?

If more people prefer the sound at the higher bitrate and sampling rate, then that's the better format, even if there's no technical reason why that format is superior.

Much like how some people prefer the "warm" sound of tube amps, even if that means more distortion.

upofadown10y ago

From the article:

>Empirical evidence from listening tests backs up the assertion that 44.1kHz/16 bit provides highest-possible fidelity playback.

You can read the article if you want to find the actual references. No one is arguing that higher rates/bits produces any sort of distortion that anyone would prefer.

TheCoelacanth10y ago

> Much like how some people prefer the "warm" sound of tube amps, even if that means more distortion.

The difference from my perspective is that an amp is a tool for sound production while a digital music format is a tool for sound reproduction. When producing sound, choosing more distortion over less distortion is a valid choice. When reproducing sound, the goal should be accurate reproduction of the original.

z3t410y ago· 2 in thread

I can hear insects and buzzing electronic devices, and my partner thinks I'm crazy some times. Thinking I might have golden ears I tested * my range and I could hear up to 18kHz.

* http://onlinetonegenerator.com/hearingtest.html

rubberbandage10y ago

Honestly, depending on your age, that still could be “golden” — I’m 31, I’ve taken very good care of my hearing, I’m very acutely aware of audio subtleties, and my hearing range tops out around 16.5KHz. The so-called standard upper limit of 20KHz really only applies to young children, which is why CD audio being able to reproduce frequencies of 22.05KHz is already beyond ideal, and calls for 48! no, 96!, no, 192! (or higher) is literally insane for playback.

lstamour10y ago

Using a tone generator on my computer and a pair of headphones, I found that I couldn't easily hear past 15-ish myself, then I started turning up the volume, or playing with turning the volume all the way up, then all the way down. Using that technique, I was able to distinguish noise and high pitches up to 20.2khz or so. So I think from now on, if I hear some whine, I'm going to trust that it's there and not my imagination. Of course, it's also the definition of going deaf, I suppose, that I have to turn up the pitches to such a loud volume to hear them in the first place...

tonecluster10y ago· 2 in thread

Some [consumer] digital low-pass filter can benefit from higher sampling rates, leading to an overall better representation of the analog signal up to 20kHz. But there are diminishing returns as the filter "folds" the octaves above 22kHz; A rate of 96k for certain lowpass filters is better than 48k, but at some point there's little (if any) benefit by going to 192k or 384k. For recording studios, go as high as you can in both bit-rate and bit-depth. Especially when you're processing the signal "in the box". Give the software as much data as possible to operate without introducing errors and artifacts. There are diminishing returns there as well, but RTFM for (for example) UA gear and software and you're good to go.

aidenn010y ago

TFA mentions that for recording and mastering there is a use. Furthermore, the headline implies it see the term "downloads" in the title.

tonecluster10y ago

Yep, I read that too. Even so, there are low-pass filters in some consumer gear that benefit from, say, 96k sampling rates and result in better quality sound. This does imply that at 44.1 or 48 they don't represent up to 20kHz properly, of course.

2 more replies

derefr10y ago· 2 in thread

So, no point in 24/192 because it makes no difference in playback... but having lossless downloads is important in part for enabling remix culture? There's a bit of a double-standard here. Maybe I can't hear 24/192 audio, but isn't it better input for sampling?

some-guy10y ago

The article is specifying 24/192 as useless for playback quality only. Halfway down he addresses the benefits of 24/192 for the sake of mixing and mastering different digital audio signals, but a final mix offers no benefit to the human listener when choosing between 16bit/44Khz and 24bit/192Khz.

derefr10y ago

What I was trying to get across is: every file has two potential purposes—listening and serving as input for sampling. So, if we care about enabling "remix culture", wouldn't make sense to offer a "24/192 FLAC" option for download, push DVDA over CD, etc., anyway?

I've never seen the hype from artists about 24/192 as being about better listening experience. It's about handing their consumers a better master so as to encourage and enable more of them to be remixers.

1 more reply

StavrosK10y ago· 2 in thread

Everything else is fine and good in the article, but I can see the infrared in the Apple remote (and all the other IR remotes I've tried). It's faint, but plainly visible. Am I the only one?

glitch10y ago

Went to a dark room with an Apple Remote; let my eyes adjust for a little while. Pressed it many times; I couldn't see the infrared coming from the LED with my naked eye. (But the camera on my iPhone imaged the infrared from the remote's LED.) I envy your biological wavelength detection.

StavrosK10y ago

Hmm, that's odd. I've noticed this with lots of remotes, I usually just look at them to tell if the batteries are dead. I wonder why I can see it.

lmm10y ago· 2 in thread

Dithering is a horrible thing to be doing, and 44.1 is an awkward rate. So while I agree that 192khz is dumb, 24/48 is a better standard than CD.

ska10y ago

No, dithering (properly) is usually what you should be doing when you quantize.

See Vanderkooy and Lipshitz 1987 for why.

lmm10y ago

Paper seems to be paywalled. I can't imagine any possible purpose for dithering before encoding that wouldn't be better served by dithering on playback.

4 more replies

acd10y ago· 1 in thread

Try what age is your ears https://www.youtube.com/watch?v=VxcbppCX6Rk

Or generate a tone sweep in audacity. Generate->chirp http://www.audacityteam.org/

You loose the ability to hear high frequency sounds as you age.

Personally I can hear up to about 14kHZ

lstamour10y ago

Huh. Downloaded a tone generator, and found that while in the video I heard a series of clicks at 16khz and beyond, I could in fact hear 16khz if I raised and lowered the volume from nothing to loudest in the app. It sounded like a whine and much harder to hear than I expected, distinguished most easily by when it quickly went from present to not present and back. In fact, I kept going up the scale doing that, and raising the volume, and found that I was able to hear even 19 to low 20khz as a high pitched noise, very quiet even at -6db. So ... yeah, probably does me no good considering that the loudness of other pitches makes it near impossible for me to hear anything practical in those frequencies. Of course, then I go to listen to music and wow, I can hear all this detail. I think I trained my ears for it, or I'm losing it. ;-)

S_A_P10y ago· 1 in thread

I just recently purchased Izotope Ozone 7 advanced. One feature it has is "codec preview" which lets you "solo" the codec artifacts for MP3 and AAC format. Even at high but rates it's amazing how swishy bit reduction sounds. It also made me realize what I was hearing with mp3s was artifacts from compression. That said, it's not unlike tape hiss or vinyl noise. In fact I think it can have its own charm and in some cases make the music sound more full. It's also probably why 24/192 digital audio can sound so "cold" or lifeless.

beat10y ago

From mastering records at home, I've found that in all but the most golden-ears focused listening, I can't hear the difference between 192 bit mp3 and 44.1/16 cd quality. But 128 bit mp3 is audibly degraded and irritating.

That's a pretty cool feature for Ozone 7, for sure! I'm still using Ozone 5 and don't feel a need to upgrade, but that might make it...

idlewords10y ago· 1 in thread

I love the idea the author mentions in passing of a dedicated speaker assembly for ultrasonics. This seems like something that could be a huge margin business, and the parts costs would be as low as you wanted.

xiphmont10y ago

You're late to the game. Such high margin products have existed for some time. There are even published papers about them()!
The published papers tend to be by the same people making the supertweeters

guelo10y ago

(2012)

Previous discussion https://news.ycombinator.com/item?id=3668310

PaulHoule10y ago

It is not just headphones that are the problem, it is the speakers.

People today are often amazed when they listen to CD or turntable content through 70's era crossover speakers. Back in the 70's you'd have a stereo with 2 "speakers" that each had 3 subspeakers for a total of six speakers. The fad today is to have 5.1 sound with a single driver in each satellite, also a total of six speakers. The spatial resolution increase is good for movies, games and TV but surround sound in music is marginal. An amazing number of old "classic rock" recordings were done in quad and anything by Donald Fagan will sound pretty good w/ Dolby Pro Logic, there are some more recent Bjork recordings, but almost everything is mixed for stereo and what you loose in frequency response is not compensated by anything, except perhaps the ability to produce more volume with more speakers.

aidenn010y ago

If you want to know more, Monty made one of the best intros to digital sampling I've ever seen: https://www.xiph.org/video/vid2.shtml

ChuckMcM10y ago

I have a pair of Roger Sound Labs studio monitors for my speakers at home. I got to look at their insides when a technician was replacing a blown midrange speaker (they have a "lifetime" warranty, however that warranty expired when RSL did). Looking at the cross over filter network I could see a network selecting for frequencies > 20khz and it was shunted to a resistor. I asked about it, and the reponse was exactly like the authors, by filtering out signals higher than the tweeter could reproduce, they improved the listening experience.

It made sense to me, and I love how the speakers sound. Understanding is not inserting distortion makes even more sense.

rphlx10y ago

24/192 lossless is a digital Veblen good; some people will pay more for it (and/or the HW to play it & store it), and almost all of them will enjoy it more, if only because it costs more. Whether it actually sounds better is somewhat tangential.

lips10y ago

My third time reading this, and a new question popped into my head: Are there any volume adjustments (on software or hardware) that take into account the pain threshold curves? That is, volume adjustments that aren't flat, but that will attenuate the frequencies that will cause discomfort at the lowest volumes?

JadeNB10y ago

It may be worth noting (though it doesn't change any of the science) that this is from 2012.

iamleppert10y ago

This is totally offtopic but I can't stand the "XXX considered harmful" stuff. I had to rage-quit the article.

j / k navigate · click thread line to collapse

228 comments

133 comments · 30 top-level

hatsunearu10y ago· 22 in thread

Recommended reading: https://en.wikipedia.org/wiki/Loudness_war

edit: compression as in dynamic range compression, not data compression like mp3 in audio

beat10y ago

I've played acoustic and electric instruments for over 30 years, and recorded numerous albums.

Compression is the best tool we have for accurately reproducing the musicality and emotion of a musical performance. Without compression, most recordings would be unlistenable.

Don't confuse the foolishness of the loudness wars for "compression is bad". That's like saying the internet is bad because there's porn on it.

chillingeffect10y ago

I respect your opinion, but it's only your opinion, not a global truth.

Also their vocals have no dynamics. When he yells loud, the vocals don't get louder but the timbre changes. That changes it from cathartic to strained. The dynamics have all been flattened.

> That's like saying the internet is bad because there's porn on it.

Yes but on the internet you can go where there is no porn. Where can you find music with no compression?

6 more replies

bootload10y ago

"Compression is the best tool we have for accurately reproducing the musicality and emotion of a musical performance. Without compression, most recordings would be unlistenable."

That is your artistic choice, as it should be.

Not every artist wants this to happen. They have no choice and listeners get a fraction of the sound recorded. This was not the case with vinyl.

@mborch, the exact compression method is of less importance than recognising that for all the compression being discussed, is a retrograde step from vinyl. Why?

[0] http://allthingsd.com/20120131/neil-young-and-the-sound-of-m...

2 more replies

hammock10y ago

Eiriksmal10y ago

The source song, PSY's Gagnam Style, is the epitome of modern pop. I got a 3/10 on the listening test on a decent pair of Sennheiser headphones in a quiet room.

[0] http://www.audiocheck.net/blindtests_16vs8bit.php

3 more replies

copperx10y ago

This is true. HDTracks.com offers one of the Green Day records with no mastering compression.

1 more reply

nosuchthing10y ago

If you listen to pop recordings between 1950-1980, you'll notice there's more depth and character to every instrument.

Yes, dynamic range compression is currently used/abused extensively in pop music productions, but if mixes weren't compressed they would have a much wider range for the sounds to play around in.

  Not only is Justin Bieber's My World 2.0 louder than 
  Metallica's The Black Album, it's louder than The Sex 
  Pistols' Never Mind The Bollocks.

http://www.sonicstate.com/news/2011/02/21/why-is-justin-bieb...

1 more reply

kinghajj10y ago

2 more replies

muraiki10y ago

Reading "compression" as both a programmer and audio-minded person made that first sentence difficult to parse at first. :)

But yeah, it's obvious when I have my car stereo nearly on max to listen to classical or jazz, and then if I turn the radio and get a pop music station my ears are about to explode.

JonnieCache10y ago

http://productionadvice.co.uk/lufs-dbfs-rms/

4 more replies

bitwize10y ago

2 more replies

darkmighty10y ago

I have mixed feelings about it. But I fully agree all music should be uncompressed by default.

donatj10y ago

I'm truly curious how having good isolation would help your hearing? Is the room noise a large additive?

1 more reply

pseudosavant10y ago

thrownaway242410y ago

sagawee10y ago

So slowly, the over-usage of compression does not give music producers and broadcasters any advantage anymore and beautiful dynamic music will be competitive again.

I have collected some links [2] about this topic. Because of the lack of any affordable implementation at the time I created one myself [3] with some additional notes [4].

snissn10y ago

bootload10y ago

This is a good point. Further recommended listening, "Neil Young on Why High-Resolution Music Matters" ~ https://www.youtube.com/watch?v=5oTtylYR76o (55min, 2015JAN17)

CamperBob210y ago

Edit: Some useful educational material to read before moderating: https://en.wikipedia.org/wiki/Loudness_war .

rudolf010y ago

Excessive data compression will also lower dynamic range, though, no?

klodolph10y ago

Not really, no.

Maybe if you really mangled your audio by encoding at extremely low bit rates.

But in general, no.

2 more replies

maloney10y ago

I think you are confusing limiting with compression.

earlz10y ago· 15 in thread

Now that being said, if I hadn't heard the song a million times in it's lossless form from trying to mix it, I probably wouldn't have noticed, and even then it didn't actually affect my "experience".

ska10y ago

The article is pretty clear about this too - higher bitdepths and sampling rates can be quite useful in mixing and recording situations.

They're pointless for playback.

niels_olson10y ago

> higher bitdepths and sampling rates can be quite useful in mixing

2 more replies

alkonaut10y ago

3 more replies

rcthompson10y ago

I think the point was that sometimes you do want to apply some effect to the sound at playback time, e.g. an equalizer, and in that case a higher bit depth could maybe conceivably become useful.

TheOtherHobbes10y ago

No they're not. And no matter how many times this gets linked to on the Internet, it's still wrong.

The basic problem: the quieter a sound or detail gets, the fewer bits of resolution are used to represent it.

In 16-bit recording, there simply aren't enough bits to represent very low level details without distorting them with a subtle but audible crunchy digital halo of quantisation noise.

In a 24-bit recording, there are.

These levels don't change for different bit depths, and they're used on both analog and digital equipment. (In fact they've been standard for decades now.)

This is why using more bits does not mean you can "reproduce music with a bigger dynamic range" - not without turning the volume up, anyway.

What actually happens is that the maximum possible volume of a playback system stays the same, but quieter sounds are reproduced with more or less accuracy.

The effect isn't glaringly disturbing, and adding dither helps make it even less obvious. But it's still there.

24-bit doesn't need tricks like dither - because it does the job properly in the first place.

Now - whether or not commercial recordings have enough musical detail to take full advantage of 24-bits is a different question. For various reasons - compression, mastering, cheapness - many don't.

6 more replies

pizza23410y ago

Compressed formats are not really relevant to considerations about the bit depth itself.

Secondly, I'm not sure if you process further the MP3s (when you refer to mixing), but if you do, you're definitely going to make noticeable, artifacts which weren't so in the unprocessed MP3 form.

earlz10y ago

hatsunearu10y ago

Old thread, but I thought I made it clear that I wasn't talking about data compression, but rather dynamic range compression.

mmastrac10y ago

MP3's perceptual model can still throw away information at the highest quality settings. FLAC doesn't throw away anything.

It's possible you are just hearing the difference between codecs. You'd have a fairer comparison with 24-bit vs 16-bit FLAC.

radicalbyte10y ago

Cymbals are the biggest tell for compressed music. Even with crappy speakers they sound very strange.

copperx10y ago

SiruisXM satellite radio is probably the worst offender. Makes music unlistenable to me.

Even 128Kbps MP3s render cymbals better.

nullc10y ago

" can tell the difference from effects in audio mixing."

Yes, non-linear effects can be sample rate sensitive. However-- this really means that their internal model is aliasing and not faithfully simulating an infinite sample rate system.

In any case, the article is all about distribution. Having excess rate in mastering is cheap and harmless, and-- because of these reasons, can be practically pretty useful.

malandrew10y ago

For those interesting in different versions of an album where it was mastered by a different audio engineer, you should check out the Steve Hoffman forums: http://forums.stevehoffman.tv/

thrownaway242410y ago

Or even the same engineer years later.

shams9310y ago

That's your processing overhead, you can mess with the sound a lot more at 96k before you hear audio issues.

The difference you hear is the difference between flac's lossless format and mp3's lossy format it has nothing to do with 16 bit versus 24 bit.

andy_ppp10y ago· 10 in thread

I was listening to Marvin Gaye on my friends system and I could hear that there were several different backing singers all moving and at different distances from the microphone.

JonnieCache10y ago

>Why do high quality DACs clearly sound better then? And they sound better with better files. Maybe it really is all on my head but I mean listening to a £20000 hifi the other day...

You were listening to £20,000 worth of amps and speakers, and you were most likely in an acoustically treated room.

sanoli10y ago

> higher end DACs do sound better

This hasn't passed the blind tests either. A good, 100 dollar dac (a schiit or an odac) will sound just as good as a 1000 dollar dac.

1 more reply

lawnchair_larry10y ago

And if you were playing vinyl, there wasn't even a DAC present in the signal chain :)

cthalupa10y ago

>Vinyl actually has far less fidelity.

Vinyl mastering is sometimes better than CD mastering though, due to the loudness war.

I would love to sell my turntable and vinyl collection and rely purely on digital formats. Takes up less space, technically superior format, etc.

But one thing keeps me buying vinyl:

AWFUL mastering on CDs. A significant portion of LPs are released with more normal mastering on the vinyl, while the CD will be brickwalled all to hell.

Until they go back to not murdering CD mastering, I'll continue buying vinyl :(

(I know your comment isn't directly about vinyl being bad or anything - I just have a compulsion to bitch about the loudness war any chance I can)

1 more reply

criddell10y ago

> You also physically change the recording every time you play it back

Not on a laser stylus turntable.

http://diffuser.fm/laser-turntable/

1 more reply

oberstein10y ago

codazoda10y ago

That is awesome. At first, I actually thought people around me were making the noise and speaking. I thought the audio sample had not yet started.

ianferrel10y ago

Wine tastes better when it's poured out of fancy bottles, too.

The article mentions just such a study performed with high end equipment.

aidenn010y ago

Better masters are the key difference. See for example [1]; everyone agreed that the DVD-A and SACDs sounded better when truncated to 16 bits then a printed CD.

1: http://drewdaniels.com/audible.pdf

justin6610y ago

See listening tests section of TFA:http://people.xiph.org/~xiphmont/demo/neil-young.html#toc_lt

rplst810y ago· 8 in thread

That said, there's one thing the article does not address and that is "beating", or really inter-modulation distortion from instrumental overtones.

cynicalkane10y ago

hamiltonkibbe10y ago

thescriptkiddie10y ago

1 more reply

tonecluster10y ago

"...and cause inter-modulation products in the audible range." AFAIK, this is true in acoustic environments under conditions as described in the original post.

1 more reply

darkmighty10y ago

You can actually do this experiment yourself if you have a signal generator that can do 1Hz amplitude modulation and drive a transducer with a non-negligible sensitivity in that range.

vadman10y ago

But the within-audible-range-"beat" at the recorded "listening position" (where the microphones are located) would be recorded anyway, no?[1] So how does hi-res audio help in this case?

xiphmont10y ago

>That said, there's one thing the article does not address and that is "beating", or really inter-modulation distortion from instrumental overtones.

Beating and intermodulation distortion are entirely different things. They look similar on an oscilloscope, but they're not and they don't sound the same.

>Instruments are not limited to 20-20kHz. They can have overtones well above this range.

Correct. You can't hear the overtones beyond the upper portion of the hearing range (many people believe you can).

>In a real listening environment (i.e. live performance) these overtones have a chance to interact with one another in the air.

You're hearing the audible _result_ of IMD, you're not somehow listening to the distortion curve itself.

> It is possible that these overtones may beat with one another

> and cause inter-modulation products in the audible range.

If it wasn't audible in the original performance, it should not be part of the recording, and it should not be part of the playback.

Keyframe10y ago

Dumb question, but wouldn't you need to reproduce position of audio sources as well in order to replicate that?

splitdisk10y ago· 7 in thread

klodolph10y ago

24bit also means we don't have to record at 0dBFS, which saves a lot of time.

splitdisk10y ago

1 more reply

toast010y ago

jchrisa10y ago

itp10y ago

If you have the time, watch the two videos that xiph.org did a few years ago[0]. There's a great in-depth explanation, as well as a hands on demonstration to demonstrate this reality.

[0] https://xiph.org/video/

__david__10y ago

splitdisk10y ago

1 more reply

yzhou10y ago· 6 in thread

There's a big difference in impulse response with different sample rates, any one can see it on a oscilloscope, I bet some one can hear the difference.

Those who don't have a oscilloscope can see the picture here: http://i.imgur.com/wY0wzcW.png

nullc10y ago

What you are showing is _precisely_ the effect of low-passing, nothing more, nothing less.

See the digital media primer 2 for more information on that: https://wiki.xiph.org/Videos/Digital_Show_and_Tell

yzhou10y ago

The low-passing with a brick wall filter on 44.1KHz audio can be a bad thing sometimes, for example, pre-echo https://en.wikipedia.org/wiki/Pre-echo You won't hear the pre-echo on a 2.8MHz DSD audio.

yzhou10y ago

fancyketchup10y ago

The two pictures don't have the same vertical scaling, and it's clear that the probe is ahead of the LPF in the signal chain.

yzhou10y ago

the probe is placed on the headphone jack. The difference in the vertical scaling is that 0dB DSD signal is 6dB below a 0dB PCM.

yzhou10y ago

The brick wall filters used on low sample rate sound cause ringing in the time domain, which can "blur" the neighboring impulse.

some-guy10y ago· 3 in thread

This article hits close to home: before I became a programmer I worked as an audio engineer at a fledgling studio in my hometown.

lfam10y ago

some-guy10y ago

I agree, in fact I very much like the sound of vinyl, but to say it's more "accurate" or of higher fidelity and dynamic range than 16/44 is completely false.

SSLy10y ago

this is the first mention of what.cd outside of the tracker scene i ever saw. funny when you think about it.

pgrote10y ago· 3 in thread

So, what are the better settings for ripping songs?

LeoPanthera10y ago

aidenn010y ago

when ripping songs you are probably starting out with either 44.1 (CD) or 48kHz (DVD) sound. Just keep whatever the native sampling rate is.

Synaesthesia10y ago

Compression wise I go for 256kbps AAC. It's quite superior to MP3 as a codec.

Retra10y ago· 3 in thread

They are useful if you're resampling them or editing them, but I doubt that's something consumer music services are overly concerned with.

saidajigumi10y ago

I'll note that's the entire point of Monty's (great) article, which has this near the top:

Unfortunately, there is no point to distributing music in 24-bit/192kHz format. Its playback fidelity is slightly inferior to 16/44.1 or 16/48, and it takes up 6 times the space.

thwest10y ago

There's a reasonable argument that fits within DSP theory that frequencies sampled above audible range could have harmonics down in the audible range.

1 more reply

kuschku10y ago

Or if you apply an equilizer, like lots of people do in consumer applications.

sliverstorm10y ago· 3 in thread

To what can I attribute the consistently horrible quality of 64kHz streams ten or fifteen years ago? Would that fall under the "bad encoder" bucket?

Edit: christ, I mixed up bitrates (e.g. 192kbps) with sampling frequency (e.g. 192kHz) again. I was referring to 64kbps streams.

CamperBob210y ago

64 kHz isn't a standard sample rate -- you're probably thinking of the bit rate of an MP3 or AAC file. A 64-kbit MP3 does sound pretty awful.

sliverstorm10y ago

Yup. Further confusing me was the fact that (if memory serves) Apple did offer MP3's at 192kbps for a while, before upping to 320kbps.

Edit: apparently my memory is worse than I thought.

3 more replies

joosters10y ago

mp3 encoders have gotten better over time. As well as general improvements in fidelity, older encoders had bugs that would cause occasional terrible encoding for fragments of a sample.

gwbas1c10y ago· 3 in thread

This article really misses the facts of the Nyquest-Shannon theory.

In order to decimate a signal to 44.1 or 48khz, and preserve high-frequency content, high frequencies need to be phase-shifted.

This phase-shift is similar to how lossy codecs work.

I still rip my DVD-As at 48khz, though. There isn't a good lossless codec that can preserve phase at high frequencies, yet approach the bitrate of 12/48 flac.

nullc10y ago

> In order to decimate a signal to 44.1 or 48khz, and preserve high-frequency content, high frequencies need to be phase-shifted.

Your understanding of sampling theorem is incorrect. Sampling alone (not quantization, of course) is completely lossless under the critical frequency.

itp10y ago

When you say

> In order to decimate a signal to 44.1 or 48khz, and preserve high-frequency content, high frequencies need to be phase-shifted.

What do you mean by high frequency? If you mean frequencies below but near the Nyquist frequency then no, there is no phase shift. If you mean at or above...

I'm struggling to avoid a blatant appeal to authority here, but your position is that the author of the Ogg Vorbis coded doesn't understand digital sampling, which seems challenging to believe.

Hello7110y ago

no, it addresses it fairly clearly (albeit briefly):

if you accept that the limit of hearing is around 20 kHz, then you must also accept that frequencies above that can freely be removed without loss of fidelity to the human ear.

the article notes that higher frequencies can be heard, but only in the form of ultrasonic intermodulation distortion. (i.e. not in fact the higher frequencies at all)

fla10y ago· 3 in thread

With nowdays bandwidths, why do we keep using destructive compression for songs?

icegreentea10y ago

joosters10y ago

But the article makes a compelling case for why > 48kHz is completely pointless.

Avenger4210y ago

A lot of listening today occurs through streaming services, and huge uncompressed songs will eat into data plans quickly.

weinzierl10y ago· 2 in thread

  Because digital filters have few of the practical  
  limitations of an analog filter, we can complete the 
  anti-aliasing process with greater efficiency and 
  precision digitally. The very high rate raw digital 
  signal passes through a digital anti-aliasing filter, 
  which has no trouble fitting a transition band into a 
  tight space.

I always thought digital anti-aliasing filters were creatures from a fairy-tale world. Much talked about but no one has ever seen one.

Do digital anti-aliasing filters exist?

squeaky-clean10y ago

The digital anti-aliasing filter can only ever work on a digital -> digital signal, but they're still useful in the analog->digital process.

You're right, and it's actually both. The ADC can run at a much higher sample rate with a cheaper analog filter, and then that digital signal is again passed through a digital filter and downsampled.

hamiltonkibbe10y ago

Johnny55510y ago· 2 in thread

Wouldn't this question be answered with a large-scale double blind trial?

If more people prefer the sound at the higher bitrate and sampling rate, then that's the better format, even if there's no technical reason why that format is superior.

Much like how some people prefer the "warm" sound of tube amps, even if that means more distortion.

upofadown10y ago

From the article:

>Empirical evidence from listening tests backs up the assertion that 44.1kHz/16 bit provides highest-possible fidelity playback.

You can read the article if you want to find the actual references. No one is arguing that higher rates/bits produces any sort of distortion that anyone would prefer.

TheCoelacanth10y ago

> Much like how some people prefer the "warm" sound of tube amps, even if that means more distortion.

z3t410y ago· 2 in thread

I can hear insects and buzzing electronic devices, and my partner thinks I'm crazy some times. Thinking I might have golden ears I tested * my range and I could hear up to 18kHz.

* http://onlinetonegenerator.com/hearingtest.html

rubberbandage10y ago

lstamour10y ago

tonecluster10y ago· 2 in thread

aidenn010y ago

TFA mentions that for recording and mastering there is a use. Furthermore, the headline implies it see the term "downloads" in the title.

tonecluster10y ago

2 more replies

derefr10y ago· 2 in thread

some-guy10y ago

derefr10y ago

1 more reply

StavrosK10y ago· 2 in thread

Everything else is fine and good in the article, but I can see the infrared in the Apple remote (and all the other IR remotes I've tried). It's faint, but plainly visible. Am I the only one?

glitch10y ago

StavrosK10y ago

Hmm, that's odd. I've noticed this with lots of remotes, I usually just look at them to tell if the batteries are dead. I wonder why I can see it.

lmm10y ago· 2 in thread

Dithering is a horrible thing to be doing, and 44.1 is an awkward rate. So while I agree that 192khz is dumb, 24/48 is a better standard than CD.

ska10y ago

No, dithering (properly) is usually what you should be doing when you quantize.

See Vanderkooy and Lipshitz 1987 for why.

lmm10y ago

Paper seems to be paywalled. I can't imagine any possible purpose for dithering before encoding that wouldn't be better served by dithering on playback.

4 more replies

acd10y ago· 1 in thread

Try what age is your ears https://www.youtube.com/watch?v=VxcbppCX6Rk

Or generate a tone sweep in audacity. Generate->chirp http://www.audacityteam.org/

You loose the ability to hear high frequency sounds as you age.

Personally I can hear up to about 14kHZ

lstamour10y ago

S_A_P10y ago· 1 in thread

beat10y ago

That's a pretty cool feature for Ozone 7, for sure! I'm still using Ozone 5 and don't feel a need to upgrade, but that might make it...

idlewords10y ago· 1 in thread

xiphmont10y ago

guelo10y ago

(2012)

Previous discussion https://news.ycombinator.com/item?id=3668310

PaulHoule10y ago

It is not just headphones that are the problem, it is the speakers.

aidenn010y ago

If you want to know more, Monty made one of the best intros to digital sampling I've ever seen: https://www.xiph.org/video/vid2.shtml

ChuckMcM10y ago

It made sense to me, and I love how the speakers sound. Understanding is not inserting distortion makes even more sense.

rphlx10y ago

lips10y ago

JadeNB10y ago

It may be worth noting (though it doesn't change any of the science) that this is from 2012.

iamleppert10y ago

This is totally offtopic but I can't stand the "XXX considered harmful" stuff. I had to rage-quit the article.

j / k navigate · click thread line to collapse