AC3Filter ver $ver

SourceForge Logo
English    Russian    Italian   
Donate! Download Forum Source forge project

AC3Filter is a high quality free DirectShow filter designed for real time audio decoding and processing. It has a priority on wide functionality and convenient settings. Filter can decode following audio formats: AC3/DTS/MPEG Audio. It also supports multi-channel and digital (SPDIF) outputs.

Main Features:

Contents

  1. Download
  2. Install
  3. Uninstall
  4. System setup
    1. Windows
    2. Setup for analog multichannel outputmode
    3. Setup for digital (SPDIF) output mode
    4. Subwoofer setup
    5. Filter setup
  5. Player setup
    1. Compatibility table
    2. Media Player Classics
    3. Windows Media Player 6.4
    4. Windows Media Player 7+
    5. BSPlayer
    6. ZoomPlayer
    7. LightAlloy
  6. Filter settings
    1. Filter settings window opening
    2. Main settings
    3. Mixer settings
    4. Gains
    5. System settings
  7. Theory
    1. Loudness and dynamic range
    2. What is LFE?
    3. SPDIF
  8. Distribution
  9. Feedback
  10. Help to the project
  11. Changelog

1. Download.

ac3filter ver $ver: $download_bin
All versions: http://sourceforge.net/projects/ac3filter

New versions of the program can be found at the following site:
http://ac3filter.sourceforge.net - official site
http://sourceforge.net/forum/?group_id=66022 - AC3Filter forum
http://sourceforge.net/projects/ac3filter - project page on Sourceforge - errors, requests...

2. Install.

Run the installation program and follow the instructions. Additional system settings may be required for using of multi-channel modes and digital output mode (refer to 4. System setup).

3. Uninstall.

Run uninstall program from the Start menu or from the Control Panel.

All saved settings of matrices, equalizers, etc. will be lost!

Filter can be installed together with other programs (codec-packs, players…). In this case, filter can be uninstalled only together with that program. If you have no idea about what program filter was installed with or you are going to uninstall the filter but not the program you can uninstall the filter independently as follows:

4. System setup

This section describes one-time settings, which should be made after the filter installation. The most compatible operation mode is 16-bit stereo mode (not multi-channel), which is the default filter operating mode. This mode is compatible with all Windows versions and all audio cards. There is no need for additional settings to operate in this mode. For operating in multi-channel modes and in digital output mode some system settings may be required.

4.1. Windows

Compatibility table
Win95, Win98 Win98SE WinNT (3) Win2000 WinXP
mono/stereo 16bit + + + + +
multichannel - (?) WaveOut (1) SP6 + +
SPDIF - (?) WaveOut (1) + ïàò÷ (2) - (?) SP2 +
1) refer to 6.5. System settings
2) You can download the patch from here:
server1:http://ac3filter.sourceforge.net/download/269601USA8.EXE
server2:ftp://download.intel.com/design/motherbd/bt2/269601USA8.EXE
server3:ftp://ftp.asus.com.tw/pub/ASUS/beta/mb/driver/269601USA8.EXE
patch effect is described in http://www.microsoft.com/hwdev/tech/audio/Non-PCM.asp.
3) Audio card must support 48 kHz output mode

4.2. Setup for analog multichannel output mode

Configure speakers in filter settings (refer to 6.2. Main settings). Some audio cards operates incorrectly with odd number of channels (for instance, 2/2+SW 4.1 quadro or 3/0 3 front). Therefore, such modes should be avoided, if possible. Dolby Surround/ProLogic and Dolby ProLogic2 modes are not multi-channel modes. Moreover, modes with subwoofer (2/0+SW 2.1 stereo, 4/0+SW 4.1 quadro) should not be used if acoustics has no separate input for subwoofer (refer to 7.2. What is LFE?).

Set speakers mode in Control Panel->Sounds and Multimedia->Audio->Sound Playback->Advanced. Furthermore, audio card drivers often have their own utilities for switching of speakers configuration. It is necessary to set the proper configuration at both locations - within audio card driver settings and within the Control Panel.

For Creative audio card it is necessary to switch off all sound effects and equalizers, otherwise there can be a loss of central/rear channels.

Audio card can operate in two modes: DirectSound and WaveOut. In some cases, the multi-channel mode can operate only in one of those modes and cannot operate in another one. If that is the case, output device should be changed in player settings. To get information whether the player supports the device changing function, refer to the table of compatibility in 5. Player setup section (Renderer column). If the player has no such function, it is possible to change the default system output device (refer to 6.5. System settings ). However, this option changes the global system settings; therefore, it is recommended to use it only if the player does not support the mentioned function.

4.3. Setup for digital (SPDIF) output mode

(It is recommended to read 7.3 SPDIF section firstly)

SPDIF mode is required only in case of digital connection to external decoder/receiver exist. It is important previously to ensure whether the receiver connected correctly (refer to audio card and receiver documentation).

It is necessary to check SPDIF box in filter settings to switch the SPDIF mode on (refer to 6.2. Main settings)

Digital connection supports different formats of data transfer (PCM, AC3, DTS, MPEG Audio). Refer to receiver documentation to get information about the formats supported by receiver. In most cases only AC3 format is supported - if that is the case, there is no need for additional settings (it is active by default). If the receiver supports some other formats, they can be indicated in filter settings (refer to 6.5. System settings ).

Some drivers have an ability of AC3 decoding. If such option is available it should be disabled (refer to audio card manual).

Audio card can operate in two modes: DirectSound and WaveOut. In some cases, the SPDIF mode can operate only in one of those modes and cannot operate in another one. If that is the case, output device should be changed in player settings. To get information whether the player supports the device changing function, refer to the table of compatibility in 5. Player setup section (Renderer column). If the player has no such function, it is possible to change the default system output device (refer to 6.5. System settings). However, this option changes the global system settings; therefore, it is recommended to use it only if the player does not support the mentioned function.

Most of audio cards can operate in digital transfer mode only with 48000 Hz sample rate. Therefore, in all other cases the digital transfer is impossible at all. Current digital output mode can be checked during playback on the Main page of filter settings (refer to 6.2. Main settings)

4.4. Subwoofer setup

(It is recommended to read 7.2. What is LFE? section firstly)

If acoustics has no separate input for subwoofer, modes with subwoofer ( 2/0+SW, 4/0+SW) should not be used (refer to 7.2. What is LFE?).

It is recommended to activate bass redirection for the modes with subwoofer. In acoustics documentation, information about bandwidth of speakers and subwoofer should be found; it is recommended to set the cutoff frequency equal to lower bound for main speakers and to upper bound of subwoofer. For instance, if the bandwidth of main speakers is 100 Hz - 20 kHz and that of subwoofer is 30 Hz - 100 Hz then the cutoff frequency must be set at 100 Hz. In most cases, the default setting is satisfactory, so there is no need to change anything.

Some audio cards have their own bass redirection control settings. It is not recommended to activate both options at the same time! This can result in significant distortions. Bass redirection must be switched on either in filter only or in audio card settings only.

4.5. Filter setup

After installation filter has already all settings and in general, it does not require any additional settings. Therefore, if you are not familiar with it, do not make any changes. It is recommended to modify settings only if there are some problems with sound. However, if you are inquisitive and you like to play with settings, please, refer to 7. Theory section.

5. Player setup.

This section briefly describes the most known players and possibility of their operation together with AC3Filter, abilities of players and required settings.

5.1 Compatibility table

Player Abbr Site License AVI DVD Filters Renderer Decoder Supported Details
Media Player Classics MPC (site) OpenSource + + + + + + (details)
Windows Media Player v6.4 WMP64 (site) Freeware + - + - - + (details)
Windows Media Player 7+ WMP7+ (site) Freeware + + - - - + (details)
BSPlayer BSP (site) Freeware + - + + + + (details)
ZoomPlayer ZP (site) Freeware + - + + + + (details)
ZoomPlayerPro ZPP (site) Shareware + + + + + + (details)
LightAlloy LA (site) Freeware + + + + - + (details)
PowerDVD
WinDVD
PDVD
WDVD
(site)
(site)
Shareware
Shareware
+
+
+
+
-
-
-
-
-
-
AVI only
AVI only
(details)
Abbr - abbreviated player name.
AVI - ability to play AVI files.
DVD - ability to play DVD files.
Filters - player supports filter configuring feature.
Renderer - player supports output device changing feature.
Decoder - player supports decoder changing feature.
Supported - player has an ability to use the AC3Filter.

It is clear that the table is far from being complete, therefore any corrections and additions are welcomed.

5.2. Media Player Classics

It is one of the most multifunctional players. Though it has a simple look and a small memory size (~1Mb), of player surrenders only ZoomPlayer by functionality. Furthermore the player has a great number of internal decoders (there is no need to install any codec-packs and other external decoders). It supports DVD playing, Flash animation (with rewind function!), RealVideo/RealAudio, QuickTime and many others. This player is reliable, stable and has no special requirements for resources (my favorite player :-).

During watching filter settings can be called by right button click on the frame, then select Filters->AC3Filter in drop-down menu:

Player settings can be called from View->Options menu. To make the player to use AC3Filter player settings should be changed as shown in the figure:

If there are several decoders installed in the system, decoder with the highest priority will be in use. To modify the filter priority, refer to 6.5. System settings . The player can be configured for continuous use of AC3Filter independently of current system settings. For that, enter the Overrides section of player settings, press AddFilter button and then select AC3Filter from the list. That will result as follows :

5.3. Windows Media Player 6.4

Simple player delivered within all Windows versions from Win98 (Win98, WinME, Win2000, WinXP, ...). Player is located at C:\Program files\Windows Media Player\mplayer2.exe. This player is extremely reliable, has no special requirements for resources and utilizes standard playback methods only; therefore, it is essential for detection of different problems. Many functions are not available but the player excellently copes with its general task - file playing. If there, any playing problems appear when using other players it is recommended to try this player.

During watching filter settings can be called from File->Properties->Advanced menu:

Player has no settings for specific AC3Filter use, therefore if there are several decoders installed in the system, the decoder with the highest priority will be in use. To modify the filter priority, refer to 6.5. System settings.

5.4. Windows Media Player 7+

This family of players has no ability of calling the filter settings during watching. Thus, you cannot know whether AC3Filter or some other decoder is in use. Also, you cannot change the settings and monitor the current activity (properties of current audio stream, input/output levels, DRC activity, etc.) during playing. The single exception is the Windows Media Player 9 in DVD playing mode only. In this case, the filter settings can be called from Tools->Options menu (this option will not take effect when watching .avi files or files of other formats).

In all other cases, it is possible to configure the filter only by configuration utility before watching the film.

Player has no settings for specific AC3Filter use, therefore if there are several decoders installed in the system, the decoder with the highest priority will be in use. To modify the filter priority, refer to 6.5. System settings .

5.5. BSPlayer

Multifunctional player with skins support.

Filter settings can be called by right button click on the frame. Then select Options->Filters->AC3Filter in drop-down menu:

Player settings can be called from Options->Preferences menu. To make the player to use AC3Filter you should change player settings as shown in the figure:

5.6. ZoomPlayer

This is the most multifunctional player. There is no sense to specify all functions because of their great number.

Filter settings can be called by right button click on the frame. Then select Filter Properties->AC3Filter in drop-down menu:

5.7. LightAlloy

Filter settings calling is shown in the picture:

Player has no settings for specific AC3Filter use, therefore if there were several decoders installed in the system, the decoder with the highest priority will be in use. To modify filter priority refer to 6.5. System settings .

5.8. PowerDVD/WinDVD

Both players cannot use the AC3Filter to play DVD. It’s concerned with the players features, therefore it is impossible to force them to use AC3Filter. That is no ways in principle to force those players to operate with AC3Filter! So to watch DVD with AC3Filter use it is recommended to use other players - Media Player Classics, LightAlloy or ZoomPlayer.

When playing AVI-files by these players you can use AC3Filter both for post-processing and for AC3/DTS audio track decoding. However, the player cannot call the filter settings, so you should configure the filter by configuration utility before watching only.

6. Filter settings.

After installation filter is already well-configured and in general, it does not require additional setup. Therefore, if you are not familiar with it, do not make any changes. It is recommended to modify the settings only if there are some problems with sound. However, if you are inquisitive and you like to play with settings, please, refer to 7. Theory section.

There are two principal rules of configuring:

6.1. Filter settings window opening

You should enter the filter property window to configure the filter. There are two ways to do that:

6.2. Main settings

This page contains most essential filter settings and information about audio playing. Information and levels are displayed when calling the settings from the player only:

6.2.1. Output

This section describes the settings of filter output format. Filter will automatically convert all input formats into defined configuration. Speakers' configuration is represented in the form of front/rear+subwoofer. There are conventional names for some configurations. For instance, 2/0 - stereo means 2 front speakers, i.e. stereo. 2/2 - quadro means 2 front and 2 rear speakers, i.e. quadro. 3/2+SW 5.1 channels - 3 front speakers, 2 rear speakers and subwoofer - 5.1 configuration. It is desirable for defined configuration to be in accordant with the system settings (refer to 4.2. Setup for multichannel mode ), which in turn must match with the actual number of speakers. To know the difference for subwoofer and for LFE channel, and how to set configuration, refer to 7.2. What is LFE? section.

Dolby Surround/ProLogic and Dolby ProLogic II modes are required only when computer has stereo output only and is connected to Dolby ProLogic decoder. In this case, central and rear channels are mixed into two channels to let external decoder to extract them from those two channels backwards. This is not a multi-channel mode! This is a special type of stereo mode required only if the external Dolby ProLogic decoder exists.

The second selection list sets the sample output format. Filter will automatically convert any input format into defined one. When decoding the filter defines decoding precision. At present 16/24/32 bit and floating-point formats are supported.

SPDIF checkbox switches the digital output mode. This mode is required only when digital (SPDIF) connection and external hardware decoder/receiver exists. In this case the coded stream will be transferred to external decoder without any modifications (if the decoder supports this format) or will be recoded into the format clear for decoder (for more information refer to 7.3. SPDIF ). Current digital output mode status is shown in parentheses:

  • disabled - digital output mode is inactive, there is an ordinary decoded PCM stream transferred to the output. When SPDIF is on, the filter always attempts to use digital transfer. And only in case when digital transfer is impossible (audio card driver rejects to process the indicated format) ordinary PCM is transferred. For instance, many audio cards support digital transfer only when the sample rate is 48000 Hz but do not support widespread format of 44100 Hz.
  • passthrough - pass -through transfer mode (for AC3, DTS and MPEG Audio formats only). In this case, encoded digital stream is transferred without any modifications and decoding is performed by external decoder. In this mode the filer does not display current activity and all filter settings do not affect sound (since sound is not decoded and the coded data is transferred ‘as is’).
  • AC3 encode - AC3 encoding mode. In this mode filter performs decoding of the input stream and post-processing (all filter options are active including changing of speakers number). Then the resulted signal is encoded into AC3 and transfer into external hardware decoder in digital format. This mode is required to transfer multi-channel audio in formats, which are not supported by external decoder (e.g., AAC or OGG Vorbis) or in case when the external decoder does not support stereo decomposition into multi-channel format. Remember that AC3 encoding leads to quality loss, therefore it is preferable to use analogue connection for multi-channel formats.

Formats supported by the external decoder are shown in the system settings page in SPDIF passthrough section (refer to 6.5. System settings).

Note that the current mode can be changed during watching when the input stream format is changed. For instance, when watching DVD there is a possibility of audio tracks switching. At that, the track format can be changed. If the receiver supports only AC3 format then when switching from audio track of AC3 format to track of DTS format there will be switching from passthrough mode to AC3 encode mode.

6.2.2. Preset

All filter settings can be recorded as single composition of settings (preset) for fast uploading them later. There are several standard presets installed together with the filter:

  • Standard - no additional processing performed, signal is transferred without any modifications.
  • Loud - maximum loudness.

Default preset has special sense - it is uploaded during the filter uploading.

6.2.3. Gains

You should not use this option instead of player volume or system volume. An overflow and a noticeable decrease of playing quality can take place when there is a high gain. For more information, refer to 7.1. Loudness and dynamic range.

Master level defines the desirable gain level in dB. If the existing scale is insufficient, you can set the desirable gain level in the entry field without any restrictions.

Gain level reflects the current gain level. If there is no overflow then the Gain level coincides with the Master level. When an overflow takes place, the automatic gain control system decreases this level. Thus, changing of gain level can indicate too high Master level set.

Refer to Auto gain control and One-pass norm options of 6.3. Mixer settings section.

6.2.4. Dynamic range compression (DRC)

DRC Enabled checkbox switches dynamic range compression function. The first level defines compression level of the dynamic range. The second level reflects the current gain level. For more information, refer to 7.1. Loudness and dynamic range.

6.2.5. Decoder info.

It displays type and parameters of the current audio stream. Input row reflects type of input stream. Text box displays the format information. Decoders for different formats display different information, including:

  • speakers - channel configuration of the stream
  • sample rate
  • bitrate
  • stream - bit stream type (8/14/16 bit big/little endian)
  • frame size
  • nsamples - number of samples per frame

Frames/errors fields reflect the decoder statistics: number of decoded frames / number of error frames. There must be no errors in normal operating mode!

6.2.6 Processor usage (CPU).

This indicator shows the usage of CPU resources by the filter.

6.2.7 Levels.

Here the current input and output peak levels of signal are displayed. They are shown in a logarithmic form. When there is an overflow output level indicators become red. In this case, it is recommended to reduce gain (Gain level), since overflow means appearing of distortions (refer to 7.1. Loudness and dynamic range).

6.3. Mixer settings

This page contains settings for mixing control.

6.3.1. Matrix

The main element of this page is mixing matrix. It is represented in a form of matrix multiplication:

Simpler it means that there is a following rule for every cell: we mix the channel given by the column into the channel given by the row with gain given by the cell. Therefore, if we want to direct left channel into right channel we should set the value of cell on the cross of 'L' column and 'R' row as 1. Then we will hear both left and right channels from right channel.

It is necessary to clear the difference between input and output channels. Input channel is information that was encoded in original stream. Output channel is that of we will hear from speakers.

Matrix row defines the contents of output channel (speaker) given by this row. Therefore, the first row of the matrix above indicates sound, which will be heard from the left speaker:

L' = 1 * L + 0.7071 * C + 1 * SL + 1 * LFE

i.e. from the left speaker we will hear the left channel, part of the central channel, the left rear and the low frequency effects channel. Thus, mixing from 6 original channels into 2 stereo channels is performed (downmix).

One important property of matrix is that when swapping contents of two matrix rows we can 'interchange' appropriate speakers. Therefore, swapping of the first and the third rows results in 'interchange' between the left and right speakers. This property can be used to rearrange speakers without their physical switching.

Values of the matrix column show the target of the channel mixing. Therefore, the central channel is directed both to the left and right speakers with the 0.7 factor (otherwise the center outgoing from two speakers, will sound louder than if we would have the real third speaker). Thus, we will hear the central channel in the middle of the left and right speakers. Changing factor of the left or right channel you can 'move' the center. Thus, setting ' Ñ ' column value of the 'L' row as 1 will result in louder sound effect of the center from the left channel. The center will seem to be moved to the left.

In most cases there’s no need to change the matrix manually: when the AutoMatrix option is on the mixing matrix is calculated automatically based on the following parameters: speakers mode, Voice, Surround, LFE levels, Normalize matrix, Voice control, Expand stereo options. Thus, level changing for the central (voice) channel will result in changes of matrix coefficients (column 'C').

6.3.2. Gains.

Master/Gain - desirable and current gain level (refer to 6.2. Main settings ).

Voice - gain level of the center channel. If the input signal contains real central channel, then this level affects only on this channel. If there is no central channel but Voice control flag is set, this level controls the level of 'virtual' center. This level is a matrix parameter, so it makes sense only when the Auto matrix option is on. When changing level the matrix is recalculated.

Surround - rear gain level. If there are rear channels in input signal then this level affects only on them both in case of down mixing (in this case level of rear channels mixing into front channels is controlled) or without it (i.e. just control channel levels in respect of the frontal channels). If there are no rear channels in source but Expand stereo flag is set, then this level controls the level of 'virtual' rear. This level is a matrix parameter, so it makes sense only when the Auto matrix option is on. When changing level the matrix is recalculated.

LFE - LFE channel level. This level is a matrix parameter, so it makes sense only when the Auto matrix option is on. The matrix is recalculated when changing the level.

6.3.3. Options

  • Auto gain control - this option reduces the current gain level to avoid an overflow. Gain indicator displays the current gain level. When this option is off the extremely strong signal will be cut off resulting in distortions, which are unpleasant to hear. If the option is on then in case of overflow the sound becomes quieter thereby excluding overflow. System behavior after overflow is defined by One-pass norm option. If this option is off the current level will return to adjusted Master level. At that, this level specifies an average gain level. If there are many locations with overflows such behavior may result in unpleasant leaps of loudness, therefore you should not set the Master level at extremely high value. It is recommended to use this mode when overflow is not expected or when it appear relatively rarely. If One-pass norm option is on gain level is not recovered after overflow. Thus, every overflow reduces probability and 'intensity' of next overflows, therefore after some time the gain level adjusts to a certain average value and keeps around it thereafter. As a rule, level becomes stable (and the current level is displayed by the Gain level) after ~10 minutes of film playing. This mode is used to reach the highest possible loudness with minimum of possible artifacts. In this case, the Master level acts as an initial gain value for the one-pass normalization operation; therefore, usually it is adjusted at maximum. For more information, refer to 7.1. Loudness and dynamic range.
  • One-pass norm - one-pass normalization. Refer to Auto gain control.
  • Auto matrix - automatic calculation of the mixing matrix. When this option is on you cannot edit the matrix directly. LFE, Voice and Surround gain levels are parameters for matrix calculation; when changing them matrix is changed immediately. When this option is off the following control elements loose their effect: LFE, Voice and Surround gain levels, Normalize matrix, Expand stereo, Voice control options. Then ability of direct editing of the mixing matrix is available.
  • Normalize matrix - When this option is active, the mixing matrix is normalized to ensure that no overflows occur in output channels (the sum of the output channels ? 1). But at the same time, changing of LFE, Voice and Surround gain levels may lead to loudness redistribution (for instance, when increasing LFE loudness , loudness of other channels will decrease) that is not always convenient. Besides, in most cases matrix normalization strongly reduces total loudness. Switching this option off allows to avoid those effects, however possibility of overflow appears at the same time.
  • Expand stereo - takes effect only if there are no rear channels in input signal. This option allows to control gain level of the virtual environment by means of Surround level. If there are rear speakers in output speakers configuration the separated environment information is redirected that way.
  • Voice control - takes effect only if there is no central channel in input signal. This option allows to control gain level of the virtual center by means of Voice level. If there is a central channel in output speakers configuration, the separated virtual center is redirected that way. This option together with Expand stereo option allows to decompose the stereo signal to 5 speakers.
  • Bass redirection - bass redirection from all channels to subwoofer. It takes effect only if there is a subwoofer in output speakers configuration. This option is recommended for systems with separate subwoofer (refer to 7.2. What is LFE? ). You can set the cutoff frequency (Hz) in entry field.

6.3.3. Matrix

Matrix selection list allows to save and upload matrices.

6.4. Gains

This page contains gain level settings:

6.4.1. Gains

Master/Gain - desirable and current gain level (refer to 6.2. Main settings ).

6.4.2. Dynamic range compression (DRC)

DRC Enabled checkbox switches dynamic range compression function. The first level defines compression level of the dynamic range. The second level reflects current gain level. For more information, refer to 7.1. Loudness and dynamic range.

6.4.3. Delays

These are delays for each output channel. Delay values can be set in different units (Units selection list). Positive values for time units mean that positive delay will be introduced, i.e. the sound will be delayed. Conversely, positive values for distance units mean negative delay, i.e. the sound will be played 'in advance'. In both cases, there can be negative delay values. There are two main applications of delays:

  • Compensating of distance difference between speakers. Acoustic wave needs certain time to reach listener, therefore, ideally all speakers must be located at equal distance from listener, so as the sound simultaneously produced by all speakers at a certain time could reach listener simultaneously. Otherwise, sounds will be in wrong positions. However, the distance to speakers may differ in practice. The farther the speaker, the earlier sound produced by it must be played. To compensate this defect you can set distances to speakers directly.
  • Ambisonic effect creation when decomposing stereo record to multi-channel system. Introduction of additional delay to rear channels can enhance the ambisonic effect.

6.4.4. Input gains

Gain levels of input channels. Sometimes it is required to change gain levels namely of input channels. That's the case when the input signal has a central channel while the output one has no central channel (the center is mixed to front channels), and it is required to change dialog level namely. I.e. these levels are intended for record defects compensation.

6.4.5. Output gains

Gain levels of output channels. These levels serve for speakers loudness equalization (if speakers are located asymmetrically). In other words, they serve for compensation of acoustic system defects.

6.5. System settings

This page reflects system settings (which have no direct influence on sound):

6.5.1. Use AC3Filter for ...

It defines the cases of AC3Filter use.

  • PCM - use AC3Filter for post-processing of all audio streams. To use this mode the filter must have high priority (refer to Filter merit option further in this section).
  • AC3 - use AC3Filter for AC3 decoding
  • DTS - use AC3Filter for DTS decoding
  • MPEG Audio - use of AC3Filter for MPEG Audio decoding
  • PES (DVD) - use AC3Filter for audio decoding in MPEG1/MPEG2 PES streams. This mode is required for DVD/VideoCD playing.

6.5.2. SPDIF passthrough

This setting defines formats supported by external decoder and being transferred without any modifications when digital output is used. It makes sense only if the SPDIF option is on (refer to 6.2. Main settings).

6.5.3. Filter merit

The filter is installed into the system with the highest priority, i.e. it will be used instead of any other decoders installed. If it is required to use other decoders by any reasons, the priority can be decreased to minimum using the Prefer other decoder option. Filter will stay in system and will be used only if there are no other decoders installed. Prefer AC3Filter option means the high priority.

6.5.4. Default audio renderer

Audio card can operate in two modes: DirectSound and WaveOut. If there are some problems occur when using the filter (e.g. SPDIF is inoperative), sometimes they can be solved by switching of operating mode. Some players have settings to change output device (refer to 5. Player setup), others use the default system settings. To switch the operating mode use Use DirectSound option for DirectSound interface, and Use WaveOut option for WaveOut interface. This option changes the system global settings! Thus, it is preferably to use native settings of player if possible.

This option takes effect only if the player uses the default output device or if it has appropriate settings.

6.5.5. Audio/Video sync

In some cases, audio and video can be non-synchronized. Generally, this can occur due to incorrect mixing of video file, but the hardware causes of desynchronization are possible too. This option is intended to solve hardware problems. It can be used for defect correction of certain file, but do not forget to change it back! Here shift between audio and video is specified in ms. If sound is heard before video you should increase this value. If sound occurs after picture on the screen, you should decrease the value. This option helps to correct constant time shift only.

Option takes effect immediately. When changing the delay value the picture can freeze or jerk within several seconds.

7. Theory

Audio quality is a very complex concept depending on a huge number of factors and their relationship: type of acoustic system, listening conditions, quality of original record, listener himself, and many others. It is impossible to give some unique recommendations about how to reach good quality. There is no any unique formula and ideal sound: the record prepared for one listening conditions can sound absolutely unacceptable in other environment. However, we will try to present some of basic principles, which can help to understand what to do in each specific case. Therefore, it is strongly recommended to read this section for full understanding of filter abilities.

7.1. Loudness and dynamic range

7.1.1. What is loudness

This question, which seems to be clear is hard enough to formalize, since different cases imply absolutely different things.

The sense of loudness is most clear when speaking about acoustic pressure, since that is directly perceived by ear.

Acoustic pressure is an additional pressure, which occurs when the acoustic wave passes through liquid or gaseous medium. When propagating through medium acoustic wave forms some compressions and depressions, which make additional variation of pressure with respect to average value of medium pressure. Thus, the acoustic pressure is a variable part of pressure, i.e. variation of pressure with respect to average value with frequency corresponding to acoustic wave frequency. (Great Soviet Encyclopedia)

Thus, we can measure any sound - loud sounds produce high pressure, quiet ones produce low pressure. The pressure is measured in Pascals, however, in acoustics it is usually measured in decibels (dB) with respect to the hearing threshold. By definition, the value of hearing threshold is accepted to pt = 0.00002 Pa = 20 mkPa. The hearing threshold is taken as 0 dB and loudness is calculated as l = 20 * log(p / pt), where l [dB] is loudness (in terms of acoustic pressure), p [Pa] is acoustic pressure, pt [Pa] is hearing threshold. At the same time, all audible sounds have positive loudness value; all inaudible sounds (below the hearing threshold) have negative value; loudness change by 6 dB corresponds to the double change in pressure; change by 20 dB corresponds to the tenfold change in pressure. Loudness in terms of acoustic pressure we will call an absolute loudness.

Here are some typical loudness values:

Sound Loudness Pressure
Hearing threshold 0dB 20 mkPa
Rustling of leaves and gentle breeze
Wristwatch ticking
Breath
10-20dB 60 - 200 mPa
Quiet whisper
Clock ticking
20-30dB 200 - 600 mPa
Noise indoors 30-40dB 0.6 - 2 mPa
Quiet conversation 40-50dB 2 - 6 mPa
Normal conversation 50-60dB 6 - 20 mPa
Loud conversation 60-70dB 20 - 60 mPa
Noisy street 70-80dB 60 - 200 mPa
Noise of the truck engine ~80dB 200 mPa
Underground railroad noise on the run
Chisel hammer
~90dB 600 mPa
Loud disco 100-120dB 2 - 20 Pa
Plane taking-off 120dB 20 Pa
Pain threshold 130dB 60 Pa

Note the perceived pressure range: hearing threshold pressure and pressure produced by plane differ in million times! Therefore, the logarithmic scale is in a better accordance with the ear physiology - the linear change of acoustic pressure does not correspond with sense of loudness linear change. For instance, change of acoustic pressure by 50 mPa during conversation will be extremely sensible; however, it will be absolutely insensible during plane taking-off. But change of acoustic pressure by 6 dB (in twice) will be perceived as an approximately equal loudness change in both cases, though in the first case it will correspond to change of pressure by 25 mPa, and in second case - by 10 Pa.

Another type of loudness is a record loudness (signal loudness). This loudness is not an acoustic pressure (it can be voltage, magnetic polarization, etc.), however acoustic pressure is produced by acoustic system in accordance with loudness of recorded signal. Each certain signal loudness corresponds to the certain acoustic pressure. Signal loudness can also be measured in decibels. However, while the acoustic pressure is usually measured with respect to hearing threshold (minimum audible acoustic pressure), loudness of digital signal is usually measured with respect to maximum digital level accepted as 0 dB. Thus, loudness of digital signal is represented by negative values (-3 dB, -20 dB) since the record loudness must always be lower than the maximum one. The lower loudness value, the quieter the signal (-20 dB is quieter than -3 dB). Positive loudness of the digital signal means overflow and results in appearance of digital distortions. (These distortions will be considered later).

Volume controls of amplifier, in system settings, of player do not produce any acoustic pressure at all. Without signal even at maximum volume settings, we will hear nothing (with the assumption that the playing system does not produce any noise). Therefore, these settings affect loudness indirectly only and have sense of signal gain. (The gain may mean the signal decay too). From here, we will not use the term of loudness for gain levels notation except system loudness since this is established term. We will consider gain level set both in operating system settings and in player, amplifier/receiver, etc. as system loudness.

Gain level can also be measured in decibels. It is convenient since in this case the signal loudness and the gain level are simply summed. For instance, signal with loudness of 70 dB gained by 10 dB will sound with loudness of 80 dB. However though loudness and the gain are measured in same units you should distinguish them.

During playing, the recorded signal is converted into acoustic pressure using acoustic system. Let the maximum pressure produced by acoustic system is 100 dB. Then the record with loudness of 0 dB will produce pressure of 100 dB, the record with loudness of - 30 dB will produce pressure of 70 dB, etc. When changing the gain level the absolute loudness will change too. Therefore, when changing the gain level you can always set up a correspondence between the record loudness and the required absolute loudness. For instance, if the conversation level in movie is -30 dB and we want to hear these dialogues with natural loudness, the record loudness must correspond to the pressure of 50 dB. Since when the gain level is maximum (0 dB) the record level of -30 dB produces pressure of 70 dB (too high), while when the level is -20 dB the same record loudness produces the required 50 dB of pressure. Acoustic system adjusted in such way is called calibrated, i.e. the calibrated acoustic system is a system with sounds with the correct absolute loudness produced. (In fact, the calibrating process can be much more complex and can include much more parameters, however here and further we will talk about loudness calibrating only). Most of household devices have no labels of the gain level control (or there can be some abstract percents or other meaningless characters), therefore, it is difficult for user to make a precise calibration of acoustic system.

Hearing adaptation also affects loudness perception. Hearing adapts to loudness of surrounding noise and corrects loudness perception accordingly. Certainly, you could see that the man with earphones where loud music is playing begins to talk louder (with point of perception of people around him), however with his point of view it is normal speech loudness (with respect to loudness of earphones sound). Otherwise, in full silence, people begin to whisper and nevertheless it seems to be loud. In conditions of normal city noise, it is difficult to hear ticking of mechanical watches - this sound is perceived as very quiet. But the same watch ticking in full silence at night can be perceived as clear heard sound. Thus, the third type of loudness appears - subjective loudness. We will not discuss some kinds of subjective loudness measuring, just talk about qualitative comparing of sounds (louder-quieter).

Thus, sound with the same absolute loudness can be perceived as loud or quiet depending on the environment. Assume that we have adjusted the gain level so as the dialogue loudness in the movie approximately corresponds to reality. So if there is a sound of watch ticking in silence in the movie then we will not hear this sound at all when watching the movie since it is significantly quieter then the surrounding noise and our hearing is adapted to eliminate the surrounding noise. In good hearing conditions when there is no surrounding noise the same watches will be heard clearly.

In most cases, it is convenient to measure record loudness with respect to some reference level. For instance, if the record loudness is -20 dB, is it loud or quiet? And if we know that the dialogue loudness in the same record is -30 dB we can just say that it is loud enough, and if dialogue loudness is -10 dB, we can say that it is quiet enough. Dialogue level (average loudness of the conversation) is a very convenient reference level, which can be used for orienting. If sound loudness of 10 dB is higher than the dialogue level then it is loud, and if loudness of 10 dB is lower than the dialogue level then it is quiet. At the same time the dialogue level can be arbitrary and depend on the record - one record may have dialogue level of -10 dB, and the other one may have dialogue level of -30 dB. Anyway, all sounds with loudness which is lower than the dialogue level will be perceived like quiet sounds, while ones with loudness which is higher than the dialogue level will be perceived like the loud sounds, even if the system have been calibrated incorrectly, and dialogues will sound with absolute loudness of 40 dB or 60 dB. Due to the property of adaptation, hearing adapts to the current average loudness and introduces an appropriate 'correction'. However, the correct perception of loudness will be broken when loudness of surrounding noise is approaching dialogue loudness (or even exceeds it). In such a case, all sounds played will seem to be quiet.

Thus, we can introduce one more loudness scale (in addition to the digital signal loudness scale and the absolute loudness scale) - loudness with respect to the dialogue level. When comparing different loudness we will get the follows:


Absolute loudness of real sounds Loudness with respect to the dialogue level Record level-1 Record level-2 Absolute loudness during record playing 1 Absolute loudness during record playing 2
Quiet sounds
Quiet whisper 10dB -40dB -70dB -50dB 10dB 10dB
Noise norm in living quarters 20..30dB -20..-30dB -50..-60dB -30..-40dB 20..30dB 20..30dB
Quiet conversation 40dB -10dB -40dB -20dB 40dB 40dB
Dialogue level
Normal conversation 50dB 0dB -30dB -10dB 50dB 50dB
Loud sounds
Noise of a typewriter 70dB +20dB -10dB 0dB (maximum) 70dB 60dB
Noise of truck engine working 80dB +30dB 0dB (maximum) 0dB 80dB 60dB
Loud horn within at a distance of 5-7 m 100dB +50dB 0dB 0dB 80dB 60dB
Noise of a tractor working at a distance of 1 m 120dB +70dB 0dB 0dB 80dB 60dB
Pain threshold 130dB +80dB 0dB 0dB 80dB 60dB

There are also two hypothetical records made in different environments given in the table: dialogue level of record-1 is -30 dB while that of record-2 is -10 dB. It is clear that when playing on calibrated system the first record transfers loud sounds much better - it is possible to reproduce sounds with loudness up to 80 dB, while the second record reproduces sounds with loudness up to 60 db only.

Note that two mentioned records require different calibration of acoustic system. Thus, for acoustics that is able to produce 100 dB pressure the gain level required for the first record is -20 dB, and that required for the second record is -40 dB. Hence, the first record requires significantly higher gain and the first record will sound significantly quieter when the gain level settings are equal. Therefore, the second record is suitable for non-calibrated systems since it allows strong deviations of the gain level.

Thus, record-1 transfers loud sounds well but requires higher gain level; if there is an insufficient gain or some external noise this record will be perceived like extremely quiet. Record-2 does not require high gain, it is heard well even if the gain levels are low or some noise exists, but this record cannot transfer loud sounds well.

And now let's remember that audio signal is alternating:

What is required for loudness evaluation? Obviously, variation of pressure for one period of acoustic wave has no sense of sound loudness change since we cannot hear single oscillation. Therefore, we must find loudness within the certain time interval but not for one point. There are many different methods for loudness evaluation. The simplest one is to find the maximum and to calculate the signal energy. More complex methods take into account the non-uniformity of hearing of sounds of different frequency and intensity.

When finding loudness as maximum of the signal we scan the range looking for signal maximum:

The found level in dB will define loudness. To distinguish this loudness from ones obtained by other methods it is called the peak level. From here, we will use only this loudness definition. In fact, this definition does not reflect the real loudness perception well but it is convenient for the next discussion, therefore we will not consider in details any other methods of loudness evaluation even if they are more precise.

7.1.2. Loudness histogram

This picture illustrates loudness distribution of typical DVD movie (DVD FightClub, hereinafter we will consider only original audio tracks with no transformation and no mixing in order not to distort sound pattern). X axis reflects loudness in decibels, Y-axis reflects occurrence rate of sound of this loudness. It is better to say that during the movie playing the current loudness have been continuously controlled. The more often a loudness occurred during the movie playing the higher the histogram of this part. I.e. we can say that there are few explosions in the movie but at the same time, there are many different background sounds. The histogram is divided into the several conventional areas:

  1. Voice. A dialogue level for movies is one of most important parameters. It is the reference point for all other sounds: sounds with lower level are considered as quiet sounds while the sounds with higher level are considered as loud ones. Good audibility of dialogues is one of the main setting criteria (we will discuss this setting below).
  2. Background sounds - the cars going by, footfall sounds, background music
  3. Extremely quiet sounds - light breeze, grass rustling, etc.
  4. Loud sounds - phone ringing, impact sounds, and so on
  5. Explosions and other global crashes
  6. Record noise

Boundaries between areas are rather conventional and can be modified for different records. In this case, it is known that the dialogue level is -27 dB.

Looking at the histogram, we can note some interesting moments. Firstly, the most often sounds in the movie were the ones with loudness of -40 dB. Let us just note this fact for now. Secondly, the main hump of histogram is located in area from 0 dB (maximum level) to -70 dB (minimum level). It is reasonable to assume that all significant sounds are inside of this hump while the quieter sounds are just a noise. This value is called dynamic range and will be very important further.

This situation is typical enough for DVD movies. Dialogues take much time; background sounds and music take a little more time; some loudness surges occur periodically with the following decays. Generally, sound pattern of this movie is balanced enough and practically full available dynamical range is used.

Let’s look at some other movies:


(Dialogue level for all movies is -27 dB)

These examples can be divided into two large groups. First group is characterized by almost full absence of loud sounds (Cube, Dracula, About Schmidt, Savior). Loudness range from 0 dB to -15 dB (very loud sounds) is not practically used here. Otherwise, loud sounds are extremely important for the second group of movies (Interstate60, Blood (Last Vampire), Pitch Black, Matrix). It seems that the number of these sounds is relatively small and they play no special role. However, for instance, the sound of a shot has duration of a split second (the louder part) but exactly high loudness accentuates this sound. If this sound was quieter there would be an effect of ‘flatness’ and inexpressiveness of the sound. Even when occurring once during the movie loud sound has no great sense in movies of the first type ('quiet' movies); therefore its loudness is not critical so much. Otherwise, in movies of the second type those sounds (even with the small duration) form the mood of the movie.

Another conspicuous feature - high 'humps' in Cube and Interstate 60 movies. There is a constant quiet background hum in the Cube movie. This is why the graph has 'hump' at loudness of approximately -50 dB. The same thing relates to the Interstate 60 movie - it is car travel and, consequently, there is a constant sound of moving car. Shapes of both histograms are very similar; however, the maximum of the histogram for the Interstate 60 movie is lower by 10 dB. As the voice levels are the same in both movies, one can immediately conclude that sound in the Cube movie is much quieter. This conclusion is far from an obvious one. Let us recall hearing adaptation - if we shift graphs to align their maximums at one level, the Cube movie will be perceived as 'quiet' all the same.

The other movies have no such well-marked background sounds, so their graphs are more distributed across the dynamic range.

About Schmidt movie has 'hump' at the level of -70 dB - -80 dB. These are recor ding noises; and they are relatively loud in this movie, as compared with other movies, but quiet enough and do not hinder viewing.

So far, we considered solely movies. Now let us have a look at music:

Right off, it strikes the eye that distribution of loudness is absolutely different. ( To show a variety, there are histograms of various music styles having different quality collected. They do not show a real balance of different types of compositions .) All sounds are very much shifted towards the 'loud' part of the dynamic range. The difference between middle-level loudness of movies reaches 40 dB. Besides, a characteristic feature of the movies is a smooth decrease of the histogram in the loud area, whereas for music a histogram maximum often reaches the highest level (0 dB). A small number of such graphs is provided; nevertheless, such situation is very widespread.

The dialogue level concept is often inapplicable to music, thus it is very difficult to separate loud and quiet sounds. Therefore, it is also difficult to determine a subjective loudness of a composition. So, a subjectively loud composition may be objectively quieter than a subjectively loud one. Positive or negative drops of the loudness have a great importance here; these drops are not visible on the histograms.

Absence of a common reference mark results in a chaos - sounds from different sources have different loudness; difference between maximums of composition histograms reaches 20 dB. Besides, in general it is very difficult to formalize a concept of a 'composition's loudness' anyhow. Certainly, many people are familiar with a situation when there is a variety of music from different sources and during its successive playback (e.g., if music is recorded on an audio CD), loudness changes at every passing from one composition to another. It is unpleasant (the mentioned effect of a mismatch between actual loudness and subjective loudness may more embarrass a perception). The provided graphs illustrate this situation very well.

Dynamic range of musical compositions (difference between the loudest and the quietest sounds) is 20-40 dB that is far less than dynamic range (70 dB) of movies.

7.1.3. DVD loudness problem

The average loudness level of musical records (as well as of Windows system sounds) is notably higher than a loudness of DVD movies. Therefore, under the same settings of the system loudness the absolute loudness of movies will be significantly lower. In the presence of external noises loudness may turn out to be insufficient, hearing will be poor (sic!) and it will seem that the sound is of a poor quality. Gain increase in system settings and in amplifier, even to a maximum, may fail to resolve the problem: difference of average loudness reaches 40 dB; it is too high. However, even if an amplifier's power turns out to be sufficient and a DVD is played with sufficient absolute loudness it is not always suitable, as sounds of operating system, which have normal absolute loudness under normal gain settings, will be simply thundering.

This problem is typical mainly for computer playback, as in hardware players gain level is controlled by the decoder itself. Some program DVD players are able to control the system loudness, but it is not always suitable, as in this case loudness of all system sounds will be changed (you may accidentally deafen your neighbors). Besides, in any case, the program cannot control loudness at an external amplifier. So, it is just a partial solution of the problem.

A compromise solution is to process the sound immediately before playback. Such processing can greatly enhance quality of certain record playback under certain conditions. Many people may object that in doing so 'quality' will be lost; however, as it has been already mentioned, there is no absolute quality. We do not set ourselves a goal to cue the sound; our goal is to ensure a pleasant listening to it under our conditions. If an acoustic system does not have sufficient power or if you have nervous neighbors it will be just unpleasant to view a movie with lowered loudness when you cannot distinguish a half of words and a half of quiet sounds is not heard. Even self-suggestion that maximum quality is achieved will not smooth this impression. I should repeat once more the key idea: quality is estimated by our ears . The sound passes many processing stages before it reaches a listener, and a variety of acoustic devices and their characteristics is so wide that the last processing stage before playback is actually a necessity.

7.1.4. Level change. Overflow, clipping and limiting.

Level change is simply multiplication of signal amplitude by a defined value. It results in a change (increase or decrease) of the entire signal loudness.

Logarithmically, multiplication by a number is just an addition of a constant. So, if the same number is added to all levels the levels' histogram will be simply shifted:

If we multiply the signal by a too big number, it may result in an overflow. If an overflow is not tracked the amplitude will be equal to virtually random values (see the figure), and this results in frequent clicks, which are very noticeable audibly. The easiest way to struggle with this defect is clipping. That is, a signal is clipped when its amplitude goes beyond limits (see the figure). If the overflow is small, the clipping is almost imperceptible audibly; however, when the level increased clipping is heard in the sound as 'sand'.

More complicated, but more efficient way is limiting. It consists in an automatic decrease of the signal level to prevent overflow. To do this, it is necessary to use automatic gain control (AGC), which will adjust t he current signal level. Thus, when there is no overflowing AGS does not change a signal, and as soon as a signal exceeds the maximum level, adjustment is made, which changes the signal to make its level lower than the maximum one. As it can be seen on the figure, the signal completely preserves its form! However, at that loudness remains at maximum.

It is necessary to take into account that the previous picture shows amplitude and this one shows loudness (defined on the basis of a large group of amplitudes). Therefore, notwithstanding that loudness graph is 'cut off', the form of the signal itself is distorted in minimum. This way is not free from shortcomings also. Initially, loudness of the signal itself varied, and after the limitation, all sounds that go beyond the limits have the same loudness. As a result, the sound may become 'flat' and inexpressive. Let us look at histograms (Fight Club):

An example of unsuccessful gain is shown here. With gain of +10 dB, distortions are still less noticeable (there is still a relatively small number of scenes in the movie where overflow takes place). However, with an increase of gain, the level is limited more and more often, and when the level is equal to +30 dB voice begins to be limited that is highly noticeable. Shortcomings of the limitation are especially noticeable when loudness changes significantly over a short time period - subjectively, loudness begins to jump up and down. If, against a background conversation (which, at the gain of +30 dB, is already reproduced with maximum loudness), loud sound is heard (which is louder than the voice in the original, e.g. a telephone ring) the gain level is decreased to prevent distortion of loud sound, but at that the conversation loudness drops abruptly, too. When the loud sound ceases, the conversation loudness returns to the original level, abruptly as well:

Thus, relative loudness of various sounds reproduced simultaneously is preserved, but gain level is continuously varying. It is well noticeable and very unpleasant. Thus, at high gain levels (20-30 dB and more) the limitation also produces a bad result.

When gain levels are small, the signal limitation defects are practically unnoticeable, whereas the signal clipping is heard well virtually always. That is, under otherwise equal conditions, limitation gives more good result. The filter always performs either limitation (the AGC option is switched on) or clipping (the AGC option is switched off). Hereafter, unless otherwise specified explicitly, it will be assumed that the limitation is switched on. Thus, it is recommended to keep the AGC option active.

7.1.5. Normalization

Let us set a goal to raise loudness without any loss of quality at all. Is it possible? Yes, if there is a 'margin' of the dynamic range. Let us look again at movies' histograms, Cube movie in particular (the yellow graph). One can see on the histogram that loudness does not rise higher than -15 dB (it is possible that in the entire movie there are one-two such occasions; however, for the purposes of this review we will assume that there are no loud sounds at all). So, as there are no loud sounds, it is possible to raise loudness by 15 dB without loss of quality!

Except for increase of loudness, there are no other changes in the signal. At that, the histogram will shift close by the right-hand margin without changing a form:

Thus, it will turn out that there is now a sound in the movie having maximum loudness. However, if we continue to increase loudness the distortions (described above) will appear. Such signal form where there is a sound of maximum loudness without introduced distortions is called normalized and the process is called normalization. Normalization is often carried out at the musical records preparation stage, and the sound reaches us already normalized (e.g. AudioCD). However, the normalization is not carried out for movies. Why?

Let us recall histograms for music and a chaos relating to loudness. For music, it is important to playback with maximum loudness, as it is meant for maximum target audience - CD players, a noisy street, underground, cheap earphones, croaky radio stations where low sounds less than -40 dB are not hearable. (Let us have a one more look at graphs for music - minimum loudness is approximately equal to - 40 dB.)

For DVD it is not so. They are primarily designed for quality home cinema. Playback equipment should be calibrated to reproduce dialogs always with the same absolute loudness for any movie, will it be an action movie with explosions smashing a wall or a melodrama with soft rustle of grass. At that, playback system should always be ready to playback these sounds without necessity to change the gain level manually. Thus, the most important is not a necessity to make a record maximum loud, but a necessity to have fixed reference level to make it possible for a decoder to adjust gain level automatically. This reference level may be any (as a matter of fact, this is not important, i.e. if there is a reference level, corresponding adjustment may be made at any time). The standard de facto for DVD is the dialog level equal to -27 dB. Thus, even if a movie does not contain loud sounds, one should not carry out normalization at the disc creation stage and the dynamic range remains unspent.

7.1.6. One-pass normalization

So, for the purpose of loudness increasing normalization is useful. However, to carry out normalization, one needs to know maximum recording level. To identify it, one needs to review the whole record beforehand. However, it is not always possible and is inconvenient. There is a normalization method, which does not need preview of the whole record - the one-pass normalization. The method consists in a constant search for maximum when we are watching a movie. At the initial moment, the gain is at maximum. When an overflow arises, we decrease gain:

As it shown on the figure, the first loudness peak is clipped almost in the same way as in the case of usual overflow, but duration of 'clipping' is significantly shorter (compare with simple limitating) and later there will be no overflows. Thus, gain will be adjusted when every new maximum is found, and, as a result, the histogram is automatically shifted to achieve maximum loudness.

As all other, the method is not free from shortcomings. First, the method introduces distortions (though they are low observable, it is necessary to know precisely what to listen to in order to notice the one-pass normalization). Second, loudness is continuously decreasing during a movie show (though usually the main loudness decrease process ends during the first 10-15 minutes of a movie and so it is practically unnoticeable). And third, it may be that the main goal (loudness increase) will not be achieved - everything depends upon the movie itself (it relates also to the usual normalization). Here is an example of histograms obtained in the case of one-pass normalization of the Cube (where normalization is possible) and Fight Club (where normalization will not give the desirable effect) movies:

As it seen on these histograms, the result is drastically different. Activation of one-pass normalization gives a substantial positive effect for the Cube movie - sound has become noticeably louder (our assumption that sounds louder than -15 dB are absent in the movie was confirmed - compare with normalization graph). However, in the Fight Club movie no increase of loudness took place at all - thus the desired effect was not achieved, but additional distortions were introduced.

It is necessary to note once more that for one-pass normalization operation preliminary (initial) gain is needed - without level rising, the one-way normalization is senseless. As well as the normal gain, it is set by the Master level. However, if it is too strong loudness drop from the beginning of the algorithm operation will be very noticeable.

7.1.7. Dynamic range compression

Let us think: what for do we need to raise loudness? Is it needed in order to hear quiet sounds, which are inaudible in our circumstances (e.g., if it is impossible to listen loud sounds, if there are unwanted noises in the room, etc.)? Is it possible to amplify quiet sounds without processing loud sounds? It turns out that it is possible. This technique is called the Dynamic Range Compression (DRC). For this compression, it is necessary to change the current loudness constantly: to amplify quiet sounds and not to amplify loud ones. The simplest law of loudness change is linear, i.e. loudness is changed according to the law: output_loudness = k * input_loudness, where k - dynamic range compression coefficient:

When k = 1, no changes are made (output loudness is equal to input one). When k < 1, loudness will increase and the dynamic range will be contracted. Let us have a look at the graph (k=1/2) - quiet sound with loudness equal to -50 dB will be higher by 25 dB, that is significantly louder. Meanwhile dialog loudness (-27 dB) will rise only by 13.5 dB, and loudness of the loudest sounds (0 dB) will not change at all. When k > 1, loudness will decrease and the dynamic range will increase.

Let us look at loudness graphs (k = 1/2: the dynamic range will be compressed in double):

As it may be seen, there were very quiet sounds in the original that were lower than dialog level by 30 dB, as well as very loud ones higher than dialog level by 30 dB. Thus, the dynamic range is 60 dB. After compression, loud sounds are higher just by 15 dB, and quiet ones are lower than dialog level by 15 dB (the dynamic range is now equal to 30 dB). Thus, loud sounds have become significantly quieter, and quiet sounds - significantly louder. At that, there is no overflow!

Let us now consider histograms:

As it clearly seen, when gain level is up to +30 dB the histogram form is well preserved. It means that loud sounds remain well marked (they do not become maximum and are not clipped as in the process of simple gain). At that, quiet sounds are identified. The histogram shows it poorly but the difference is very noticeable by ears. The method's shortcoming: the same jumps of loudness. However, their generation mechanism differs from jumps of loudness arising during clipping, and their character is completely different: they arise generally in the case of very high gain of quiet sounds (and not during clipping of loud sounds as in the process of the normal gain). Excessive compression level results in flattening of the sound pattern: all sounds tend to be equally loud and inexpressive.

High gain of quiet sounds may result in audible recording noise. Thus, the filter uses a somewhat modified algorithm to lessen an increase of noise level:

That is, at loudness level equal to -50 dB there is a knee of the transfer function and noises will be amplified less (yellow line). When there is no such knee, noises will be significantly louder (grey line). Such simple modification significantly decreases noise level even at very high compression levels (on the figure - compression is equal to 1.5). The DRC level of the filter defines gain level for quiet sounds (-50 dB), thus the compression level 1/5 shown on the figure corresponds to +40 dB level in filter settings.

7.1.9. Mixing and loudness

There is one more detail not obvious, which may highly influence loudness during playback of multichannel records on a stereo system (or using headphones). Let us suppose that we have an initial record in the 5.1 format and it is necessary to obtain two channels at the output. Into each output channel there are mixed: one front channel, one rear channel, the LFE channel and part of the central channel:

L' = L + 0.7*C + SL + LFE
R' = R + 0.7*C + SR + LFE

Let us suppose that there is maximum loud sound simultaneously in all input channels (though it is not probable enough). Then the amplitude in the output channel will be 3.7 times more than maximum (by 11 dB), i.e. there will be the highest overflow. To prevent overflow, mixing formulas should be rewritten in the following way:

L' = (L + 0.7*C + SL + LFE) / 3.7
R' = (R + 0.7*C + SR + LFE) / 3.7

This is a normalized mixing (you should not confuse it with normalization of record itself) - this guarantees absence of an overflow. However, sound mixed in this way is quieter by 11 dB! Is it possible not to carry out normalization? Yes. However, in this case there are possible overflow and related distortions. Overflow arises only then when all input channels simultaneously reproduce a loud sound. It is not typical for movies (usually rear channels are noticeably quieter than front ones and the LFE channel is used far from always), but it is usual for multichannel musical records. That is why one may leave the non-normalized mixing for musical records, and for music, it is better to switch the normalization on. (In the filter, normalization of mixing is controlled by the Normalize matrix option).

7.1.10. Recommendations.

Setting recommendations are highly depend on specific conditions of listening and a goal set. Conditionally, all conditions may be categorized as follows:

  1. High-quality home cinema and good listening conditions - no unwanted sounds (street noise, neighbors from above, children running about and a possibility to make noise (neighbors are absent). For this category all post-processing may be switched off (preset standard).
  2. Good acoustics, but there is external noise. It is possible to use small gain and small compression level.
  3. One cannot make noise him(her)self (night, neighbors/children), ordinary acoustics (cheap headphones, weak speakers). There is necessary high compression level plus an increase of the level or the one-pass normalization.

The best results may be achieved only by combining all methods. In this section, we will consider only a case of viewing DVD movies. For other cases (music, MPEG4 movies, etc.) sound characteristics are very different. However, after familiarization with the filter's setting for DVD-movies it is not difficult to study other cases.

Gain (Master level). Jumps of loudness arising in the case of overflow are unnoticeable only then when the sound being clipped is much louder than the basic one: in the example, relating to a telephone ring loudness of the telephone sound is comparable with loudness of voice sound. However, if instead of a telephone there will be a thundering explosion, voice sound will be indistinguishable in any case. Therefore, a limitation of the loudest sounds is admissible and practically unnoticeable. As very loud one may consider sounds from +15...+20 dB as related to dialog level. Thus, when dialog level is equal to -27 dB (de facto for DVD), the admissible gain level is equal to +7...+12 dB. As one more reference, one may address gain histograms: gain up to +10 dB does not highly influence the histogram form, while at gain equal to +20 dB a very large number of sounds will be limited. Thus, one may consider gain up to +10 dB as admissible. This value may be kept practically always: in movies with quiet sound, it will notably raise loudness, and in loud ones, it will not hinder very much. (We should note once more that the reasoning is provided only for the case of watching DVD movies; for music and for most mpeg4 movies it is inapplicable, as histograms' characteristics are very different).

Gain can also be used for limiting signal loudness (e.g. for listening at night). When dialog levels are equal to -27 dB and gain is equal to +17 dB dialog levels after gain will be equal to -10 dB, and level of the loudest sound will be equal, as usual, to 0 dB that is just by +10 dB higher than dialog level. Thus, setting gain in the filter equal to +17 dB and adjusting the system loudness so that dialogs will be reproduced with acceptable absolute loudness, we will get a guarantee that the loudest sounds will not exceed dialog levels by more than 10 dB (though, certainly, in this case signal limitation will be used for loud sounds).

Dynamic range compression (DRC). It is much more difficult to determine bounds of using compression, as audibility of compression defects is highly dependable on acoustic system, listening conditions and the listener him(her)self. If histograms are used to solve this issue the upper limit of compression applicability may be estimated as +20...+30 dB (see histograms). In this case, loud sounds still remain distinguished from quiet ones by loudness. Thus, compression level is selected only audibly, until the necessary loudness is achieved and defects will remain unnoticeable.

It is necessary to take into account that compression and gain are applied simultaneously. Thus, when the gain level is +20 dB and compression is twofold (+25 dB) the actual gain level will be equal to +10 dB. It is normal, as a requirement for gain is also decreased with an increase of compression.

One-pass normalization. It is also applicable practically always. For high-quality systems with low gain level it will allow to diminish number of overflows; and in other cases, combined with compression, it will allow to achieve maximum loudness, also with minimum of distortions related to overflow. An initial gain level (Master level) for normalization is selected on the basis of goals; if a high gain is not needed the desirable gain is set. To achieve maximum loudness, one may set a level equal to +20 dB.

7.2. What is LFE?

The first that should be noted: LFE is not the same as a subwoofer! In addition, the designation '5.1' for AC3 does not coincide with '5.1' for an acoustic system. Thus, let us begin from considering this difference.

The AC3 format was created for cinema and not at all for a computer. At cinemas, quality acoustics may reproduce low frequencies quite adequately, whereas LFE was designed as a channel for powerful low-frequency effects that is effects, with which usual acoustics cannot cope any more. It is used in order to make walls to shake, people to cry from a horror and weak-nerved young ladies to faint. Thus, all channels of the ac-3 stream contain low frequencies and only sometime (in especially explosive moments) the LFE channel begins to operate, just to help the basic acoustics to shake wal ls. It may quite happen so that the LFE channel will never be switched on during the entire movie. This also will be normal.

The computer acoustics has quite another specific character. Most of contemporary 5.1 acoustics just is not able to reproduce low frequency through main speakers (satellites) and subwoofer should be entirely responsible for its reproduction. Thus, if we connect 5.1 acoustics and begin to watch movie with sound track of the 5.1 format assigning the LFE channel to the subwoofer (without redirection of low frequencies from main channels) we risk not to hear low frequencies at all!

Acoustic systems with subwoofer may be divided into several categories. First, an acoustic system may have a subwoofer, and, however, not have separate input for it. In this case, the acoustic system itself separates input signal into low-frequency signal for the subwoofer and signal for main speakers. It is necessary for such systems that all low-frequency information is already contained in the signal for main speakers, i.e. it is necessary to mix the LFE channel into all main channels. Thus, output mode should be indicated without subwoofer. (From the point of view of operating system and sound card, the acoustic system has no subwoofer, as the acoustic system has no separate input for it).

Systems where subwoofer is controlled separately may be divided into two categories.

To obtain more detailed information on purposes of channels, subwoofer and bass control, address the Dolby site: http://www.dolby.com/tech/c.in.0011.LFE.pdf (Eng).

7.3. SPDIF

SPDIF (S/P DIF, Sony/Philips Digital Interface, IEC958 "Digital audio interface" standard) is digital interface for sound transfer.

When the connection is analog, an acoustic system is connected in the following way:

Digital-to-analog converter (DAC) is situated on the sound card, and sound is transferred out of computer in the analog form. For connection, one needs three stereo wires (for 6-channel system) from sound card to amplifier and six wires from amplifier to each speaker.

When the connection is digital the following connection scheme is us ed:

Here, to connect a receiver, only one wire is needed, and sound is transferred in digital form. DAC and amplifier are situated in the receiver. As DAC is subjected to lesser influence of electronic noises outside computer, and amplifier is situated nearby, the receiver can provide better sound quality in the digital transfer mode.

Initially, the SPDIF standard enabled transmission only of uncompressed PCM audio in stereo mode. However, as the transfer is made in digital form, one may transfer data of any format, if only transmission rate is satisfactory. So, the standard for transferring of compressed multichannel audio is SPDIF-passthrough mode. (A multichannel sound cannot be transmitted in uncompressed mode, as the transmission rate is limited.)

The pass-through communication mode may be illustrated in the following way:

Thus, sound is transmitted unchanged through the entire system from DVD disc (or other sound source) to receiver; it explains the mode's name. As the data are transferred without changes, other computer sound processing options, including change of loudness, are unavailable. All settings may be made for receiver only. Besides, at the same time only one sound source may be reproduced, as mixing of compressed sound is impossible.

As sound is transmitted in compressed form, receiver should be able to decode it. There are many sound compression formats, and it is obvious that receiver cannot support all existing formats. So the following formats are supported: AC3 (DolbyDigital), DTS, MPEG Audio (including MP3). However, not all receivers support even this set of formats, usually the AC3 is supported, rarely - the DTS format and more rarely - the MPEG Audio format. The DTS format ensures higher quality of sound, but the AC4 format is more widespread, as it is the standard for DVD.

As a decoder, a digital-to-analog converter and an amplifier are in the same device, it is possible to adjust the gain level automatically. It eliminates loudness problem.

To use the pass-through transfer mode there are necessary:

However, even when all these requirements are met the pass-through transfer mode may be used far from always. E.g., the SPDIF standard allows to use just three sampling rate: 48 kHz (DVD), 44.1 kHz (AudioCD) and 32 kHz. If the source has another sampling rate, the pass-through transfer is impossible. Besides, some sound cards (e.g. Creative) support only 48 kHz frequency, i.e. the mode may be used for DVD only and cannot be used with many other sources.

Some sound tracks of the DTS format also cannot be transmitted in the pass-through transfer mode. The thing is that this format ensures very good quality of sound; however, this format has lower compression level. Some DTS records have too low compression level and SPDIF bandwidth is insufficient.

A short comparison table for analog and digital connections:

Analog connection Digital connection
(-) Many wires (+) One wire
(-) DAC is situated on sound card and is subjected to electronic noises inside computer (+) DAC is situated in an external device and can ensure better sound quality
(+) The format is practically unrestricted (-) There are many restrictions on a format of transmitted data
(+) It is possible to reproduce several sounds simultaneously (-) Only one sound source may be used
(+) Setting is more simple (-) Setting is more difficult
(+) Possibility of sound processing on computer (-) Sound processing on computer is impossible
(+) The price is lower (-) The price is higher

In some cases there is a possibility to transmit multichannel sound even when the pass-through transfer mode is unavailable (the format is not supported by receiver). To this effect, it is necessary to convert the format into a form understood by receiver:

Such model of operation enables to bypass many restrictions on the pass-through transfer mode. However, it is necessary to remember that coding to AC3 always results in quality loss (the AC3 format is a format of coding with losses, as well as e.g. the MP3 format), i.e. the main advantage of the digital connection turns out to be questionable. In these cases, it is recommended to use analog connection as it may ensure better quality.

8. Distribution

This software product is distributed under the GNU General Public License v2, the text is appended as the GNU_eng.txt file in English and the GNU_rus.txt in Russian. The variant in Russian is provided as a reference only; in case of any discrepancies, the English version prevails.

This software product may be used only for demonstration and training purposes. Any use for other purposes may be prohibited in some countries. This product may be distributed freely, except for cases prohibited by the legislation.

This software product is distributed exceptionally with a hope that it will be useful; however, without any guarantees of operation on a specific equipment, of meeting any conditions, standards or needs. I do not undertake an assurance of twenty-four-hour (or any) support, of an error correction, of recovery of data lost due to operation of the program, of liability for spoilt hardware as well as for lost working time. I am not responsible for legality of multimedia products reproduced by this program.

This software product may be distributed in form of installation package ($download_bin) in CD collections or along with other programs. No author permission is needed for it. For codec-packs, it is admissible to distribute the filter as individual module.

9. Ñâÿçü ñ àâòîðîì

All questions relating to the filter operation, errors, new ideas, proposals, etc. should be addressed to the following address : . Due to a vast number of spam, filtering is used, so the subject of the message itself should contain AC3Filter word and the message size should not exceed 20Ê. If you wish to send files send a preliminary notice.

If you have a problem, describe it in detail, up to a sequence of pressed keys. Information on the system's configuration (OS version, sound card), the player being used, filter settings (it is desirable to indicate fields) is required. Here is an example of a style that should not be used in a letter (a real message):

You have a cool prog, but I failed toanderstand anything. I have such stuff Topdevice 440D/5.1 and tsenter does not plough with ÀÑ-3 only 5.1? big requezt wha should be done for a tsentor to play (Winamp playr). One more requezt if you can this all quicker (a big desire that everything work "OK").

Such letters will be simply ignored. And now this is an example of a well-made message, which helped to solve the problem quickly:

Good day,
I have found a link to your codec at the IXBT forum, downloaded it, installed and a problem arouse.
To begin with, I inform you on my configuration.
Windows 98SE, C-Media 8738 (PCI) card, speaker mode - 5.1
I used 3 different players: WMP 6.01.05.0217, ZoomPlayer 2.5, BsPlayer 0.86.
The problem is: while playing DivX+AC-3 a sound is reproduced, but not AC-3, to be more precise, not a full-fledged AC-3. In the Output levels of codec's properties level of only three channels can be changed: two front and the central ones (see the figure). The subwoofer sound exists, but it is quiet and very "dirty", and there is no subwoofer level in Output levels. Rear speakers are silent.
Default WaveOut Device properties are:
wFormatTag: 65534
nChannels: 6
I wish to warn right away, that I downloaded and installed a patch for W98, a reference to which you gave on the project page.
I send registry branch and the Main page of codec's properties as attached files.

Before writing on an existence of a problem, make sure that the problem is really caused by the filter: uninstall the filter. If the problem remains, the filter has no connection to it - during deinstallation, the filter completely removes itself from the system leaving no traces.

I try to review all incoming problems, but simply do not have a physical capacity to deal with every problem individually. However, in any case, your messages are not lost for nothing - using accumulated statistics often leads to a successful solution of very knotty cases.

Read the documentation! Many "problems" are successfully solved by available settings.

Besides, I recommend to use the forum: you may find people with similar problems, which were successfully solved.

10. Donations

The project needs your help! If you like the program, it enables you to solve your problems, you will use it at home or in your professional activities, and if you have ideas how to improve it, you may support the further project development.

A financial support is accepted at this address:

Donate!

Any sum is important; this is not only money, but a moral support too.

You may support the project not only financially, I accept any help. In the first place, I need equipment for testing. E.g., a large number of problems relating to the SPDIF mode are caused by fact that I have absolutely no SPDIF equipment.

Besides, the project needs a site and its maintenance. In addition, first of all, a support of the forum is needed to provide a patient help to beginners as well as to identify really serious problems.

It is required to translate the documentation, as for large number of users neither Russian, nor English languages are the native ones. Unfortunately, interface translation is not planned yet, but translation of documentation is necessary.

11. Changelog.

$changelog

Copyright (c) 2002-2004 by Last updated $date