We subjected the Xiaomi 13 Ultra to our rigorous SBMARK audio test suite to measure its performance both when recording sound using its built-in microphones, and when playing audio through its speakers.
In this review, we’ll analyze how it performed in a variety of tests and several common use cases.

Overview


Key audio specs include:

  • Two speakers (top side, bottom side)
  • No audio jack output
  • Dolby Atmos technology

Reproduction

Pros

  • Good tonal balance, natural highs
  • Great dynamic performance with excellent attack
  • Without artifacts at soft and nominal volumes

Against

  • Occasionally metallic tonal balance
  • Blurry localizability

Registration

Pros

  • Excellent tonal balance in all use cases
  • Good dynamic performance

Against

  • Not great for recording loud events, like concerts
  • Limited range and locationability with front facing camera

In our SBMARK audio tests, the Xiaomi 13 Ultra performed well overall, consistent across all playback and recording secondary tests. On playback, it worked best when gaming, but the results were of an equally high standard when listening to music or watching movies. Recording results were best with the main camera and great with the memo app. The narrow width of the audio scene and fuzzy localizability meant that the audio performance of the front camera lagged slightly.

The sonority was very good and the dynamics were punchy and precise, both with the integrated speakers in playback and recording. However, our experts found the device to be unsuitable for recording at high sound pressure levels, such as concerts, and the audio zoom function was not very effective at focusing on a particular sound source.

Trial summary

Learn about SBMARK audio tests: For scoring and analysis in our smartphone audio reviews, SBMARK engineers perform a series of objective tests and undertake more than 20 hours of perceptual evaluation under controlled laboratory conditions.
(For more details on our playback protocol, click here; for more details on our recording protocol, click here.)

The following section compiles the key elements of our extensive testing and analysis performed in the SBMARK laboratories. Detailed performance evaluations in the form of reports are available upon request. Do not hesitate to contact us.

How the audio playback score is composed

SBMARK engineers test playback through smartphone speakers, the performance of which is evaluated in our labs and under real-life conditions, using apps and preset settings.

The Xiaomi 13’s tonal performance under playback was good overall, although our testers found the tonal balance slightly tinny at times. Treble delivery was nice and natural, but the midrange lacked warmth and body, and the device lacked bass overall. Dynamic performance was excellent, with great attack, precise bass and good punch. Our experts also found the spatial performance to be good, with an excellent balance between the two speakers, a decent width of the audio scene and an accurate rendering of the distance. However, the localizability of individual sound elements was a bit fuzzy and the stereo output didn’t follow the rotation of the device with the default movie app.

Volume was good at maximum volume, and our experts found the lowest volume level was loud enough to hear quiet volume passages in highly dynamic content, such as classical music. The distribution of volume steps was also quite consistent. Also, unwanted audio artifacts were pretty well under control at low and nominal volume. However, our testers did notice some distortion at maximum volume, especially at higher frequencies. When playing games, we found the speakers to become clogged quite easily. This resulted in a muffled sound or even complete muting.

Hear about the playback performance of the smartphone tested in this comparison with some of its competitors:

Recordings of smartphones playing some of our songs at 60 LAeq in an anechoic environment via 2 microphones in AB configuration, at 30 cm

Here’s how the Xiaomi 13 Ultra fares in playback use cases compared to its competitors:

Playback of use case scores

The Timbre score represents how well a phone reproduces sound across the audible tonal range and takes into account bass, mids, treble, tonal balance, and volume dependence. It is the most important attribute for reproduction.

Frequency response of music reproduction

A 1/12-octave frequency response graph, which measures the loudness of each frequency emitted by your smartphone as it reproduces a pure sine wave in an anechoic environment.

The Dynamics Score measures the accuracy of changes in the energy level of sound sources, such as how accurately a bass note or impact sound of drums is played.

Secondary attributes for spatial testing include pinpointing the location of a specific sound, its positional balance, distance, and amplitude.

The volume score represents the overall volume of a smartphone and how smoothly the volume increases and decreases based on user input.

Here are some sound pressure levels (SPL) measured while playing our sample recordings of hip-hop and classical music at maximum volume:

hip-hop Classic
Xiaomi 13 Ultra 71.4 dBA 69.4 dBA
Samsung Galaxy S23 Ultra (Snapdragon) 74.8 dBA 71.8 dBA
Apple iPhone 14 Pro Max 74 dBA 71.1 dBA

The following graph shows the gradual changes in volume from minimum to maximum. We expect these changes to be consistent across the range, so that all volume steps match user expectations:

Music volume consistency

This line graph shows the relative loudness of the playback versus the user selected volume step, measured at several volume steps with correlated pink noise in an anechoic box recorded 0.20 meter on axis.

The artifact score measures the extent to which the sound is affected by various types of distortion. The higher the score, the less noticeable sound disturbances are. Distortion can occur due to the sound processing in the device and the quality of the speakers.

Playback Total Harmonic Distortion (maximum volume)

This graph shows total harmonic distortion and noise over the audible frequency range.
It represents the distortion and noise of the device playing our test signal (0 dB Fs, Sweep Sine in an anechoic box at 40cm) at the device’s maximum volume.

How the score of the audio recording is composed

SBMARK engineers test recording by evaluating recorded files on reference audio equipment. These recordings are made in our laboratories and under real-life conditions, using apps and default settings.

On the recording, the timbre was good overall, with consistent tonal balance between use cases but a slight lack of clarity. However, when recording at high sound pressure levels, such as at rock or pop concerts, the tonal balance was focused on the upper midrange. Depending on the content, the treble could sound dull and there was a serious lack of low-end extension across all types of content. Dynamic recording performance was good overall, with a precise envelope and correct signal-to-noise ratio, although background sounds could have been attenuated a bit more.

The breadth of the recorded audio scene and the localizability of individual sound sources were great with the main camera and memo app, but more limited when recording with the selfie camera in portrait orientation. Distance rendering was good, with accurate perception of voices in the audio scene. The recordings sounded quite loud and were generally artifact-free. However, at high sound pressure levels, our testers noted heavy compression. The background was clean in all use cases, with a natural tonal balance.

Here’s how the Xiaomi 13 Ultra fares in recording use cases compared to its competitors:

Use case scoring

The Timbre Score represents how well a phone captures sounds across the audible tonal range and takes into account bass, mids, treble, and tonal balance. It is the most important attribute for registration.

Video frequency response of life

A 1/12-octave frequency response graph, which measures the loudness of each frequency captured by your smartphone while recording a pure sine wave in an anechoic environment.

The Dynamics Score measures the accuracy of changes in the energy level of sound sources, such as how accurately plosives in a voice (p, t, k, for example) are reproduced. The score also considers the signal-to-noise ratio (SNR), such as how loud the lead voice is compared to the background noise.

Secondary attributes for spatial testing include locating a specific sound’s location, positional balance, distance, and amplitude on recorded audio files.

Directness of registration

Smartphone directivity graph while recording test signals using the camera app, with the main camera. It represents the acoustic energy (in dB) on the angle of incidence of the sound source. (Normalized to 0° angle, in front of the device.)

The loudness score represents how loud audio is normalized on recorded files and how well the device handles loud environments, such as electronic concerts, while recording.

Here are the sound levels recorded in the audio and video files, measured in LUFS (Loudness Unit Full Scale); for reference, we expect volume levels to be above -24 LUFS for recorded content:

Encounter Videos life Selfie videos Memorandum
Xiaomi 13 Ultra -24.2 LUFS -20.6 LUFS -18.9 LUFS -20.4 LUFS
Samsung Galaxy S23 Ultra (Snapdragon) -26.5 LUFS -21.8 LUFS -22.4 LUFS -21.6 LUFS
Apple iPhone 14 Pro Max -26 LUFS -22.9 LUFS -19.2 LUFS -19.6 LUFS

The Artifacts score measures the extent to which recorded sounds are affected by various types of distortions. The higher the score, the less noticeable sound disturbances are. Distortions can occur due to in-device sound processing and microphone quality, as well as user handling, such as how the phone is held.

In this audio comparison, you can hear how this smartphone handles wind noise compared to its competitors:

matrix(3) {
[“Xiaomi 13 Ultra”]=> string(66) “resources/Xiaomi/13UltraV2.1/Xiaomi13Ultra_MicrophoneArtifacts.m4a”
[“Samsung Galaxy S23 Ultra”]=> string(74) “resources/Xiaomi/13UltraV2.1/SamsungGalaxyS23Ultra_MicrophoneArtifacts.m4a”
[“Apple iPhone 14 Pro Max”]=> string(72) “resources/Xiaomi/13UltraV2.1/AppleiPhone14ProMax_MicrophoneArtifacts.m4a” }

Recordings of a voice sample with slight background noise, facing a 5 m/s turbulent wind

Background evaluates how smoothly various sounds around a voice blend into the video recording file. For example, when recording a speech at an event, the background shouldn’t interfere with the main vocal, but should provide context of your surroundings.

Let's talk about "Xiaomi 13 Ultra Audio test" with our community!
Start a new Thread

Philip Owell

Professional blogger, here to bring you new and interesting content every time you visit our blog.