
How to Use Physical Modeling to Create Impact Sounds
How to Use Physical Modeling to Create Impact Sounds
1) Introduction: why impacts are harder than they look
Impact sounds—hits, drops, collisions, crushes—sit in an awkward place between “short transient” and “complex resonant event.” They are not just impulses with reverb; they are time-varying excitations coupled to objects whose resonances shift with contact conditions, geometry, and boundary constraints. A single hammer strike on a steel plate can show a broadband attack with measurable high-frequency content above 10–15 kHz, followed by decays that vary by mode, plus nonlinearities such as rattles, buzzes, or partial sticking.
Traditional sample-based design works well until you need parameterized impacts: the same object hit at different velocities, with different materials, sizes, or contact angles, while remaining physically coherent. Physical modeling addresses that: it generates impacts by simulating the mechanics that create them. The goal of this article is to translate the core physics into practical sound design workflows—what to model, what to simplify, what parameters matter most, and how to make the results mix-ready without losing physical plausibility.
2) Background: the physics and engineering under the hood
2.1 Contact mechanics: the impact as a force signal, not an impulse
In audio, we often approximate a hit as an impulse (Dirac delta) exciting a linear resonator. Real impacts are finite-duration force interactions governed by contact stiffness, geometry, and damping. A widely used approximation is Hertzian contact for elastic bodies. For a spherical indenter contacting a flat surface, the contact force can be modeled as:
F(t) = k · δ(t)3/2
where δ is indentation (overlap) and k depends on effective Young’s modulus and curvature. In practice, you rarely need full Hertz derivations; you need the consequences:
- Impact duration is not arbitrary. Hard materials and small contact areas produce shorter contacts (often 0.1–2 ms for small metal-on-metal taps, several ms for rubber/wood), which raises excitation bandwidth.
- Force is nonlinear with indentation; at high velocity, peak force rises faster than linearly, changing spectral balance (brighter attacks).
- Damping during contact (viscoelastic loss, plasticity, micro-slip) determines “thud” vs “ping.”
2.2 Modal behavior: objects don’t “ring,” they ring in modes
After the contact, the struck object behaves (approximately) as a linear time-invariant (LTI) system: a sum of damped sinusoids (modes). For many rigid objects, sound radiation is well approximated by modal synthesis:
x(t) = Σm Am e-t/τm sin(2π fm t + φm)
Two engineering parameters dominate perceived material and size:
- Modal frequencies (fm): geometry and stiffness-to-mass ratio. A larger object generally lowers modes; a stiffer material raises them.
- Decay times (τm): internal damping and radiation damping. Metals often show long high-Q partials; wood/plastics damp faster, especially at higher frequencies.
2.3 Waveguide and finite-difference views: when the object is extended
Modal models work well for compact objects (bells, plates, bars) when you can precompute or approximate modes. For extended structures (beams, strings, membranes), two other families are common:
- Digital waveguides represent wave propagation and reflections. Great for strings, rods, and tubes; inherently stable and efficient.
- Finite difference / finite element time domain (FDTD/FEM) discretize the PDE (e.g., plate equation) and simulate time evolution. They handle complex boundaries and nonlinear contacts but cost more CPU.
For impact sound design, you typically combine: a nonlinear contact model feeding a linear resonator (modal or waveguide). That hybrid gives the most “believable” transient while keeping computation manageable.
3) Detailed technical analysis (with data points you can use)
3.1 Set the excitation bandwidth via contact time
A simple but useful rule: the shorter the force pulse, the higher the spectral centroid of the attack. If you approximate the contact force as a half-sine pulse of duration T, its first spectral null occurs at roughly f ≈ 1/T. That gives quick calibration targets:
- T = 0.2 ms → first null ~5 kHz (significant energy well above 10 kHz). Typical “ticky” metal taps.
- T = 1 ms → first null ~1 kHz (still broadband, but less air/zing). Typical wood-on-wood hits.
- T = 5 ms → first null ~200 Hz (attack dominated by low-mid). Typical rubber mallet thuds.
In a physical model, T emerges from stiffness and effective mass. In a practical sound design tool, you can expose contact stiffness and contact damping as knobs, but validate them by measuring the resulting force-pulse duration (or the resulting audio’s high-frequency roll-off).
3.2 Damping: map physical Q to audible decay
Engineers often speak in Q-factor. For a damped mode at frequency f with exponential decay time constant τ (amplitude decays as e-t/τ), the approximate relation is:
Q ≈ π f τ
Example calibration: a steel-like partial at 2 kHz with a clearly ringing decay might have &tau = 0.6 s. Then Q ≈ π × 2000 × 0.6 ≈ 3770. That’s extremely resonant—consistent with thin metal objects or bells. A wood-like partial at 2 kHz with &tau = 0.08 s gives Q ≈ 503, much drier.
In modal synth, set &tau per mode (higher modes usually decay faster due to radiation and internal loss). A practical heuristic that matches many real objects:
- Set &tau(f) to decrease with frequency, e.g., &tau(f) = &tau0 / (1 + (f/fc)α) with &alpha between 0.5 and 1.5.
3.3 Mode spacing: what makes “metal” sound metallic
A common oversimplification is to use harmonic partials. Many impacts are inharmonic. A rectangular plate, a bar, or a bell has modal ratios that are not integer multiples. That inharmonicity creates the characteristic “clang.”
For a uniform bar with free-free boundary conditions, bending mode frequencies scale approximately with the square of mode number (fn ∝ n2) rather than linearly. That single fact explains why bar/plate strikes feel “hard” and “metallic”: high modes are further apart than harmonic series, yielding distinct spectral lines rather than a harmonic tone.
Practical implication: if your physical model sounds too tonal or “pitched,” you likely have:
- Too few modes (missing dense high-frequency resonances).
- Overly harmonic mode placement.
- Contact too soft (insufficient excitation of higher modes).
3.4 Numerical stability and sample-rate realities
Physical modeling is unforgiving about time step and aliasing. The contact nonlinearity can generate high-frequency components beyond Nyquist. At 48 kHz, Nyquist is 24 kHz; a stiff, fast contact can easily generate force components above that. If you hear “digital grit” or inconsistent brightness with velocity, aliasing is a prime suspect.
Engineering mitigations:
- Oversample the contact stage (2× to 8×). Downsample after force is applied to resonator (or after initial transient) with a proper low-pass.
- Band-limit the nonlinearity (e.g., smooth the indentation signal with a short low-pass) while preserving the intended contact duration.
- Use energy-consistent integrators for FDTD/wave models. Many production systems use carefully tuned schemes to avoid energy blow-up.
3.5 A practical signal path: “contact → resonator → radiation”
Most usable impact models can be decomposed into three blocks:
- Contact force generator (nonlinear spring + damper + collision logic): outputs force F(t).
- Structural response (modal/waveguide/plate model): converts force to velocity/displacement at a pickup point.
- Radiation / pickup filtering: maps structural motion to acoustic pressure at a mic position. Often approximated with a frequency tilt and a few resonant/notch features; more detailed approaches use boundary element methods, but that’s rare in real-time.
If you want the model to behave like a recorded sound, the third block matters more than many designers expect. Two identical plates with different mic distances can differ by 10 dB or more above 8–10 kHz due to air absorption, mic off-axis response, and radiation directivity.
4) Real-world implications and practical applications
4.1 Parameterization: designing “one object, many hits”
Physical modeling shines when you need consistent variation:
- Velocity: should change peak level, brightness, and sometimes perceived pitch (through stiffness nonlinearity or mode-dependent excitation).
- Material: map to Young’s modulus (stiffness), density (mass), and loss factor (damping). In practice, you expose these as “hardness,” “mass,” and “ring.”
- Contact geometry: small contact area (metal tip) yields shorter T and higher bandwidth; broad contact (mallet) yields a softer, longer pulse.
- Strike position: changes mode amplitudes. Hitting at a nodal line suppresses certain modes—very audible on plates/bars.
4.2 Mix integration: peak management without destroying the physics
Impacts create extreme crest factors. A physically plausible hit can have 20–30 dB peak-to-RMS ratios, especially for hard contacts. If you limit too aggressively, you flatten the transient and the modeled contact loses meaning.
Mix-friendly strategies:
- Split-band dynamics: tame 2–8 kHz “spit” separately from sub/low-mid body.
- Transient shaping after modeling: preserve the first 1–3 ms, control the next 10–50 ms.
- Use true-peak aware limiting when delivering to broadcast/streaming pipelines; transient-rich impacts can create inter-sample peaks. (This is standard practice in modern loudness workflows, e.g., ITU-R BS.1770 family for measurement, though impacts are usually handled as effects rather than full-program compliance targets.)
5) Case studies: how physical modeling shows up in professional work
5.1 Film/TV: scalable metal hits for props and UI
A production need: a family of “device taps” that match picture across multiple scenes—same prop, different intensities, close-ups vs wide shots. A typical workflow:
- Model a thin aluminum-like plate (inharmonic modes, moderate damping) with 30–80 modes up to 18 kHz.
- Use a stiff contact for fingernail/metal tool variants (contact duration 0.2–0.6 ms), and a softer contact for knuckle variants (1–3 ms).
- Randomize strike position within a constrained zone to vary mode amplitudes without changing object identity.
- Render close-mic and room-mic perspectives by changing the radiation EQ: close-mic adds 6–10 dB above ~6 kHz and less early room; wide reduces top end and increases early reflections.
Compared with pure sampling, the advantage is continuity: the “same prop” remains consistent across edits while still providing natural variation.
5.2 Games/VR: real-time impacts with performance constraints
Interactive systems need deterministic and lightweight models. A common real-time approach:
- Use a modal bank with 10–40 modes per object category.
- Precompute mode frequencies and decays; compute amplitudes at runtime from strike position and velocity.
- Use a simplified nonlinear contact with a velocity-dependent pulse width (so higher velocity becomes slightly brighter without expensive collision solving).
Concrete performance note: 30 modes at 48 kHz with efficient biquad/oscillator implementations can fit within a small CPU budget on modern hardware, especially if voices are culled by audibility.
5.3 Music production: physically modeled percussion layers that follow tempo
In electronic production, physical modeling is often used to create “designed realism”: impacts that feel acoustic but lock perfectly to a grid.
- Design a bar/plate model with decay &tau tied to tempo subdivisions (e.g., set high-mode decay to ~150–250 ms so the tail clears by the next 16th note at 120 BPM).
- Drive it with MIDI velocity and automate contact stiffness to accent sections.
- Layer with a low-frequency synthesized thump (30–80 Hz) to control sub translation while keeping modeled high-frequency detail for presence.
6) Common misconceptions (and what’s actually true)
Misconception 1: “An impact is just an impulse into a resonator.”
Impulse excitation can approximate very hard, very short contacts, but it misses the velocity-dependent pulse width and nonlinear force curve that change brightness and punch. If all velocities sound like “same timbre, different level,” you’re hearing an impulse model masquerading as physics.
Misconception 2: “More modes always equals more realism.”
Past a point, mode count without correct damping and radiation produces “synthetic shimmer.” Real objects often have frequency-dependent losses; many high modes die quickly. Better 30 modes with believable &tau(f) than 200 modes with uniform decay.
Misconception 3: “Material is just EQ.”
EQ can mimic spectral tilt, but material changes mode distribution, decay behavior, and contact dynamics. A “metal” EQ on a wood-like modal set rarely convinces because the inharmonic spacing and high-Q decays are missing.
Misconception 4: “Physical modeling is inherently more realistic than recordings.”
Physical modeling is more coherent under parameter changes. A well-recorded library remains the gold standard for absolute realism. The best professional workflows hybridize: modeled core for continuity and controllability, layered with recorded micro-details (scrapes, debris, air, room).
7) Future trends: where impact modeling is headed
7.1 Data-driven parameter estimation from recordings
Instead of hand-tuning mode frequencies and decays, emerging tools fit modal parameters to recordings: detect spectral peaks over time, estimate per-mode damping, and infer excitation characteristics. The result is “measured physical modeling”: a model that behaves like a recorded object but stays controllable.
7.2 Better contact models at real-time rates
Nonlinear contact is the realism bottleneck. Expect more robust real-time solvers and perceptually motivated approximations: stiffness curves that track measured force-time profiles, micro-bounce models for hard surfaces, and frictional micro-slip to generate plausible high-frequency noise during glancing blows.
7.3 Spatial/audio radiation modeling for immersive formats
As immersive delivery (binaural, object-based, multichannel) becomes standard, directivity and radiation will matter more. Plate-like objects radiate differently by frequency and mode shape. Even simple directivity models (frequency-dependent polar patterns) can make impacts “sit” in 3D scenes more convincingly than static convolution alone.
7.4 Hybrid rendering: modal core + procedural residual
A promising architecture is to render the deterministic modal component and add a stochastic “residual” that matches measured broadband decay and microphone artifacts. This avoids the sterile quality of purely deterministic models while remaining parameter-stable.
8) Key takeaways for practicing engineers
- Model impacts as a finite contact event, not an ideal impulse. Contact duration (0.2–5 ms range depending on materials) sets your excitation bandwidth.
- Inharmonic mode placement and frequency-dependent damping are the signatures of many impact sources. Use Q/τ relationships (Q ≈ π f τ) to tune decays with intent.
- Stability and aliasing are practical constraints. Oversample or band-limit the contact nonlinearity when stiffness is high, especially at 48 kHz.
- Strike position is not a “randomizer”; it’s a physical control that changes which modes are excited. Use it to create variation that still sounds like the same object.
- Radiation/pickup filtering is part of the model. Close vs distant impacts can differ by >10 dB in the top end; build that into your perspective controls.
- Hybrid workflows win: physical model for coherence + selective recorded layers for micro-detail and environment.
Visual guide (conceptual block diagram)
Diagram description: Imagine a three-box chain. Box 1 is “Nonlinear Contact” with inputs: relative velocity, effective mass, stiffness, damping, and contact area; output is force F(t). Box 2 is “Structure” (modal bank/waveguide/plate FDTD) converting force to velocity at a pickup point, with parameters: mode frequencies fm, decays τm, mode shapes, boundary conditions. Box 3 is “Radiation / Mic” applying frequency tilt, directivity shaping, and distance/air absorption to output acoustic pressure p(t). An optional side path adds “Residual Noise” (friction, debris, buzz) mixed post-structure.
Physical modeling doesn’t require a PhD-level simulation to be effective for impact design. It requires choosing the right physical abstractions, validating them with measurable targets (contact time, mode spacing, decay constants), and building controls that map to real-world causes. When done well, you get impact sounds that stay believable as you push them—harder, softer, bigger, smaller, closer, farther—without falling apart.









