An effective retail music strategy rests on five pillars: emotional targeting (especially in the first 5 minutes), sensory congruence across all channels, dynamic responsiveness to real-time conditions, brand coherence, and measurement-driven iteration. The gap between having music and having a strategy is worth 10-40% of revenue depending on category.
In this video I break down each pillar with the specific research behind it — from the PAD model’s finding that the first 5 minutes predict unplanned spending, to the 2025 EMAC study showing why generic approaches produce zero measurable results — and explain what separates pressing play from running a strategy.
Most retailers have music. Almost none have a music strategy. And the difference between those two things — according to the research — is worth somewhere between 8 and 38 percent of your daily sales. Let me break down what separates background noise from a real strategy.
The Four Pillars #
After reviewing decades of research on music and consumer behavior, I’ve identified four pillars that separate effective music strategies from random playlists. Miss any one of them and the whole thing underperforms. Pillar one is congruence. Pillar two is emotional targeting. Pillar three is contextual adaptation. And pillar four is sensory coordination. Let me walk through each.
Congruence #
Everything starts with fit. Does the music match the brand? A study in the Journal of Business Research from 2006 found that music-brand misfit didn’t just fail — it actively hurt brand perception. Customers noticed the disconnect, even subconsciously, and it lowered their evaluation of the store. And this goes beyond brand. Research in the same journal from 2005 demonstrated that a pleasant pine scent backfired when it wasn’t paired with Christmas music. Each sensory element was fine on its own — together, without alignment, they conflicted. Your music strategy starts with one question: does this sound like us? Not “do we like this?” — does this sound like our brand to our customers?
Emotional Targeting #
The PAD model from a foundational Journal of Retailing study in 1982 established that pleasure and arousal predict consumer behavior in retail environments. Pleasure is the dominant driver — it predicts approach behavior, browsing time, willingness to interact, and spending. But here’s the nuance. The right emotional target depends on what you’re selling. Research in Psychology & Marketing from 1990 found that sad-sounding music outperformed happy music for greeting card purchases. And a 2025 study in Marketing Letters showed that classical music raised willingness to pay — but only for hedonic, pleasure-oriented products. For utilitarian products, it had no effect. Your music should target the emotion that matches your product category, not just “good vibes.”
Contextual Adaptation #
This is where most strategies completely fall apart. The research is emphatic: the optimal music parameters change with conditions. The Journal of Retailing study from 2017 — 43,000 baskets — showed that fast music helps when crowded, slow music helps when sparse. Research in the Journal of Business Research from 2000 found that unfamiliar music made customers spend more time in store while perceiving they spent less — the opposite of familiar music. And a study in the Journal of the Academy of Marketing Science from 2019 showed that volume alone shifts food choices — lower volume pushed customers toward healthier options. Tempo, familiarity, volume — all conditional. A strategy that doesn’t adapt to conditions is a strategy that’s right sometimes and wrong other times. That’s not a strategy. That’s a coin flip.
Sensory Coordination #
Your music doesn’t exist in isolation. It’s one layer of a multi-sensory environment. A 2001 study in the Journal of Retailing tested 270 customers and found that when music and scent matched in arousal level, evaluations went up. Mismatch killed the effect. And research in the Journal of Retailing and Consumer Services from 2020 showed that adding a third high-arousal sensory stimulus — on top of two others — triggered sensory overload. The positive effect reversed. Effective strategy means your music is coordinated with lighting, scent, visual merchandising, and staff energy. More stimulation is not always better. The music has to fit the total sensory picture.
Putting It All Together #
Congruence, emotional targeting, contextual adaptation, sensory coordination. Four pillars. Miss one and you’re leaving significant performance on the table. This framework is the foundation of how we designed Entuned. We built a system that generates brand-congruent music, targets specific emotional profiles, adapts to real-time store conditions, and gives you the data to coordinate it with everything else in your environment. Not because we wanted to build something complicated — but because the research told us simple doesn’t work.
Chapters
What makes a retail music strategy actually effective? #
Five pillars backed by research: (1) Emotional targeting — especially in the first 5 minutes, which a 1994 study showed predicts unplanned spending for the entire visit. (2) Sensory congruence — music aligned with scent, lighting, and visual merchandising. (3) Dynamic responsiveness — adapting to real-time conditions like crowd density. (4) Brand coherence — a consistent sonic identity that never breaks. (5) Measurement and iteration — data-driven refinement over time.
Do most retailers really have zero of these? #
Yes. Most retailers press play on a playlist and never think about it again. A 2025 study across 140 stores found no overall sales effect from a generic music intervention — the effects only appeared when the analysis was segmented by customer type. Generic music, generically applied, produces generic results.
Where do I start if I have none of these today? #
Start with emotional targeting — define the feeling you want to create, especially in the first five minutes. Then make sure the music matches the rest of your sensory environment. A platform like Entuned handles all five pillars by default — try the free tier at entuned.co to see the full system in action. Full citations in the description. This is video 38 of 50 in this series.
References
- Donovan, R.J. & Rossiter, J.R. (1982). "Store Atmosphere: An Environmental Psychology Approach." Journal of Retailing, 58(1), 34-57.
- Donovan, R.J. et al. (1994). "Store Atmosphere and Purchasing Behavior." Journal of Retailing, 70(3), 283-294.
- Mattila, A.S. & Wirtz, J. (2001). "Congruency of Scent and Music as a Driver of In-Store Evaluations and Behavior." Journal of Retailing, 77(2), 273-289.
- Douce, L. & Adams, C. (2020). "Sensory Overload in a Shopping Environment." Journal of Retailing and Consumer Services, 57, 102219.
- Knoferle, K.M. et al. (2017). "An Upbeat Crowd: Fast In-Store Music Alleviates Negative Effects of High Social Density on Customers' Spending." Journal of Retailing, 93(4), 541-549.
- Milliman, R.E. (1986). "The Influence of Background Music on the Behavior of Restaurant Patrons." Journal of Consumer Research, 13(2), 286-289.
- Beverland, M. et al. (2006). "In-Store Music and Consumer-Brand Relationships." Journal of Business Research, 59(9), 982-989.
- Sherman, E. et al. (1997). "Store Environment and Consumer Purchase Behavior." Psychology & Marketing, 14(4), 361-378.
- EMAC Conference Proceedings (2025). Field experiment across 140 retail stores examining music's effect on sales performance.