Section 125 plans deliver real payroll tax savings — but only when they pass the IRS's annual nondiscrimination tests. Most failures come down to the same handful of avoidable mistakes: stale census data, missed deadlines, or misclassified employees. This guide walks through what each test actually measures, who's responsible for running it, and what to do if your plan doesn't pass.
Why Nondiscrimination Testing Exists
The pre-tax treatment of Section 125 contributions is a benefit Congress wrote into the tax code with a clear condition attached: it has to be available broadly, not just to the people running the company. Nondiscrimination testing is the mechanism the IRS uses to confirm that condition is being met every plan year. If the rank-and-file workforce is meaningfully participating in the plan, the tax advantages stand. If the plan is functionally a tax shelter for a small group at the top, the IRS reclassifies their pre-tax contributions as taxable income and the savings disappear.
This isn't a one-time setup check. Cafeteria plans must be tested annually, and the burden falls on the plan sponsor — meaning the employer — not the carrier or the broker. Even plans that have passed comfortably for years can fail after a round of layoffs, an acquisition, a wage adjustment, or a shift in who's enrolling. Treating testing as a yearly housekeeping task rather than a setup formality is what keeps the savings intact.
The Three Tests Every Cafeteria Plan Must Pass
Section 125(b) of the Internal Revenue Code prescribes three distinct tests, and a plan has to clear all three to be considered nondiscriminatory. Each test measures a different angle of the same question: are the tax benefits flowing disproportionately to the wrong people?
The eligibility test looks at who is allowed to participate in the plan in the first place. The contributions and benefits test looks at how the plan's value is actually distributed among those who do participate. The key employee concentration test caps the share of pre-tax benefits flowing to a narrowly defined group of officers and owners. Failing any one of them triggers the same consequence — taxable reclassification of benefits for the favored group — so it pays to understand each one before plan year-end, not after.
Test 1: The Eligibility Test
The eligibility test asks a simple question: does the plan let regular employees in on similar terms as highly compensated employees? In practice, that means the plan can't impose age, service, or classification requirements that conveniently exclude lower-paid workers while letting executives qualify on day one. A plan that requires three years of service to enroll, for example, is going to draw IRS scrutiny if the executive team happens to be the only group that meets that bar.
To pass, employers generally need to show that a sufficient cross-section of non-highly-compensated employees (NHCEs) is eligible to participate, and that participation requirements aren't tilted in favor of the highly compensated group. The cleanest way to clear this test is to use the same eligibility rules — typically a short waiting period, applied uniformly — for everyone. Carve-outs and special tracks for executives are where most eligibility failures begin.
Test 2: The Contributions and Benefits Test
The second test moves past who can join and looks at what they actually receive. Even a plan with universal eligibility can fail here if the benefit structure is engineered in a way that pushes most of the tax savings to the top. The IRS evaluates whether contributions and benefits are available on a nondiscriminatory basis — meaning the same menu, the same employer contributions, and the same opportunity to elect pre-tax treatment, regardless of compensation level.
This is where employers most often run into trouble without realizing it. A higher employer match for executives, a richer FSA option for a specific job grade, or a tiered employer contribution that scales with salary can all flag the plan as discriminatory in operation, even if it looks fine on paper. The test isn't only about how the plan is written — it's about how the benefits actually land across the workforce once enrollment closes.
Test 3: The Key Employee Concentration Test
The concentration test is the one most plans actually fail. The rule is straightforward: no more than 25 percent of the total nontaxable benefits provided under the cafeteria plan can go to "key employees." A key employee is defined under Internal Revenue Code Section 416(i) and generally includes officers above a compensation threshold, more-than-5 percent owners, and more-than-1 percent owners earning above a separate threshold. The IRS adjusts the dollar thresholds periodically.
The reason this test trips up so many plans is that the math is driven by participation behavior, not plan design. If rank-and-file workers decline to enroll because they don't understand the benefit, can't afford the elective contribution, or weren't told the plan existed, the share of total pre-tax benefits flowing to keys can quietly drift past 25 percent. The fix is rarely to limit what executives can elect — it's to drive broader participation downstream so the denominator grows.
What Happens If a Plan Fails
The penalty for a failed nondiscrimination test isn't a fine, an audit, or plan disqualification. It's narrower and more targeted: the IRS reclassifies the pre-tax benefits provided to highly compensated or key employees as taxable income for the year in which the test failed. Those individuals end up owing income tax on amounts they thought were sheltered, the employer typically owes back FICA on those wages, and W-2s have to be corrected. The plan itself continues operating for everyone else.
The damage compounds when failures aren't caught quickly. A failure identified during the plan year can sometimes be cured through corrective action — refunding contributions, adjusting elections, or capping participation by keys before year-end. A failure caught only at the next year's testing cycle generally means amended W-2s, retroactive tax obligations, and a credibility problem with the executives whose tax bill just changed retroactively. Mid-year monitoring is what separates a clean fix from an expensive one.
How to Make Testing a Non-Event
The employers who never worry about nondiscrimination testing aren't lucky — they've built three habits into how they administer their plan. First, they keep the plan design uniform: the same eligibility rules, the same contribution structures, and the same benefit menu across all employee classifications. Special carve-outs for executives are the single biggest source of avoidable failures. Second, they run testing twice — once in mid-year as a projection using current participation data, and once formally at year-end. The mid-year run gives them time to drive enrollment or adjust elections if a test is trending toward failure.
Third, they treat broad participation as a compliance asset, not just a feel-good metric. Communication, education, and easy enrollment processes aren't soft HR work in the context of Section 125 — they're what keeps the concentration test in safe territory. A plan where 90 percent of eligible employees participate has very different testing dynamics than one where only the top quartile bothers to enroll. Benefits TaxShield's preventive care plan is structured so that participation is the default rather than an opt-in obstacle course, which keeps the denominator healthy and the testing math straightforward.
Want testing handled, not handed back to you?
Benefits TaxShield administers Section 125 plans with built-in nondiscrimination testing, automated eligibility monitoring every payroll cycle, and a participation model designed to keep concentration math in safe territory. See your potential savings, or talk through your testing strategy with our team.