Statistical process control lives on an old assumption: the process you monitor has one stable, well-behaved distribution when it is in control. For many modern operations, that assumption breaks. Teams mix materials from different sources, run alternating recipes, share equipment between product families, or cycle tools through warm-up and steady-state. The data then show two humps, not one. If you try to shoehorn that into a classic X-bar and R chart, you chase noise, send too many false alarms, and miss real shifts. That is where a bimodal perspective earns its keep.
I have spent a chunk of my career wrestling with multimodal data in high-mix manufacturing and labs where test stands rotated between pilot runs and production. Every time we pretended the data were unimodal, we paid for it with downtime or blind spots. Every time we recognized the dual modes and built that reality into the SPC design, alarms quieted, investigations got sharper, and improvement projects stopped burning hours on ghosts.
This article lays out how to recognize a bimodal process, what risks come from ignoring it, and how to incorporate dual operating modes deliberately into your control strategy. A bimodal chart is not magic; it is a structured way to honor context while preserving statistical rigor.
Where bimodality hides in plain sight
Bimodality often comes from structural features of the process, not random variation. Before we get to analytics, it helps to build a mental inventory of common scenarios.
A classic case arises in equipment that needs thermal stabilization. The first few units after startup come out heavier or longer because fixtures are cold, then measurements settle into a different distribution once everything warms. If you mix those subpopulations, your histogram will look like a camel, with a left hump for cold start and a right hump for steady-state.
Another pattern surfaces when switching between two validated recipes. Think of a blending line that alternates between high-viscosity and standard-viscosity products on the same filling machine. Even after changeover, residual effects push the next 20 to 50 fills toward the prior recipe’s center. The distribution becomes a blend of two means with some overlap.
Suppliers create a third pattern. You order the same spec component from two qualified vendors. One consistently ships at the low end of the tolerance, the other at mid-spec. On assembly torque, you now have two operating centers that reflect incoming variation. Try to set one pair of control limits, and you either get false violations on the low vendor or numbness to real shifts on the mid vendor.
Metrology can do it too. When operators use two gages interchangeably, and one gage reads 0.8 units higher on average, every interleaved run will stack into a two-peak distribution. If the gage bias lives inside the product tolerance, it may stay invisible until you aggregate months of data.
The common thread is a stable difference in the mean, sometimes in the spread as well, that ties to a known context flag: warm vs steady, supplier A vs supplier B, recipe X vs recipe Y, gage 1 vs gage 2, day shift vs night shift, or line 3 after maintenance vs normal line 3. That context is the key to handling bimodality.
Why traditional SPC stumbles
Classical Shewhart charts assume one in-control distribution for the statistic being charted. Control limits are anchored to a single center line and a single estimate of standard deviation. If the underlying process is truly bimodal, those limits become a compromise that fits neither mode.
Here is what typically goes wrong:
- Inflated false alarms. Points from the lower-mean mode that land below the global lower control limit trigger signals that do not correspond to special causes, just to the routine presence of that mode. Teams chase these and find nothing to fix.
- Masked real shifts. If one mode drifts, the overall average may remain between the two peaks and inside the broad, compromised limits. Rule-of-seven type runs lose power when the center line already splits two populations.
- Distorted capability estimates. Capability indices computed from a blended distribution lie to you. Cp might look acceptable because the combined spread is wide, while Cpk collapses to the worse of the two centers. The numbers swing with the mixture proportion instead of revealing risks specific to each mode.
- Misguided root cause analysis. If the chart multiplies minor supplier or setup differences into constant noise, engineers grow numb to signals. When a genuine out-of-control event arrives, it looks like more noise.
When teams experience these issues, they sometimes respond by tightening rules or adding more tests. That rarely helps. The fix is to honor the dual modes structurally within your SPC.
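The masking effect is easy to demonstrate. The following is a minimal simulation, with hypothetical numbers: two in-control modes at 10 and 14 with unit sigma. A genuine +2-sigma shift in the lower mode stays invisible under blended 3-sigma limits but fires immediately under mode-specific limits.

```python
import random
import statistics

random.seed(42)

# Two in-control modes (hypothetical values): e.g. cold start vs steady-state.
mode_a = [random.gauss(10.0, 1.0) for _ in range(500)]
mode_b = [random.gauss(14.0, 1.0) for _ in range(500)]

def limits_3sigma(data):
    """Individuals-style 3-sigma limits estimated from a sample."""
    mu, sd = statistics.mean(data), statistics.stdev(data)
    return mu - 3 * sd, mu + 3 * sd

def alarm_count(data, lo, hi):
    return sum(1 for x in data if x < lo or x > hi)

# Blended limits: the mean gap inflates the pooled sigma, so the bands
# are wide and centered between the two humps.
b_lo, b_hi = limits_3sigma(mode_a + mode_b)
# Mode-specific limits fit the lower mode on its own.
a_lo, a_hi = limits_3sigma(mode_a)

# A genuine +2-sigma drift in mode A:
shifted_a = [x + 2.0 for x in mode_a]
masked = alarm_count(shifted_a, b_lo, b_hi)   # blended chart barely reacts
caught = alarm_count(shifted_a, a_lo, a_hi)   # mode-specific chart fires
print(masked, caught)
```

Roughly one in six shifted points lands outside the mode-specific limits, while almost none breach the blended bands.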
Clarifying the term: what a bimodal chart means in practice
People use the phrase bimodal chart loosely. In practice, there are three useful interpretations, each applied in a different setting:
- Stratified control charts. You create separate charts, one per mode, with mode-specific centers and limits. This is the cleanest approach when you can label each data point by mode in real time. For example, a chart for supplier A and a chart for supplier B, or a chart for cold start and one for steady-state.
- Conditional control charts. A single chart that dynamically selects the relevant center and limits based on the detected mode. The plot shows a continuous timeline, but the underlying limits switch when the metadata indicates a mode change. This works well when you need one operational view for the line but still need statistical purity.
- Composite charts with mixture-aware limits. A global chart that shows both overall limits and reference lines for subgroups. I use this sparingly, mostly as a communication tool for management when the team cannot avoid mixing modes in daily operations. The chart carries dual visual bands to remind viewers that two stable centers exist.
The right choice depends on whether operators can and will switch context and whether your data pipeline records the mode accurately.
First, prove you have two modes, not heavy tails
Many processes show fat tails or skew rather than two true peaks. The remedy differs. Before changing charts, test the distribution story.
Start simple with visuals. A histogram or kernel density plot across a few weeks, stratified by relevant tags, tells you far more than a single summary statistic. If you see two clear bumps that align with a context variable, you have a real lead.
A practical habit is to overlay stacked density plots by shift, supplier, recipe, gage, and warm-up state. The time you spend on these overlays will save weeks of false SPC work. If you lack clear labels, plot the variable against time of day. Thermal or setup-driven modes often align with early hours of the shift.
Quantitatively, Hartigan’s dip test and Silverman’s test can indicate multimodality, but be careful with interpretation. With small samples, you can get false negatives. With very large samples, minor cosmetic bumps turn significant. Pair the tests with stratified box plots or violin plots and see if the suspected contexts separate cleanly.
Process knowledge must arbitrate. On a filling line with known warm-up effects, you do not need a perfect multimodality test to justify mode-specific limits. Make a small designed data pull, say 30 units in warm-up, 30 in steady-state, and estimate separate means and standard deviations with confidence intervals. If the mean difference exceeds two pooled standard deviations, treat them as distinct modes for SPC purposes.
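That rule of thumb is easy to automate. A sketch, assuming two context-labelled pulls of 30 units each; the simulated values are hypothetical:

```python
import math
import random
import statistics

def distinct_modes(sample_a, sample_b, k=2.0):
    """Flag two labelled samples as distinct SPC modes when the mean
    difference exceeds k pooled standard deviations (k = 2 per the rule
    of thumb above). Returns (is_distinct, separation_in_pooled_sds)."""
    na, nb = len(sample_a), len(sample_b)
    pooled_var = ((na - 1) * statistics.variance(sample_a)
                  + (nb - 1) * statistics.variance(sample_b)) / (na + nb - 2)
    pooled_sd = math.sqrt(pooled_var)
    gap = abs(statistics.mean(sample_a) - statistics.mean(sample_b))
    return gap > k * pooled_sd, gap / pooled_sd

random.seed(1)
warmup = [random.gauss(10.0, 0.4) for _ in range(30)]   # hypothetical pull
steady = [random.gauss(11.2, 0.4) for _ in range(30)]
is_distinct, separation = distinct_modes(warmup, steady)
print(is_distinct, round(separation, 1))
```

With a true gap of three pooled standard deviations, the check clears the two-sigma bar comfortably; the point is to make the decision reproducible rather than eyeballed.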
Building the data spine for dual modes
Bimodal control starts in the database, not in the chart. Without trustworthy labels that identify the mode at the time of measurement, any chart will regress to a blended mess.
You need a set of features that uniquely and repeatably identify the operating state relevant to the variable you monitor. In most implementations, this means adding a few discrete fields to each record in the data lake or historian:
- Mode tag. The categorical stamp that denotes supplier, recipe, tool, gage, warm-up vs steady, or any other driver of the mean shift. It must be defined by a deterministic rule, not operator memory.
- Timestamped state transitions. When did the system enter or exit a mode? Can the line automatically label the next N units after startup as warm-up based on a counter? Can the MES write the supplier code into the lot header and propagate it downstream?
- Sample grouping. If you use X-bar charts, subgroups must not straddle modes. Enforce subgroup integrity in the sampling logic so that the statistic represents one mode at a time.
In one plant, we cut false alarms by 70 percent simply by adding a tool-warm counter that flagged the first 15 cycles after a cold start. That label fed the SPC engine, which then drew from warm-up data for warm-up limits and steady-state data for steady-state limits. Operators did not need to flip any switches. The line knew its mode because the PLC already tracked cycle counts.
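The counter-based labeling is nothing exotic. Here is a sketch of the rule, with hypothetical names and the 15-cycle warm-up window from the plant example:

```python
def label_warmup(cycle_counts, warmup_cycles=15):
    """Deterministic mode tag from a PLC cycle counter: the first
    `warmup_cycles` cycles after a cold start (counter reset to 1) are
    tagged 'warmup', the rest 'steady'. Names are illustrative."""
    return ["warmup" if c <= warmup_cycles else "steady" for c in cycle_counts]

# Counter resets to 1 at each cold start; a second start appears mid-stream.
counts = [1, 2, 14, 15, 16, 17, 200, 1, 2, 3]
tags = label_warmup(counts)
print(tags.count("warmup"), tags.count("steady"))  # → 7 3
```

Because the tag is a pure function of the counter, the SPC engine and any later analysis will always agree on which mode a unit belonged to.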
Once you have stable mode identification, computing separate descriptive statistics is straightforward. Store mode-specific means, variances, and counts, refresh them on a schedule suited to the stability of that mode, and control the influence of outliers through robust estimators when needed.
Choosing the right control chart family for each mode
Shewhart charts still do heavy lifting, even in bimodal contexts, provided you use them one mode at a time. For continuous measurements with near-normal distributions within mode, an X-bar and R chart or an individuals and moving range chart works well. The difference lies in how you maintain the reference parameters.
Several practical points matter:
- Use initial, clean windows for baseline limits. If you suspect both modes are already interleaved in your historical data, carve out pure samples by time or lot to build the first set of limits. Do not back-calculate mode boundaries from the noisy mix.
- Refresh limits with care. Modes with small sample counts need more time to stabilize. You may set a threshold, such as 100 to 200 points per mode, before recalculating. Smaller datasets increase the chance that limits shrink or expand just by chance.
- Consider robust estimators for standard deviation. Within a mode, if mild skew or fat tails remain, the median and median absolute deviation can produce steadier limits. Just be explicit with stakeholders about the method, so your Cp and Cpk align with the estimator.
- Do not forget run rules. Western Electric or Nelson rules retain value, but revisit them in light of data frequency and the known autocorrelation within mode. Warm-up segments, for instance, might have drifting trajectories inside their mode, which changes how you interpret sequences.
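As one concrete option for the robust-estimator bullet, here is a sketch of mode-specific individuals limits built from the median and scaled MAD (1.4826 × MAD estimates sigma for near-normal data); the values are simulated and hypothetical:

```python
import random
import statistics

def robust_limits(data, k=3.0):
    """Mode-specific individuals limits from the median and scaled MAD.
    The 1.4826 factor makes the MAD a consistent sigma estimate for
    normal data, so the bands stay comparable to classic 3-sigma limits."""
    med = statistics.median(data)
    mad = statistics.median(abs(x - med) for x in data)
    sigma = 1.4826 * mad
    return med - k * sigma, med, med + k * sigma

random.seed(7)
steady = [random.gauss(50.0, 0.5) for _ in range(200)]
steady[10] = 58.0  # one gross outlier, e.g. a mislabelled warm-up unit
lo, center, hi = robust_limits(steady)
print(round(lo, 2), round(center, 2), round(hi, 2))
```

The single outlier barely moves the robust limits, whereas a classical standard deviation would have been visibly inflated by it.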
For attributes data, the same logic applies. If defect rates differ by supplier or shift, a p-chart per mode produces cleaner signals than one giant p-chart that masks differences.
If the data within a mode are not normal, avoid forcing transformations on a combined dataset. It is more coherent to transform within mode, if necessary, because the transformation function often depends on the generating mechanism. For example, a log transform may suit warm-up viscosity, while steady-state behaves fine untransformed.
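A quick way to check whether the transform is doing its job within a mode is a crude asymmetry indicator before and after. The simulated lognormal warm-up data below are hypothetical:

```python
import math
import random
import statistics

random.seed(3)

# Hypothetical within-mode transform check: warm-up viscosity is
# right-skewed (simulated here as lognormal), so a log transform is
# applied to that mode only; steady-state data would be left untouched.
warmup = [math.exp(random.gauss(2.0, 0.3)) for _ in range(2000)]
warmup_log = [math.log(x) for x in warmup]

def skew_indicator(data):
    """Crude asymmetry measure: (mean - median) / sd, near 0 if symmetric."""
    return (statistics.mean(data) - statistics.median(data)) / statistics.stdev(data)

print(round(skew_indicator(warmup), 3), round(skew_indicator(warmup_log), 3))
```

The raw warm-up data show clear positive asymmetry; the log-transformed data sit near zero, so only that mode earns a transform before charting.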
Operating a conditional chart in real time
A conditional chart displays a single timeline while its center and limits switch automatically as the mode tag changes. This gives operators one visual surface to watch without toggling tabs. It also prevents a jarring flood of false alarms during predictable mode changes.
Design a conditional chart with a few rules:
- Transitions are first-class events. Show a vertical marker when the mode changes and annotate it with the reason, such as supplier flip or recipe changeover. Operators should not need to guess why limits moved.
- Handle brief excursions. If a line momentarily flips modes for a handful of points, you can end up with a noisy staircase of limit changes. Avoid this by enforcing minimum dwell times, for example, the mode must hold for 10 cycles before limits switch, unless the operator manually confirms an earlier change.
- Lock within-segment rules. Evaluate run rules only within a continuous segment of the same mode. A seven-point run that crosses a mode change is meaningless.
- Preserve history. When you revise limits after a periodic update, keep the previous set available for overlays or back-plotting to assess stability. Changes to the limits should not silently rewrite the meaning of historical signals.
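The dwell-time rule is the part teams most often get wrong in code, so here is a minimal sketch; the mode tags, limit values, and 3-point dwell are hypothetical:

```python
class ConditionalLimits:
    """Sketch of the dwell-time rule: the active limit set only switches
    after the incoming mode tag has held for `dwell` consecutive points,
    so brief tag flickers do not produce a staircase of limit changes."""

    def __init__(self, limits_by_mode, dwell=10):
        self.limits_by_mode = limits_by_mode  # mode -> (lcl, center, ucl)
        self.dwell = dwell
        self.active = None
        self.candidate = None
        self.streak = 0

    def update(self, mode_tag):
        """Feed one point's mode tag; return the currently active limits."""
        if self.active is None:            # first point seeds the active mode
            self.active = mode_tag
        elif mode_tag == self.active:      # back on the active mode: reset
            self.candidate, self.streak = None, 0
        else:
            if mode_tag == self.candidate:
                self.streak += 1
            else:                          # a new challenger starts a streak
                self.candidate, self.streak = mode_tag, 1
            if self.streak >= self.dwell:  # dwell satisfied: switch limits
                self.active = mode_tag
                self.candidate, self.streak = None, 0
        return self.limits_by_mode[self.active]

limits = {"A": (7.9, 8.9, 9.9), "B": (8.2, 9.2, 10.2)}
chart = ConditionalLimits(limits, dwell=3)
stream = ["A", "A", "B", "A", "B", "B", "B", "B"]
centers = [chart.update(m)[1] for m in stream]
print(centers)  # → [8.9, 8.9, 8.9, 8.9, 8.9, 8.9, 9.2, 9.2]
```

Note how the lone "B" at the third point never switches the limits; only the sustained run of three "B" tags does.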
I have adopted a simple visual convention: pale gray bands show inactive mode limits faintly in the background for context, while active mode limits appear in strong color. That helps leaders quickly see that a point sitting near the global average may actually be deep into the active mode’s tail.
Capability, quality costs, and the economics of dual modes
When you separate modes in control, you should separate them in capability and cost analysis as well. Blended capability often encourages the worst management decisions, like forcing a single target that fits neither mode well.
A more rational approach looks like this: estimate Cp and Cpk by mode, then weight the resulting defect probabilities by the forecasted mix. If supplier A runs at Cpk 1.1 and supplier B at 1.6, the composite defect rate depends on their shares. Instead of squeezing both to a single center, negotiate different on-targets or guard bands that minimize overall scrap under realistic mix constraints.
If the modes differ mainly by center and not by spread, two-target control is a powerful lever. Give each mode its own recipe bias or setpoint offset that places its center symmetrically within the tolerance. I have seen a 40 percent reduction in overfill cost from mode-specific targets on fillers, with no capital spend, purely by telling the line to aim higher for low-viscosity product and lower for high-viscosity product while holding the same total variation.
Include logistics in the calculus. Sometimes the cheapest path to better quality is to avoid mixing modes within a batch or lot. If you can schedule supplier A parts into Monday builds and supplier B into Tuesday builds, the SPC gets cleaner, changeover overhead drops, and investigations become crisp. That is not always feasible, but when demand allows, the benefit is real.
Edge cases and failure modes worth planning for
Bimodal design does not eliminate hard problems. It changes them. Several edge cases recur:
- Unknown or drifting mode boundaries. In processes without explicit tags, you may need a statistical classifier to infer mode based on the measured variable or proxies like temperature and speed. That can work, but be cautious: misclassification introduces bias, and if the classifier changes over time with retraining, your control logic may become unstable.
- Overlapping modes. If the two distributions overlap heavily, you still gain value from stratification during analysis and capability studies, but real-time mode-specific control becomes less decisive. In such cases, you may prefer to control on a surrogate that separates modes better, such as a machine-state variable rather than the output characteristic itself.
- Three or more modes. Tool families or multi-supplier networks often produce more than two stable centers. The same principles hold, but you quickly tax operator cognition. If the number of modes exceeds what you can communicate simply, rethink your flow to reduce the practical modes visible to operators, even if analytics still tracks the full set behind the scenes.
- Mode-dependent dynamics. Warm-up is not a static mode but a glide from one center to another. A fixed center and limits for warm-up may be less accurate than a ramping target with a time constant. Consider model-based control in such cases, where the expected mean is a function of cycles since startup.
- Regulatory constraints. In validated environments, new charts and limits require re-approval. Build your rationale with data. Present the false alarm rate under blended control vs separated control across a defined historical window. Auditors respond to transparent, risk-based arguments.
Practical example: dual-supplier component torque
On an assembly line for small gearboxes, final torque after load run-in became the critical characteristic. The team had been chasing unstable Cpk, ranging from 0.9 to 1.4 month to month. Histograms hinted at two peaks, but the SPC software displayed a single individuals chart with wide limits and frequent two-out-of-three tests firing.
A quick stratification by supplier explained most of the variance. Supplier A consistently delivered shafts at the low end of diameter tolerance, Supplier B closer to nominal. The interference fit drove torque. With that context, two distributions appeared: Supplier A units centered around 8.4 N·m, Supplier B around 9.1 N·m, with similar standard deviations near 0.3 N·m. The specification was 8.0 to 10.0 N·m.
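Plugging those stratified numbers into the standard Cpk formula shows why the blended index swung so much: Supplier A alone was hugging the lower limit.

```python
def cpk(mean, sigma, lsl, usl):
    """Cpk = min(USL - mean, mean - LSL) / (3 * sigma)."""
    return min(usl - mean, mean - lsl) / (3.0 * sigma)

# Pre-redesign centers and sigma from the stratification above,
# against the 8.0 to 10.0 N·m specification.
cpk_a = cpk(8.4, 0.3, 8.0, 10.0)
cpk_b = cpk(9.1, 0.3, 8.0, 10.0)
print(round(cpk_a, 2), round(cpk_b, 2))  # → 0.44 1.0
```

The month-to-month Cpk swings of 0.9 to 1.4 were largely the mixture proportion of these two very different suppliers moving around.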

The SPC redesign followed the bimodal pattern:
- The MES began appending the supplier code to each unit's serial record, propagating it to the torque station.
- Two target setpoints were defined in the run-in program: 8.9 N·m for Supplier A and 9.2 N·m for Supplier B, derived to minimize tail risk with 0.3 N·m sigma.
- Two individuals charts ran in a single conditional display. When the station recognized a Supplier A unit, it drew limits centered at 8.9 N·m with 3-sigma bands using the Supplier A sigma estimate. For Supplier B, it switched to the other center and sigma. Limits were locked for eight weeks to accrue stable counts, then updated with a minimum of 200 points per supplier.
Within one quarter, false alarms fell by 65 percent, and scrap tied to low torque dropped by half. The blended Cpk stabilized near 1.35, with Supplier A at 1.28 and Supplier B at 1.45. The team could now see true drifts quickly, such as a gradual rise in sigma for Supplier A when a honing tool wore out. That signal would have drowned in a single blended chart.
Warm-up states: a different flavor of bimodality
Thermal or lubrication warm-up creates a mode that is both distinct and transient. Here the goal is not to control the warm-up distribution itself, but to manage the transition safely and predictably.
I prefer to characterize warm-up as a curve rather than a static cluster. On a molding press, for instance, cavity pressure peaks shift during the first 12 to 20 shots. You can estimate an exponential approach to steady center using historical data and then define a warm-up band that accommodates the expected ramp with reasonable tolerance. The control logic becomes: ignore violations within the warm-up band but alert if the ramp fails to converge by the expected cycle count, or if the band is exited on the wrong side.
Operators appreciate seeing a pale ramp overlay on the chart that shows expected convergence. The station declares steady-state automatically when residuals fall inside steady-state limits for a defined number of consecutive cycles. From that point on, traditional mode-specific rules apply.
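The model-based warm-up logic can be sketched as follows, assuming an exponential approach with hypothetical start, steady-state, and time-constant values:

```python
import math

def expected_mean(cycle, mu_start, mu_steady, tau):
    """Expected center during warm-up: an exponential approach
    mu(n) = mu_steady + (mu_start - mu_steady) * exp(-n / tau)."""
    return mu_steady + (mu_start - mu_steady) * math.exp(-cycle / tau)

def converged(cycle, mu_start, mu_steady, tau, tol):
    """Declare steady-state once the expected ramp is within tol of the
    steady center; a real station would test residuals against the
    fitted curve, not the model alone."""
    return abs(expected_mean(cycle, mu_start, mu_steady, tau) - mu_steady) < tol

# Hypothetical molding-press numbers: ramp from 95 toward 100, tau = 5 shots.
print(converged(2, 95.0, 100.0, 5.0, 0.5))   # → False (still ramping)
print(converged(15, 95.0, 100.0, 5.0, 0.5))  # → True (ramp has settled)
```

The alert condition from the text then becomes: flag the run if `converged` is still false past the expected cycle count, or if observations leave the ramp band on the wrong side.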
Importantly, do not co-mingle warm-up data when updating steady-state limits. Keep a separate store of warm-up statistics for diagnostic use, such as tracking the time constant or the initial offset. Changes there often hint at upstream maintenance issues.
Communication and culture: making dual modes normal
The biggest barrier to adopting bimodal control is not math; it is habit. Many practitioners were trained on a single flavor of chart and feel uneasy when limits jump. The antidote is storytelling with data.
Start by showing a simple bimodal histogram alongside the current control chart. Walk through a few real alarms from the past month, classify them by mode, and show how many led to no-findings. Then overlay mode-specific limits on those dates and show how the false alarms evaporate. Next, point to a drift that the blended chart missed and show how mode-specific monitoring would have caught it earlier.
Document the operational rules in plain language. Spell out when limits switch, how many points are needed before rule tests evaluate, and who owns the mode definitions. Put a small footer on the chart that reminds viewers which mode they are looking at.
Finally, empower operators with quick annotations. When they perform a changeover or swap a gage, they should be able to tag the event in the chart. Those annotations later become the backbone of your improvement narratives.
Software and data considerations without vendor hype
Most commercial SPC tools can support stratified or conditional charts with some configuration, but the data model must expose the mode tags cleanly. If you have control over the data pipeline, invest early in a tidy schema:
- Observations table with measurement, timestamp, unit ID, and mode fields that are not nullable.
- Modes table with definitions, validity windows, and owners.
- Limits table keyed by characteristic and mode, with versioning and effective dates.
That small discipline pays off when auditors ask who decided to change Supplier B limits on March 2, or when your CI team wants to A/B test a new target for the warm-up ramp.
If you run homegrown dashboards, implement the conditional logic server-side so that all viewers see the same mode and limits for a given timestamp. Avoid leaving mode inference to the browser. Cache mode-resolved limits to keep chart performance snappy on the floor.
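A sketch of that server-side resolution, with a hypothetical in-memory stand-in for the versioned limits table:

```python
from datetime import datetime

# Hypothetical rows from the limits table sketched above:
# (characteristic, mode, effective_from, lcl, center, ucl).
LIMITS = [
    ("torque", "supplier_A", datetime(2024, 1, 1), 8.0, 8.9, 9.8),
    ("torque", "supplier_B", datetime(2024, 1, 1), 8.3, 9.2, 10.1),
    ("torque", "supplier_B", datetime(2024, 3, 2), 8.4, 9.2, 10.0),
]

def resolve_limits(characteristic, mode, at):
    """Server-side lookup: the latest limit set whose effective date is
    on or before the observation timestamp, so every viewer sees the
    same limits for a given point in history."""
    candidates = [r for r in LIMITS
                  if r[0] == characteristic and r[1] == mode and r[2] <= at]
    if not candidates:
        raise LookupError(f"no limits for {characteristic}/{mode} at {at}")
    latest = max(candidates, key=lambda r: r[2])
    return latest[3:]  # (lcl, center, ucl)

print(resolve_limits("torque", "supplier_B", datetime(2024, 2, 1)))
print(resolve_limits("torque", "supplier_B", datetime(2024, 4, 1)))
```

Because the resolution is keyed by timestamp, back-plotting a February point always shows the limits that were in force in February, even after the March 2 revision.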
When to avoid bimodal control and focus on simplification
Bimodal SPC is a pragmatic response to complexity, but sometimes the right move is to remove the bimodality itself.
If the second mode stems from gage bias, fix the gage system. If it comes from mixing suppliers with distinct process centers, negotiate tighter incoming tolerances or align targets upstream. If warm-up spans half the shift, invest in preheat or predictive controls that shorten or eliminate the transient.
A rule of thumb I share with teams: if the second mode consumes more than 20 percent of production time and requires separate corrective actions or spares, it deserves either its own line or a process change to fold it back into the primary mode. SPC should not be a bandage for structural fragmentation.
A short field checklist for adopting bimodal SPC
- Verify with stratified plots that the two modes tie to concrete, labelable contexts and show distinct means.
- Add deterministic mode tags to each record, and ensure subgroups do not cross mode boundaries.
- Establish initial mode-specific baselines from clean data windows, and lock limits until minimum counts accrue.
- Choose stratified or conditional charting based on operator needs, and mark transitions visibly.
- Separate capability analysis and targets by mode, then combine defect risk using realistic mix weights.
Keep the list light, revisit it after your first cycle, and refine the rules with operator feedback.
The payoff: cleaner signals, focused action
When you treat dual modes as first-class citizens in your SPC, you buy two valuable outcomes. First, you shrink the fog of false alarms. Operators waste less time explaining normal variation. Second, you gain power to detect the real shifts that matter inside each mode. That is where quality, cost, and delivery improve.
Bimodal control is not a niche technique. It is a common-sense extension of SPC to processes that live in more than one legitimate state. If your histogram looks like a camel, stop forcing it to be a horse. Label the humps, set fair expectations for each, and let your charts tell the truth.