How Wine Producers Optimized Product Segmentation with AI-Driven Data Analysis

By harnessing a rich dataset of chemical wine attributes, Scoop’s end-to-end AI pipeline rapidly uncovered the chemical signatures behind misclassifications—enabling teams to dramatically improve wine class prediction accuracy.
Industry: Wine Production Analytics
Job Title: Quality Analyst

For wine producers and quality analysts, precise product classification is essential for product consistency, regulatory compliance, and market positioning. This case explores how, despite having a dataset brimming with detailed chemical measurements, a conventional classification approach left over half of wines incorrectly labeled. Today’s competitive beverage industry demands far more: instant, reliable segmentation aligned to consumers’ expectations and production requirements. This story demonstrates why agentic AI-powered tools like Scoop are fast becoming critical for teams seeking deep, actionable insight from raw laboratory data.

Results + Metrics

Scoop’s AI pipeline surfaced latent structure in the dataset, turning previously opaque misclassifications into actionable insights. It identified not only overall accuracy gaps but precisely where model performance was highest and lowest. By revealing how certain chemical thresholds mapped unequivocally to class assignments, quality teams could intervene with confidence—reworking models, updating labeling standards, or tightening process controls. Standout findings included perfect classification rules for specific wine classes, compositional factors previously overlooked in manual analyses, and the segments showing systemic error. For example, the combination of high proline and alcohol content always predicted Class 1 wines, enabling automated flagging of matches, while the HLH chemical profile proved reliably classifiable and LHH signatures were never correctly predicted under the old model. This level of granularity, impossible to achieve with static dashboards, provided the evidence needed for targeted model iteration and cross-team alignment.

46%

Overall Wine Class Prediction Accuracy

Agentic ML modeling surfaced that only 46% of wine samples were correctly classified under legacy algorithms, highlighting a substantial opportunity for improvement.

54%

Incorrect Classification Rate

More than half of all samples (54%) were misclassified under the legacy approach, undermining confidence in segmentation and quantifying the scope for improvement.

100%

Perfect Rule Accuracy for Proline/Alcohol (Class 1)

Wines with Proline ≥ 760 and Alcohol ≥ 13.05% were always correctly identified as Class 1—enabling this threshold to be used as an automated check.

68%

Classification Accuracy for HLH Chemical Profile

The HLH chemical profile group exhibited a markedly higher model accuracy—68%—versus profiles like LHH, which had 0%, informing targeted process improvements.

179

Total Sample Size Analyzed

Scoop's automated pipeline processed and segmented 179 lab-analyzed wine samples, spanning a broad array of chemical properties for robust pattern mining.

Industry Overview + Problem

Wine producers face complex classification challenges, balancing the subtle chemistry of fermentation and aging with ever-changing market demands for consistency and authenticity. The analyzed dataset, comprising 179 wine samples, captured a diverse array of chemical measurements—ranging from alcohol, phenols, and acids to color intensity and hue. Yet, despite this granular data, traditional modeling yielded disappointing results: 54% of wines were misclassified, undermining confidence in segmentation efforts. Fragmented insights from lab tests, lack of automated feature extraction, and insufficient transparency into misclassification drivers left quality teams second-guessing their labeling protocols and unable to systematically improve predictive accuracy using traditional BI tools. Key questions—Which chemical thresholds truly define wine classes? Where are current classification boundaries failing?—remained unanswered.

Solution: How Scoop Helped

  • Automated Dataset Scanning and Metadata Inference: Scoop instantly ingested the wine chemistry dataset, mapped columns to domain-relevant metrics (e.g., proline, OD ratio, color intensity), and inferred key categorical variables, removing the need for manual data wrangling. This supported a rapid understanding of the data's full structure and potential for segmentation.

  • Feature Engineering and Enrichment: Agentic AI routines derived new features such as the flavanoid/non-flavanoid ratio and chemical profile groupings (e.g., HLH, HHL), automatically surfacing the variables most likely to impact classification. This enrichment revealed compositional thresholds that had previously been invisible to analysts (a sketch of this kind of derivation appears after this list).

  • KPI and Slide Generation: Scoop constructed dashboards highlighting class distributions, segmentation boundaries, and accuracy metrics (e.g., bar and pie charts for class breakdowns, KPIs for prediction success rates). This step crystallized the underperformance of historic classification, flagging the urgent need for algorithmic improvements.

  • Agentic ML Modeling for Rule Discovery: Rather than applying a generic black-box model, Scoop’s agentic AI built transparent rulesets revealing the exact combinations of features and thresholds associated with perfect or failed prediction. Notably, it exposed clear, high-impact splits—such as Proline ≥ 760 with Alcohol ≥ 13.05% for Class 1, or Flavanoids ≤ 1.39 plus Color Intensity ≥ 4 for Class 3—surfacing new opportunities for hand-crafted model refinement or automated downstream corrections (these rules are applied in the second sketch after this list).

  • Pattern and Error Analysis: The pipeline not only reported the overall accuracy score (46%), but also drilled down into how accuracy varied by chemical signature (e.g., HLH profiles at 68% versus LHH at 0%), showing where model confidence could be safely increased and where the data warranted further review.

  • Narrative Synthesis for Decision Support: Through natural language narrative, Scoop provided domain-grounded, actionable explanations—distilling how core chemical measurements mapped to class boundaries, and indicating next areas for model or process changes.
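
To make the feature-engineering step concrete, the following sketch (in Python with pandas) derives a flavanoid/non-flavanoid ratio and a three-letter chemical profile code. It is a minimal illustration, not Scoop's implementation: the column names (flavanoids, nonflavanoid_phenols, alcohol, color_intensity, proline) and the convention that H and L mark values above or below a column's median are assumptions, since the case study does not define how the HLH/HHL groupings are constructed.

```python
import pandas as pd

def engineer_features(df: pd.DataFrame) -> pd.DataFrame:
    """Derive illustrative features like those described in this case study.

    Assumed columns: flavanoids, nonflavanoid_phenols, alcohol,
    color_intensity, proline. The H/L convention (above or below the
    column median) is a hypothetical stand-in for Scoop's actual groupings.
    """
    out = df.copy()

    # Ratio of flavanoid to non-flavanoid phenols; zero denominators become NaN.
    denom = out["nonflavanoid_phenols"].where(out["nonflavanoid_phenols"] != 0)
    out["flavanoid_ratio"] = out["flavanoids"] / denom

    # Three-letter profile code such as "HLH": one letter per attribute,
    # "H" if the sample sits above that column's median, otherwise "L".
    letters = [
        out[col].gt(out[col].median()).map({True: "H", False: "L"})
        for col in ["alcohol", "color_intensity", "proline"]
    ]
    out["chem_profile"] = letters[0] + letters[1] + letters[2]
    return out
```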
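Similarly, the rule-discovery and error-analysis bullets above can be illustrated with a short evaluation sketch. The thresholds are the ones quoted in this case study; the column names true_class, predicted_class, and chem_profile (from the previous sketch) are assumptions for illustration rather than Scoop's actual schema.

```python
import pandas as pd

def evaluate_rules(df: pd.DataFrame) -> dict:
    """Check how often each quoted threshold rule fires and how accurate it is.

    Assumed columns: proline, alcohol, flavanoids, color_intensity, true_class.
    The thresholds are the ones reported in this case study.
    """
    rules = {
        # Proline >= 760 and Alcohol >= 13.05% -> Class 1 (reported as 100% accurate).
        "class_1_rule": ((df["proline"] >= 760) & (df["alcohol"] >= 13.05), 1),
        # Flavanoids <= 1.39 and Color Intensity >= 4 -> Class 3.
        "class_3_rule": ((df["flavanoids"] <= 1.39) & (df["color_intensity"] >= 4), 3),
    }
    report = {}
    for name, (mask, predicted_class) in rules.items():
        matched = df[mask]
        accuracy = (matched["true_class"] == predicted_class).mean() if len(matched) else float("nan")
        report[name] = {"n_matched": int(len(matched)), "accuracy": accuracy}
    return report

def accuracy_by_profile(df: pd.DataFrame) -> pd.Series:
    """Per-profile prediction accuracy (e.g. HLH vs. LHH), assuming a
    chem_profile column like the one derived above plus a predicted_class column."""
    correct = df["predicted_class"] == df["true_class"]
    return correct.groupby(df["chem_profile"]).mean()
```

Keeping the rules as explicit boolean masks, rather than weights inside a black-box model, is what makes the resulting accuracy report directly auditable by quality teams.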

Deeper Dive: Patterns Uncovered

Traditional dashboards and static BI approaches would have missed the decisive influence of specific compound thresholds and their ramifications for model architecture. Scoop’s agentic ML unraveled how certain rules delivered perfect accuracy for major wine classes—such as low flavanoids with high color intensity defining Class 3 (46 out of 46 samples correctly classified), and high proline with elevated alcohol content marking Class 1. The analysis also revealed that nearly all incorrectly classified wines shared underlying chemical profiles (notably LHH), pointing to systematic model blind spots. Phenolic richness and color categories, for instance, proved predictable from total phenols and color intensity alone, in some cases with 100% model accuracy—insights that would otherwise require custom statistical investigation or domain expertise to approximate manually. Furthermore, Scoop highlighted that the largest source of predictive error stemmed not from 'noisy' data but from missed inflection points in chemical attribute thresholds. It pinpointed that medium-color wines, comprising 50% of the dataset, formed a homogeneous group amenable to single-factor classification. Such depth, nuance, and actionable direction cannot be replicated with conventional self-serve BI alone.
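
The single-factor classification of medium-color wines mentioned above can be sketched as a simple binning of color intensity. The cut points used here are placeholders, since the case study does not publish the actual category boundaries.

```python
import pandas as pd

def color_band(df: pd.DataFrame, low: float = 3.0, high: float = 6.0) -> pd.Series:
    """Single-factor color categorization from color_intensity alone.

    The cut points (3.0 and 6.0) are hypothetical placeholders; the case study
    reports only that a medium band covers roughly half of the samples.
    """
    return pd.cut(
        df["color_intensity"],
        bins=[-float("inf"), low, high, float("inf")],
        labels=["low", "medium", "high"],
    )
```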

Outcomes & Next Steps

Guided by Scoop’s findings, the analytics team realigned their feature selection and model architecture—prioritizing OD ratio, color intensity, flavanoids, and proline as core discriminators in subsequent classification tasks. Rule-based checks for well-defined chemical thresholds are now embedded directly into QC procedures, enabling rapid, automated flagging of outlier or confidently classifiable samples. Plans are underway to enrich the models with additional chemical variables and to test alternative segmentation boundaries, informed by Scoop’s transparent explanations. The clarity around which rules yield perfect accuracy empowers both data science and product teams to iterate models faster and target retraining efforts only where benefit is demonstrable.
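
As an illustration of how such rule-based checks might be embedded in a QC procedure, the sketch below flags individual samples using the two high-confidence rules quoted earlier. The measurement keys and disposition labels are hypothetical, not part of Scoop's or the producer's actual workflow.

```python
def qc_flag(sample: dict) -> str:
    """Return a QC disposition for one lab-analyzed sample (a dict of measurements).

    The two high-confidence rules quoted in this case study are applied first;
    anything else is routed to manual review. Key names are illustrative assumptions.
    """
    if sample["proline"] >= 760 and sample["alcohol"] >= 13.05:
        return "auto_class_1"   # matches the 100%-accurate Class 1 rule
    if sample["flavanoids"] <= 1.39 and sample["color_intensity"] >= 4:
        return "auto_class_3"   # matches the Class 3 rule
    return "needs_review"       # no high-confidence rule fired
```

Under these assumptions, a sample with proline 900 and alcohol 13.4% would be auto-flagged as Class 1 regardless of its remaining measurements, while anything outside the stated thresholds would be routed to an analyst.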