Almost every PV inspection vendor's pitch deck mentions "AI defect detection." The reality on the shop floor is less romantic: great demos, very few stably-running production AI systems. The bottleneck is not the algorithm — ResNet, EfficientNet, Transformer architectures all handle image classification just fine. The bottleneck is the engineering path from algorithm to line.
This article distills MVCreate's experience building production AI into SC-EPL, SC-PLEL-PS, SC-MC-W and SC-Seed, and lays out the full path.
Versus generic vision tasks (cat/dog classification, face recognition), PV defect detection is structurally different:
Common PV defects run to 30+ types: hidden crack, broken finger, dark corner, dark spot, dark edge, concentric ring, snake line, cold solder, over-etch, mismatch, PID, hotspot precursor, moon-crescent mark, dislocation line, fiber anomaly, and more. All need to be recognized.
But distribution is brutally imbalanced — tens of thousands of hidden-crack examples per year versus hundreds of snake-line examples. Vanilla classification loss severely overfits to majority classes under this long tail.
What counts as a "significant hidden crack"? What is a "mild crack, pass"? Even experienced inspectors agree only ~80% of the time on the same image. This is a natural source of label noise.
An AI grader that misses a severely cracked module and lets it ship to the downstream customer incurs rework costs on the order of 100× the inspection cost. This zero-tolerance regime puts extreme pressure on recall — especially recall on severe defects.
The most expensive, most tedious stage. It also sets the ceiling for everything that follows.
Diversity matters. The training set must span:
MVCreate has accumulated 2.5M+ labeled samples across 30+ defect classes over the past five years.
Double-blind labeling with arbitration. Each image is labeled by two independent inspectors; disagreements are resolved by a third-party arbitrator. Final label consistency holds above 95%.
Active learning to cut cost. Not all samples need human labeling. The current model predicts on new data; only the lowest-confidence samples go to human review. Active learning cuts labeling cost to roughly one-tenth of the naive approach.
For fine-grained PV vision, the standard structure is two-stage:
MVCreate's production model uses a customized two-stream architecture:
This design measurably improves recognition of "defects only visible against context" — mild dark spots, subtle edge degradation.
Training strategies:
Line cycle time is a hard constraint. SC-EPL runs at 0.5–2 s per cell; AI inference must fit inside that envelope.
Acceleration stack:
SC-EPL's AI inference latency stabilizes at 200–400 ms, comfortably under the 2-second cycle ceiling.
Two deployment patterns:
Pattern A — local inference. Each station has a GPU; model runs locally. Low latency, data never leaves the facility. Higher hardware cost and harder multi-station updates.
Pattern B — edge server, centralized inference. One GPU server per line, all stations stream images to it. Higher hardware utilization, one-shot updates. Network latency, single point of failure.
MVCreate recommends Pattern B for large lines (>10 stations) and Pattern A for R&D and small lines.
AI models are not trained once and forgotten. Two drift modes hit production:
Monitoring:
The AI system has to keep seeing and learning from new data. MVCreate's loop:
After many years of this work, three lessons stand out:
Effort spent on labeling and data hygiene caps everything else. Months cleaning labels often beats months tuning architecture.
Over-kill can be mopped up by downstream human review. A miss cannot. Every optimization decision must respect severe-defect recall ≥99.5%.
An AI system on a production line cannot be a black box. MVCreate's AI outputs both a verdict and a saliency heatmap — which region, why. Letting inspectors "see what the model is thinking" earns more trust than another 0.5% accuracy would.
Misconception 1 — AI replaces human inspectors. It doesn't. AI handles 90–95% of routine samples; the remaining 5–10% boundary cases still need humans.
Misconception 2 — bigger models are better. On a production line, inference speed is a hard constraint. Real deployed models are typically 5–10× smaller than SOTA.
Misconception 3 — train once, run forever. False. Retrain every 3–6 months or drift silently degrades the model.
Getting AI defect detection from a slide bullet to a stably-running production system means five engineering stages done right: data, algorithms, deployment, monitoring, iteration. MVCreate's AI product line isn't built on a one-shot "big model" — it's built on a full engineering stack spanning collection, labeling, training, deployment, monitoring, and iteration.
To discuss AI-based PV inspection or arrange a demo, reach the MVCreate technical team (+86 159-5048-9233).
Website: www.mvcreate.com
Originally published by Vision Potential / MVCreate.
contact
Be the first to know about our new product launches, latest blog posts and more.
Nanjing Vision Potential Intelligent Technology Co.,Ltd.Established based on the Nanjing Xiangning Artificial Intelligence Research Institute, we have brought together a number of outstanding industry... Any question or request?
Click below, we’ll be happy to assist. contact