Object Detection Implementation in Mobile Applications
Object detection on mobile isn't just "find and draw a box." It also means frame-to-frame tracking, correctly projecting bounding boxes onto the preview layer, handling overlapping detections, and sustaining performance on 30 FPS video. The last point often becomes the bottleneck.
Model Selection: YOLO vs SSD vs NanoDet
Three main families work on mobile:
- MobileNet SSD — the classic, with excellent TFLite Task Library and ML Kit support. On a Pixel 7: 18–25 ms at 320×320 input. COCO mAP: ~23–27.
- YOLOv8n/YOLOv5n — the best accuracy/speed balance as of 2024. After TFLite or Core ML conversion: 22–40 ms depending on input size. COCO mAP: 37+.
- NanoDet — for genuinely low-end devices; under 10 ms on a Snapdragon 665.
For real-time video on modern Android flagships, use YOLOv8n with GPU delegate. For offline photos across a wide device range, use MobileNet SSD v2.
Bounding Box: Projection to Camera
The most common visual bug: the bounding box doesn't line up with the object in the preview. The cause: the model receives a resized input (e.g., 320×320), while the camera preview is 1920×1080 displayed with AspectFill or AspectFit. Recalculate coordinates accounting for both the scale factor and the crop/letterbox offsets.
On iOS with AVCaptureVideoPreviewLayer:
let converted = previewLayer.layerRectConverted(fromMetadataOutputRect: normalizedRect)
VNDetectedObjectObservation returns boundingBox in normalized coordinates (0..1, with the origin at the bottom-left). Before projecting to UIKit coordinates, flip the Y-axis: CGRect(x: box.minX, y: 1 - box.maxY, width: box.width, height: box.height).
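The flip is plain arithmetic; a minimal language-agnostic sketch (Python here, and the helper name flip_y_normalized is made up for illustration):

```python
def flip_y_normalized(x, y, w, h):
    """Convert a Vision-style normalized rect (origin bottom-left,
    Y pointing up) to a UIKit-style normalized rect (origin top-left,
    Y pointing down). All values are in 0..1."""
    return (x, 1.0 - (y + h), w, h)

# A box whose bottom-left corner is at (0.25, 0.25), 0.5 wide, 0.25 tall:
print(flip_y_normalized(0.25, 0.25, 0.5, 0.25))  # (0.25, 0.5, 0.5, 0.25)
```

Note the function is its own inverse: applying it twice returns the original rect, which is a convenient sanity check in unit tests.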
On Android with CameraX + ImageAnalysis: detection results are in input-image coordinates, while the preview lives in PreviewView coordinates. Use the coordinate-mapping helpers from the ML Kit quickstart samples, or compute the transformation manually with a Matrix.
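The manual transformation is the same math on both platforms: scale by the max (fill) or min (fit) of the two axis ratios, then center. A sketch in Python (the function name image_to_view_rect is hypothetical):

```python
def image_to_view_rect(box, img_w, img_h, view_w, view_h, mode="fill"):
    """Map a rect (x, y, w, h) from image pixel coordinates into view
    coordinates, mimicking AspectFill ("fill": image scaled up and
    cropped) or AspectFit ("fit": image letterboxed with bars)."""
    sx, sy = view_w / img_w, view_h / img_h
    s = max(sx, sy) if mode == "fill" else min(sx, sy)
    # Offsets center the scaled image in the view: negative for fill
    # (the image overflows the view), positive for fit (empty bars).
    dx = (view_w - img_w * s) / 2
    dy = (view_h - img_h * s) / 2
    x, y, w, h = box
    return (x * s + dx, y * s + dy, w * s, h * s)

# A square 100×100 frame shown in a 200×100 view:
print(image_to_view_rect((0, 0, 100, 100), 100, 100, 200, 100, "fit"))
# → (50.0, 0.0, 100.0, 100.0): centered with 50 px bars on each side
print(image_to_view_rect((0, 0, 100, 100), 100, 100, 200, 100, "fill"))
# → (0.0, -50.0, 200.0, 200.0): scaled to width, cropped vertically
```

Boxes landing partially outside the view under "fill" is expected; clamp them to the view bounds before drawing.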
Frame-to-Frame Tracking
Detecting on every frame is expensive. The correct approach: run the detector every N frames (typically every 5–10) and track in between, using SORT or ByteTrack, or iOS's built-in VNTrackObjectRequest seeded with a VNDetectedObjectObservation.
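The every-N-frames scheme needs an association step to keep IDs stable across detector runs. A minimal greedy-IoU sketch in Python (the class name IouTracker is made up; this omits the Kalman motion model that full SORT/ByteTrack add on top):

```python
def iou(a, b):
    """Intersection-over-union of two (x, y, w, h) boxes."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    x1, y1 = max(ax, bx), max(ay, by)
    x2, y2 = min(ax + aw, bx + bw), min(ay + ah, by + bh)
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    return inter / (aw * ah + bw * bh - inter) if inter else 0.0

class IouTracker:
    """Greedy IoU association: each new detection inherits the ID of
    the best-overlapping existing track, or gets a fresh ID."""
    def __init__(self, iou_thresh=0.3):
        self.tracks = {}      # track id -> last known box
        self.next_id = 0
        self.iou_thresh = iou_thresh

    def update(self, detections):
        new_tracks, used = {}, set()
        for det in detections:
            best_id, best_iou = None, self.iou_thresh
            for tid, box in self.tracks.items():
                if tid in used:
                    continue
                overlap = iou(det, box)
                if overlap > best_iou:
                    best_id, best_iou = tid, overlap
            if best_id is None:           # no match: new object enters
                best_id = self.next_id
                self.next_id += 1
            used.add(best_id)
            new_tracks[best_id] = det
        self.tracks = new_tracks          # unmatched old tracks expire
        return new_tracks
```

Between detector runs you would carry the last known boxes forward (optionally advanced by optical flow); a production tracker also keeps unmatched tracks alive for a few frames before dropping them, to survive brief occlusions.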
ML Kit Object Detection & Tracking supports tracking out of the box (in STREAM_MODE) via .enableMultipleObjects() and .enableClassification(). Each tracked object gets a stable trackingId, which lets you display per-object info without flicker when an object is briefly lost and reappears.
NMS (Non-Maximum Suppression) matters. The default iouThreshold is 0.5. If distinct objects overlap heavily in the frame (e.g., tightly packed goods on a conveyor), raise it to 0.6–0.7; with a low threshold, NMS suppresses the lower-scoring of two overlapping boxes and "glues" adjacent objects into a single detection. The trade-off of a higher threshold is more duplicate boxes per object, so tune it on real footage.
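Greedy NMS itself is a few lines; the Python sketch below shows the threshold's direction of effect on two adjacent objects (boxes are (x, y, w, h)):

```python
def nms(boxes, scores, iou_threshold=0.5):
    """Greedy NMS: walk boxes by descending score, keep a box only if
    its IoU with every already-kept box is <= iou_threshold.
    Returns indices of kept boxes."""
    def iou(a, b):
        x1, y1 = max(a[0], b[0]), max(a[1], b[1])
        x2 = min(a[0] + a[2], b[0] + b[2])
        y2 = min(a[1] + a[3], b[1] + b[3])
        inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
        return inter / (a[2] * a[3] + b[2] * b[3] - inter) if inter else 0.0

    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep

# Box 1 is a distinct neighbor (IoU ~0.43 with box 0);
# box 2 is a duplicate of box 0 (IoU ~0.82).
boxes = [(0, 0, 10, 10), (4, 0, 10, 10), (0.5, 0.5, 10, 10)]
scores = [0.9, 0.85, 0.8]
print(nms(boxes, scores, 0.5))  # [0, 1]: duplicate dropped, neighbor kept
print(nms(boxes, scores, 0.3))  # [0]: neighbor suppressed too — objects merged
```

This is why a threshold that is too low loses overlapping objects: any neighbor whose IoU exceeds it is treated as a duplicate.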
Real Case Study
A queue people-counting app using a static camera (tablet on a stand). YOLOv8n, TFLite, GPU delegate, Android 11+. Problem: in dense queues (>8 people) the detector missed people in the middle of the group, where box overlap exceeded 60%. Solution: relaxed NMS (raised the IoU threshold from the default 0.5 so boxes of overlapping people were no longer suppressed as duplicates) and lowered minDetectionConfidence from 0.5 to 0.4. Missed detections dropped from 31% to 9%. Additionally, the model was fine-tuned on frames with heavy overlap using a Roboflow-labeled dataset.
Timeline
Integrating a detection model with preview projection and NMS tuning: 1–2 weeks. Fine-tuning on custom classes plus integration: 2–3 weeks. Cost is estimated individually.