AI Video Editing for Mobile App

NOVASOLUTIONS.TECHNOLOGY is engaged in the development, support and maintenance of iOS, Android, PWA mobile applications. We have extensive experience and expertise in publishing mobile applications in popular markets like Google Play, App Store, Amazon, AppGallery and others.
Development and support of all types of mobile applications:
Information and entertainment mobile applications
News apps, games, reference guides, online catalogs, weather apps, fitness and health apps, travel apps, educational apps, social networks and messengers, quizzes, blogs and podcasts, forums, aggregators
E-commerce mobile applications
Online stores, B2B apps, marketplaces, online exchanges, cashback services, exchanges, dropshipping platforms, loyalty programs, food and goods delivery, payment systems.
Business process management mobile applications
CRM systems, ERP systems, project management, sales team tools, financial management, production management, logistics and delivery management, HR management, data monitoring systems
Electronic services mobile applications
Classified ads platforms, online schools, online cinemas, electronic service platforms, cashback platforms, video hosting, thematic portals, online booking and scheduling platforms, online trading platforms

These are just some of the types of mobile applications we work with, and each of them may have its own specific features and functionality, tailored to the specific needs and goals of the client.

Showing 1 of 1 servicesAll 1735 services
AI Video Editing for Mobile App
Complex
~1-2 weeks
FAQ
Our competencies:
Development stages
Latest works
  • image_mobile-applications_feedme_467_0.webp
    Development of a mobile application for FEEDME
    761
  • image_mobile-applications_xoomer_471_0.webp
    Development of a mobile application for XOOMER
    649
  • image_mobile-applications_rhl_428_0.webp
    Development of a mobile application for RHL
    1071
  • image_mobile-applications_zippy_411_0.webp
    Development of a mobile application for ZIPPY
    947
  • image_mobile-applications_affhome_429_0.webp
    Development of a mobile application for Affhome
    884
  • image_mobile-applications_flavors_409_0.webp
    Development of a mobile application for the FLAVORS company
    466

Implementing AI Video Editing in a Mobile App

AI video editing — broad term breaking into separate functions: automatic pause/filler removal, background replacement, color correction by reference, smart cropping, auto-highlight cuts. Each requires different technical solution.

Auto-cut pauses and filler words

Perhaps most demanded feature for content creators. User records conversational video, AI removes pauses >0.5 sec and filler words like "um", "like", "you know".

Pipeline:

  1. Transcribe audio via Whisper API (OpenAI) or Deepgram
  2. Get JSON with timestamps per word
  3. Find pauses and filler-list words
  4. Generate FFmpeg cut-list and assemble final video
# Backend: generate FFmpeg filter from Whisper transcript
def build_cut_filter(transcript_words, pause_threshold=0.5, filler_words=None):
    filler_words = filler_words or {"um", "like", "you know", "basically"}
    segments_to_keep = []
    prev_end = 0.0

    for i, word in enumerate(transcript_words):
        gap = word["start"] - prev_end
        if gap > pause_threshold:
            # Pause — skip
            pass
        if word["word"].lower().strip(".,!?") in filler_words:
            continue
        segments_to_keep.append((word["start"], word["end"]))
        prev_end = word["end"]

    # Convert to FFmpeg select/aselect filter
    filter_parts = "+".join(
        f"between(t,{s},{e})" for s, e in merge_segments(segments_to_keep, gap=0.05)
    )
    return f"select='{filter_parts}',setpts=N/FRAME_RATE/TB"

Whisper word_timestamps=True gives ±20 ms accuracy — sufficient for smooth cuts. On mobile: upload video, run backend task, get processed file. Play processed via AVPlayer/ExoPlayer.

Background replacement in video

Trickier than it sounds. Static camera — portrait segmentation via MediaPipe Selfie Segmentation (30 fps real-time on modern devices). Moving camera with objects — SAM 2 (Segment Anything Model 2) via API.

MediaPipe on Android:

val options = ImageSegmenterOptions.builder()
    .setBaseOptions(BaseOptions.builder().useGpu().build())
    .setOutputCategoryMask(false)
    .setOutputConfidenceMasks(true)
    .build()
val segmenter = ImageSegmenter.createFromOptions(context, options)

Result — confidence mask 0..1. Apply to each frame via Metal/Vulkan, replace background with static image or video. At 1080p 30fps — requires GPU rendering, CPU can't handle.

Server processing (SAM 2): best quality for complex scenes, but 2–5 minutes per video minute even on A100.

Smart crop (Auto Reframe)

Converting 16:9 to 9:16 with smart crop — object tracking task. Adobe Premiere calls it Auto Reframe. On mobile implement via:

  • Face detection per keyframe (every 0.5 sec) — VNDetectFaceRectanglesRequest
  • Build subject motion trajectory
  • Smooth pan with EasingCurve (no harsh jump cuts)
  • FFmpeg crop filter with dynamic params
# FFmpeg: crop with motion (x changes 0 to 540 over 10 sec)
ffmpeg -i input.mp4 \
  -vf "crop=608:1080:'min(max(cx-304,0),672)':0" \
  -c:v libx264 output_9x16.mp4

Where cx — subject x-coordinate from tracking data. Implementation: Python script on backend generates FFmpeg command with params, executes, returns result.

AI color correction

By reference photo — CinematicLUT via Core Image on iOS (100 ms per frame). By text description ("make it like golden hour morning") — call backend for LUT-generation via Stable Diffusion + ControlNet Color Pipeline.

Timeline

Pause/filler removal + playback UI — 5–7 days. Multi-function (reframe, background, color correction) with FFmpeg backend integration — 3–4 weeks. Performant real-time background removal — additional 2 weeks.