Skip to content

Benchmark for mobile ARM CPU/GPU #387

@sinwel

Description

@sinwel

Could you please share any existing performance data or insights regarding the following?

  1. CPU Performance (ARMv8-A):
    ◦ What is the expected decoding/encoding throughput (e.g., ms per megapixel) on typical ARM cores (e.g., Cortex-A76/A78/X1)?
    ◦ Are there specific compiler flags (e.g., NEON optimizations, -mcpu tuning) recommended for best performance?
    ◦ How does the floating-point precision (FP32 vs. FP16) impact performance vs. quality trade-offs on mobile chips?
  2. GPU Acceleration:
    ◦ Is there any planned support or existing implementation for GPU-accelerated gain map application (e.g., via OpenGL ES, Vulkan, or OpenCL)?
    ◦ Currently, does the library rely entirely on CPU for tone mapping and gain map blending?
  3. Memory & Latency:
    ◦ For a standard 12MP image, what is the typical end-to-end latency on a mid-range ARM SoC?
    ◦ Are there known memory bandwidth bottlenecks when applying gain maps on mobile devices?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions