Our project is built around ARM Cortex-A7 SoC with no operating system. We expected to run TensorFlow lite for microcontrolles (tflm) on this hardware with models that we had previously tuned on the TensorFlow lite version (tflite). However, it turned out that some of the tflite models are not suitable for tflm. In addition, those models that we can managed to run take too long execution time, most likely due to tflm does not use the hardware capabilities of Cortex-A7 (NEON) and also does not use the NNPACK library (maybe we are wrong?).
Our idea is to run tflite on our system without an operating system (bare metal). Is that possible? And what do you recommend to us?