Deep neural networks achieve outstanding results in challenging image
classification tasks. However, the design of network topologies is a complex
task and the research community makes a constant effort in discovering
top-accuracy topologies, either manually or employing expensive architecture
searches. In this work, we propose a unique narrow-space architecture search
that focuses on delivering low-cost and fast executing networks that respect
strict memory and time requirements typical of Internet-of-Things (IoT)
near-sensor computing platforms. Our approach provides solutions with
classification latencies below 10ms running on a 35devicewith1GBRAMand5.6GFLOPSpeakperformance.Thenarrow−spacesearchoffloating−pointmodelsimprovestheaccuracyonCIFAR10ofanestablishedIoTmodelfrom70.6474.87to82.0783.45thebestofourknowledge,wearethefirstthatempiricallydemonstrateonover3000trainedmodelsthatrunningwithreducedprecisionpushestheParetooptimalfrontbyawidemargin.Underagivenmemoryconstraint,accuracyisimprovedbyover7thebestmodelindividualformat.