Molina R., Loor F., Gil-Costa V., Nardini F. M., Perego R., Trani S.
Ranking; FPGA; Artificial Intelligence; Theoretical Computer Science; Hardware and Architecture; Software; Computer Networks and Communications
System-on-Chip (SoC)-based Field-Programmable Gate Arrays (FPGAs) provide a hardware acceleration technology that can be rapidly deployed and tuned, offering a flexible solution adaptable to specific design requirements and to changing demands. In this paper, we present three SoC architecture designs for speeding up inference tasks based on machine-learned ensembles of decision trees. We focus on QuickScorer, the state-of-the-art algorithm for the efficient traversal of tree ensembles, and present the issues and advantages related to its deployment on two SoC devices with different capacities. The results of experiments conducted on publicly available datasets show that the proposed solution is very efficient and scalable. More importantly, it provides almost constant inference times, independently of the number of trees in the model and the number of instances to score. This allows the deployed SoC solution to be fine-tuned on the basis of the accuracy and latency constraints of the application scenario considered.
Source: Journal of parallel and distributed computing (Print) 155 (2021): 38–49. doi:10.1016/j.jpdc.2021.04.008
Publisher: Academic Press, New York, N.Y., United States of America
@article{oai:it.cnr:prodotti:458030, title = {Efficient traversal of decision tree ensembles with FPGAs}, author = {Molina R. and Loor F. and Gil-Costa V. and Nardini F. M. and Perego R. and Trani S.}, publisher = {Academic Press, New York, N.Y., United States of America}, doi = {10.1016/j.jpdc.2021.04.008}, journal = {Journal of parallel and distributed computing (Print)}, volume = {155}, pages = {38–49}, year = {2021} }