CUDA FFM – 50-70x faster training

Last Updated on: 13th June 2024, 08:17 pm

Field-aware Factorization Machines on CUDA.

Today we’re open-sourcing CUDA FFM – our tool for very fast FFM training and inference.

You can expect 50-70x speed up in training (comparing to the CPU implementation) and 5-10x speed up in inference (compaing to non-AVX-optimized implementation).

What’s inside?

very fast FFM trainer that trains FFM model using GPU
- very fast FFM prediction C++ library (using CPU)
Java bindings to that library (via JNI)
few dataset management utils (splitting, shuffling, conversion)

Field-aware Factorization Machines (FFM) is a machine learning model described by the following equation:

Checkout out the original paper for the details or our README for a quick summary.

CUDA FFM – 50-70x faster training

Field-aware Factorization Machines on CUDA.

More in Machine Learning

Model Explainer

Large language models in recommendation systems

FastEmbedding

Popular Tags

Popular Search

Field-aware Factorization Machines on CUDA.

More in Machine Learning

Model Explainer

Large language models in recommendation systems

FastEmbedding

Latest Posts

Breaking Down the Bidding Service Monolith into Microservices

Transitioning to Capacity-Based Pricing in Google BigQuery

Near Real-Time Document Categorization with Apache Solr and RTB House Percolator Plugin

Popular Tags

Popular Search