Google Launches AI-Powered Malicious File Detection Tool Magika


Google has introduced Magika, an AI-powered file-type identification system that assists in detecting binary and textual file formats.

Even on a CPU, Magika’s custom, highly optimized deep-learning model identifies file types accurately in a matter of milliseconds.

Internally, Google uses Magika at scale to route Gmail, Drive, and Safe Browsing files to the appropriate security and content policy scanners, which helps keep Google users safe.
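To make the routing idea concrete, here is a minimal sketch of how a detected file-type label could select a scanner. It is not Google's pipeline: the scanner functions, the ROUTES table, and the label strings are all hypothetical names chosen for illustration.

```python
from typing import Callable, Dict

# Hypothetical scanners: each one only reports which scanner would run.
def scan_document(data: bytes) -> str:
    return "document-scanner"

def scan_script(data: bytes) -> str:
    return "script-scanner"

def scan_generic(data: bytes) -> str:
    return "generic-scanner"

# Hypothetical mapping from a detected content-type label to a scanner.
ROUTES: Dict[str, Callable[[bytes], str]] = {
    "pdf": scan_document,
    "docx": scan_document,
    "javascript": scan_script,
    "powershell": scan_script,
}

def route(label: str, data: bytes) -> str:
    """Send the file to the scanner registered for its label.

    Labels without a dedicated scanner fall back to the generic one.
    """
    scanner = ROUTES.get(label, scan_generic)
    return scanner(data)
```

The better the type detection, the fewer files end up in the generic fallback, which is the effect Google describes when it says more files reach its specialized scanners.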

“Magika uses a custom, highly optimized deep-learning model designed and trained using Keras that only weighs about 1MB. At inference time Magika uses Open Neural Network Exchange (Onnx) as an inference engine to ensure files are identified in a matter of milliseconds, almost as fast as a non-AI tool even on CPU”, Google said.

Magika’s Benefits

When tested on a benchmark of 1M files covering more than 100 file types, Magika outperforms rival tools by roughly 20% in identification accuracy, which Google attributes to its AI model and large training dataset.

Magika detection quality compared to other tools on 1M files benchmark

Broken down by file type, the gains are even larger on textual files, such as code and configuration files, which other tools often struggle with.

Performance of a variety of file-type identification tools

“In particular, this increase in accuracy allows us to scan 11% more files with our specialized malicious AI document scanners and reduce the number of unidentified files to 3%”, Google said.

Google also notes that, compared with its previous system, which relied on handcrafted rules, Magika improves file-type identification accuracy by 50%, measured over a weekly average of hundreds of billions of files.

How To Install?

The Magika command line tool identifies the types of a diverse set of files.

Magika command line tool used to identify the type of a diverse set of files

You can try the Magika web demo, or install Magika as a Python library and standalone command line tool with the standard command: pip install magika
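Once installed, the library can be called from Python. The following is a minimal sketch, assuming the package exposes a Magika class with an identify_bytes method. The attribute that holds the detected label has differed between releases (ct_label in earlier ones, label in later ones), so the helper label_of, which is our own name and not part of Magika, checks both. Verify the details against the documentation for the version you install.

```python
def label_of(result) -> str:
    """Return the detected content-type label from a Magika result.

    Assumption: the result has an `output` object whose label is stored
    as `label` (newer releases) or `ct_label` (older releases).
    """
    output = result.output
    label = getattr(output, "label", None)
    if label is None:
        label = getattr(output, "ct_label")
    return str(label)


def identify(data: bytes) -> str:
    """Identify the content type of raw bytes with Magika."""
    # Imported here so this module loads even where magika is not installed.
    from magika import Magika  # requires: pip install magika

    detector = Magika()
    return label_of(detector.identify_bytes(data))
```

For example, identify(b"def main():\n    print('hello')\n") should return a label for Python source code, although the exact string depends on the model version. In real use, create the Magika object once and reuse it for every file, since constructing it loads the model.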

Magika’s code and model are freely available starting today on GitHub under the Apache 2.0 license. An experimental npm package is also available for those who want to use the TFJS version.

Magika’s planned integration with VirusTotal will complement the platform’s existing Code Insight feature, which uses Google’s generative AI to analyze and detect malicious code.

Additionally, Magika will pre-filter files before Code Insight analyzes them, increasing the accuracy and efficiency of the platform.

Because of VirusTotal’s collaborative nature, this integration directly supports a safer digital environment by contributing to the worldwide cybersecurity ecosystem.
