By clicking “Accept”, you agree to the storing of cookies on your device to enhance site navigation, analyze site usage, and assist in our marketing efforts. View our Privacy Policy for more information.

Reviewing Malware with LLMs: OpenAI vs. Vertex AI

At Endor Labs, we continue evaluating the use of large language models (LLMs) for all kinds of use-cases related to application security. And we continue to be amazed about high-quality responses … until we’re amused about the next laughably wrong answer.

Open Report

View Report

Written by

Henrik Plate

Published on

June 5, 2023

Topics

Security

SCA

Open Source

Open Report

View Report

The risk assessment of potentially malicious code snippets is a use case that is particularly well suited for evaluating and comparing the performance of LLMs, the reason being that the question whether a given piece of code is malicious or not can (often) be answered without ambiguity. Admittedly, it can be tedious for malware analysts to untangle minified and obfuscated code, but the assessment typically concludes with a clear answer. This is different from other evaluation techniques, e.g. ones based on preferences expressed by humans or GPT-4, which seem much more subjective.

Compared to our last blog post, we improved the LLM-assisted review in a couple of ways: The removal of comments in suspicious code snippets reduces the exposure to prompt injection (more on that later), and instead of asking the LLM for a binary classification, we ask it to respond with a risk score on a scale between 0 and 9, from little to highly suspicious. We also increased the size of the context, which additionally benefited from the comment removals.

We also got new company in the form of a second LLM assistant, this time from Google, i.e. every suspicious code snippet is not only presented to GPT models from OpenAI, but also to the models available on Google’s Vertex AI platform. Both of them receive the same prompt and code snippets, and their temperature and related configuration parameters are set such that responses are as reproducible as possible.

OpenAI vs. Vertex AI

Obviously, we wanted to know how both LLMs compare against each other. To this end, the following histogram shows the differences of risk scores provided by gpt-3.5-turbo from OpenAI and text-bison from Vertex AI for the same code snippet. If, for example, gpt-3.5-turbo comes up with a risk score of 2 and text-bison with a risk score of 5 for the same file “index.js”, that difference is counted in bin 3.

As visible from the histogram, both LLMs agree in the majority of cases: In 488 out of 1098 assessments of the same code snippet, both models came up with the exact same risk score, in another 514 cases, the risk score differed only by one point. What’s more interesting are cases where the models’ assessment differs more significantly, three of which will be presented below.

‍

Example #1: techghoshal123

Listing 1 is an example where the risk score differed by 3 points. It shows the file “index.js” from the malicious npm package https://registry.npmjs.org/techghoshal123/1.7.0 (which has been removed following our notification of npm). This file has been called by a pre-install hook upon package installation, and exfiltrates sensitive information such as the user and hostname to burpcollaborator.net.

Listing 1:

‍

Despite the fact that the author did not make any effort to make the code look benign or hide it between legitimate code, the two LLMs come to diverging conclusions: On a scale from 0 to 9, gpt-3.5-turbo gave it a 2, and thus can be considered a false-negative. The model text-bison gave it a 5 (please find the complete answers below). They both properly described the behavior, however, did not conclude that it is malicious.

gpt-3.5-turbo

Risk score 2: The code appears to be sending a POST request to a remote server using the https module. The request contains tracking data in the form of a JSON object that includes information about the package, directory, hostname, user, DNS servers, and package version. The tracking data is then stringified and sent as a message in the POST request. While this code does not appear to be malicious, it is possible that the tracking data could be used for nefarious purposes if it falls into the wrong hands. Developers should be cautious about sending sensitive information in this manner and ensure that the remote server is trustworthy.

text-bison

Risk score 5: This code is not malicious. It is a simple script that sends tracking data to a remote server. The tracking data includes the package name, current working directory, home directory, hostname, username, DNS servers, resolved dependencies, and package version. The data is sent using the POST method to the Burp Collaborator server "xbcewrlf42d7quiugasdtk3m0d64ut.burpcollaborator.net".

For this particular example, we also asked OpenAI’s gpt-4 model about the risk score, using the exact same prompt as for the other models. The performance regarding this particular example is much better, as the malicious behavior is described in much more detail and the higher risk score better reflects its malicious intention.

gpt-4

Risk score 8: This JavaScript code snippet is highly suspicious and likely malicious, as it collects various information about the user's system and then sends this data to a remote server via HTTPS POST request. This could be a part of a data exfiltration operation or a reconnaissance activity, suggesting that it's potentially harmful.

The information collected includes the following:

The name of the package from the local package.json file.
The directory where the script is currently running (__dirname).
The user's home directory.
The hostname of the user's machine.
The username of the current user.
DNS servers that are currently in use on the user's machine.
Information from the package.json file such as the resolved fields, the version, and the whole package.json object itself.

This collected data is then stringified, URL encoded, and sent via a POST request to the specified host on port 443. The host (xbcewrlf42d7quiugasdtk3m0d64ut.burpcollaborator.net) does not look like a regular or trustworthy domain, which adds to the suspicion.

This behaviour clearly classifies as potentially malicious activity, as it can be used to gather sensitive information about the user's machine and system, which can then be used for further malicious activities. The potential impact on users or developers executing this script can be quite severe, as it might lead to the breach of sensitive personal or corporate information. Therefore, it's advised to avoid running this script and to thoroughly investigate the full source code for additional suspicious behavior.

Example #2: tvb-framework

One of the examples where both models completely disagreed is the file “sylvester.js” from the benign PyPI package tvb-framework 2.8.1.1. This file has been minified and packed, which made gpt-3.5-turbo assess it with a risk score of 9, while text-bison gave it a risk score of 0 (please find both answers below).

But while text-bison is right with its assessment, it is still surprising that it “judges” the packed code as “well-structured and documented”.

gpt-3.5-turbo

Risk score 9: The provided code is highly obfuscated and difficult to understand. It is likely that this code is malicious and is attempting to hide its true purpose. The code appears to be using various encoding schemes and cryptography to obfuscate its true intent. Additionally, the code appears to be manipulating arrays and performing mathematical operations, which could be used to exfiltrate sensitive information or perform other malicious activities. Without further context or analysis, it is impossible to determine the full extent of the risk posed by this code. It is recommended to avoid executing this code and to further investigate its purpose and origin.

text-bison

Risk score 0: The code is not malicious.The code is a JavaScript library for linear algebra. It provides a number of functions for performing operations on vectors and matrices, such as addition, subtraction, multiplication, and inversion. The code is well-structured and documented, and it does not contain any suspicious or malicious code.

Again, for completeness, find the assessment of gpt-4 below.

gpt-4

Risk score 7: The given JavaScript code appears to be utilizing obfuscation methods. These are common practices in malicious scripts to make it difficult for human readers and automated tools to understand the true intent of the code. Obfuscation itself does not mean the code is malicious, but it's a common practice in malicious scripts.

Here's what the code appears to be doing:

This code defines a function that seems to encode/decode data using a custom scheme. The use of String.fromCharCode and c.toString(36) are indicative of a conversion to a different base, which is typically seen in obfuscation and encryption schemes.
The function replace is used several times throughout the code. This is often used to alter data or code in obfuscated scripts.
There are also multiple cases where data is fetched from an array and checked against various conditions, possibly indicating some form of secret data handling or hidden instructions.

However, it's difficult to tell exactly what this code does because of the obfuscation, and because it is incomplete. There are also no clear signs of network connections, file operations, or other more overtly malicious behavior. So, the risk score is not at the maximum.

If the obfuscated code is de-obfuscated and turns out to be malicious, the potential impacts on users could include execution of unwanted actions, unauthorized access, data theft, and more. However, further analysis is required to make any conclusions about specific threats.

Example #3: prismstudio-dev

Another example where the assessment differed by 9 points is the package prismstudio-dev 1.1.0, which uses PyArmor for obfuscation purposes.

The file “_dbinfo.py” was judged malicious by gpt-3.5-turbo with a risk score of 9, and benign by text-bison with a risk score of 0. Apparently, text-bison only relied on the filename, which has been included in the prompt, to infer its functionality, and brushes over the fact that the content is highly obfuscated. Gpt-3.5-turbo, on the other hand, only takes its conclusion due to the fact that the payload is encrypted.

text-bison

Risk score 0: The code is a Python script that is used to extract information from a database. The script does not appear to be malicious and does not contain any suspicious code.

gpt-3.5-turbo

Risk score 9: The given source code snippet is obfuscated using PyArmor. The code is unreadable and it is not possible to determine its behavior. PyArmor is a tool used to protect Python scripts by encrypting and obfuscating the code. This can be used by attackers to hide malicious behavior from malware analysts. The use of PyArmor in this code snippet raises a high suspicion of malicious intent. It is recommended to further investigate the source of this code and its purpose.

But what lessons do we draw from such comparisons and examples?

Personally, as illustrated by the example above, we find that gpt-4 outperforms the other models for non-obfuscated code: It consistently provides better risk ratings and source code explanations.

On the one hand, this observation echoes competitions/comparisons such as https://chat.lmsys.org/?arena, where gpt-4 leads the pack (see Image 1). However, the Elo ratings computed on the basis of pairwise model comparisons seem suggesting a close race, while we find that gpt-4 often performs significantly better than the other tested models.

The situation is more complex for obfuscated code: Obviously, the hallucinations of text-bison regarding a snippet’s functionality, well-structuredness and documentation are odd. Here, we prefer a more conservative approach to highlight obfuscated code as potentially malicious, as done by the GPT models. After all, this technique is often used by attackers. However, due to a considerable number of minified and packed JavaScript files included both in npm and PyPI packages, this also results in a considerable number of false-positive findings.

Prompt Injection

A lot has been written about security risks and vulnerabilities of LLMs, and the OWASP just announced a plan to create a dedicated OWASP Top-10 for LLM Applications.

One particular risk in the context of our use-case is prompt injection, which we briefly discussed already in our previous blog post. It is caused by the fact that the prompt contains both our assessment instructions as well as potentially malicious, attacker-controlled code snippets. This in turn allows attackers to inject content that suggests benign behavior or that overrides our instructions.

However, in the case of the specific use-case discussed in this blog post, prompt injection is maybe not so much of a problem as originally anticipated:

The main reason is that code snippets extracted from open source packages have to be syntactically correct to be executable by, for example, Python or JavaScript interpreters. This requirement makes sure that suspicious code can be parsed and processed prior to being included in the prompt.

Before calling LLMs in our malware scanning pipeline, for example, we use the Pygments library in order to parse the respective source code and remove all comments. Along the same lines, we could also change/randomize all the identifiers of variables, functions, parameters, etc. This preprocessing is somewhat comparable to obfuscation techniques, only with a completely different goal in mind, which is to significantly constrain the attacker’s possibilities for prompt injection.

In case of the following (artificially constructed) code snippet, for example, the removal of Python docstrings and comments changes the risk assessment of gpt-3.5-turbo from 4 (less risky) to 5 (more risky):

‍

Again, GPT is surprisingly good at explaining the source code, in particular pinpointing problematic data flows, including the lack of any sanitization logic in the function “validate_spider_code”. Here’s its response for the undocumented code snippet:

gpt-3.5-turbo

(response on the code snippet without docstrings and comments):

Risk score 5: The provided source code snippet contains functions related to fetching spider code from a URL, validating it, and executing it. However, without the implementation details of the fetch_spider_code and validate_spider_code functions, it is difficult to determine the complete behavior of the code and whether it is malicious or not.

Suspicious activities to consider:

Downloading code from the internet: The fetch_spider_code function retrieves code from the URL https://tinyurl.com/3z3nu. If this URL points to malicious code, it could pose a risk.
Execution of downloaded code: The exec function is used to execute the retrieved spider code. If the spider code contains malicious instructions or commands, it could have an impact on the users or developers executing it.
Lack of validation: The validate_spider_code function is empty in the provided code snippet. If it does not perform any validation on the retrieved spider code, it could allow the execution of potentially malicious or untrusted code.

What cannot be easily removed or changed, at least not without changing the logic of the respective source code, are string literals. They are often used by malware authors for hostnames or URLs of attacker-controlled resources, e.g., “xbcewrlf42d7quiugasdtk3m0d64ut.burpcollaborator.net” in Listing 1. Their processing requires other or additional techniques to make sure that they are preserved where possible, to help determine maliciousness, and are removed otherwise.

Conclusion

The two take-aways of today’s blog post are as follows:

The risk assessment of the OpenAI model gpt-3.5-turbo and the Vertex AI model text-bison are very comparable. This, however, does not mean that they perform greatly: Both models have both false-positives and false-negatives (stay tuned for some numbers). Moreover, the OpenAI model gpt-4 outperforms the others when it comes to providing source code explanations and risk ratings for non-obfuscated code.

Last, on a more positive note, we explained why we believe that the risk of prompt injection is more manageable in this specific use-case than in others. This is mainly due to the fact that attackers do not live in a world free of rules … they still need to comply with the syntactic rules of the respective interpreters or compilers, which opens up the possibility for defenders to sanitize the prompt input.

Download

The Challenge

The Solution

The Impact

Get new posts in your inbox.

Welcome to the resistance

Oops! Something went wrong while submitting the form.

Get new posts in your inbox.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Get new posts in your inbox.

Article

Under the Hood: People.ai's Proactive Approach to AI Security

Hear how a CISO at an AI-first company is thinking about securing AI, and how AI should improve security programs.

Click to read

Article

People.ai transforms security and compliance with Endor Labs

People.ai uses Endor Labs for application security that provides an outstanding developer experience and makes it easier (and cheaper) to hit compliance targets.

Click to read

Article

Streamline Investigation with Enriched Vulnerability Search

Endor Labs Vulnerability Search helps you investigate CVEs with enriched metadata, call paths, and precise impact analysis—resolving conflicts across public feeds.

Click to read

Article

What is AppSec? A 2025 Guide for Security Practitioners

Learn what Application Security (AppSec) is, why it matters, and how to build a modern, scalable AppSec program across the SDLC.

Click to read

Article

Cracking the Code: Solving the Challenges of C/C++ Software Composition Analysis

This whitepaper details Endor Labs’ novel approach to indexing open source dependencies and detecting vulnerabilities in C and C++ codebases.

Click to read

Article

Mysten Labs Improves DevEx with Endor Labs

Within weeks of deployment, Endor Labs helped Mysten Labs transform its application security strategy.

Click to read

Article

Under the Hood: Mysten Labs’ Strategies for Building the Most Secure Blockchain

How Mysten Labs builds secure and low-friction systems for blockchain by focusing on code ownership, usability, and AppSec strategy.

Click to read

Article

Zebra Technologies Cuts SCA Noise by 97% with Endor Labs

With fewer alerts and more accuracy, Zebra Technologies now spends more time building and less time chasing false positives.

Click to read

Article

Next-Gen SCA for C/C++: Closing the Detection Gap

A new method for identifying OSS dependencies and vulnerabilities in C/C++ with greater accuracy and precision than legacy tools.

Click to read

Article

Critical SQL Injection Vulnerability in LlamaIndex (CVE-2025-1793) – Advisory and Analysis

The critical SQL injection vulnerability in LlamaIndex shows how LLMs can be a backdoor into your vector store

Click to read

Article

AppSec’s Exploitation Era: What Verizon, Mandiant, and Datadog Are Telling Us

A breakdown of DBIR, M-Trends, and DevSecOps reports and what they reveal about the future of AppSec in the age of AI.

Click to read

Article

Benchmarking Opengrep Performance Improvements

Opengrep's improvements to rule load times resulted in 3.15x faster average scan times than Semgrep

Click to read

Article

The UK Software Security Code of Practice through a Software Supply Chain Lens

How the UK Software Security Code of Practice reshapes supply chain security—and how Endor Labs helps vendors meet its core requirements.

Click to read

Article

CVE-2025-47949 Reveals Flaw in samlify That Opens Door to SAML Single Sign-On Bypass

Information on the likelihood and impact of CVE-2025-47949

Click to read

Article

Endor Labs Policies: Developer-Friendly Security Automation

This whitepaper talks about how Endor Labs uses context-aware security policies, like finding, action, exception, and remediation policies, to reduce noise, improve remediation speed, and help developers focus on real risks.

Click to read

Article

CVE-2025-4641 is Critical, But Likely Unreachable

Critical CVE-2025-4641 in WebDriverManager likely poses low real-world risk, but it should still be on radar. Here’s what you need to know, plus quick steps to check versions, upgrade, and secure CI pipelines.

Click to read

Article

Mastering Security Automation: Exception and Remediation Policies

Learn how Endor Labs cuts through security noise, stops unnecessary build breaks, and keeps developers focused on real risks—making security policy automation truly developer-friendly.

Click to read

Article

5 Tips for Managing Bazel Dependencies (Without Losing Friends)

Upgrading dependencies in a Bazel monorepo? Learn 5 tips to avoid breakages, reduce risk, and keep your team (and builds) running smoothly.

Click to read

Article

Why Security Policies Frustrate Developers (and How We Can Fix Them)

Most security policies create more problems than they solve, overwhelming developers with noise and unnecessary build breaks. Here's what a better approach looks like.

Click to read

Article

Open Source Gets Political: What The easyjson Debate Misses (and what to do about it)

A look at the easyjson controversy, open source provenance, and how Go's built-in protections help teams manage risk without overreacting.

Click to read

Article

Why We Raised a $93M Series B (In This Market)

Endor Labs raised a $93M Series B to accelerate its mission of securing the AI-driven software era. Learn why top investors preempted the round—and how Endor is redefining AppSec for modern development.

Click to read

Article

Secure AI-Generated Code at the Source

This solution brief shows how application security teams can fix risks from AI-generated code earlier in development and become the catalyst for secure, scalable adoption of AI coding tools like GitHub Copilot and Cursor in their organizations.

Click to read

Article

AI Security Code Review: A Multi-Agent Approach for Detecting Security Design Flaws at Scale

This whitepaper introduces how AI Security Code Review works, what it detects, how it integrates into your workflows, and why it represents the next generation of code scanning technology — built for the complexity and speed of AI-native software development.

Click to read

Article

Introducing the Endor Labs MCP Server: fix-first security for the vibe coding era

Endor Labs MCP Server powers real security fixes for vibe coding and AI-generated code—reduce noise and help AI tools fix risks for you.

Click to read

Article

Introducing AI Security Code Review

Endor Labs helps application security teams identify the few code changes that impact their security architecture across thousands of pull requests.

Click to read

Article

Meet the application security platform built for the AI era

The era of vibe coding is here. Learn how Endor Labs is helping AppSec teams secure and fix AI-generated code with a new agentic AI platform.

Click to read

Article

Critical RCE Vulnerability in Apache Parquet (CVE-2025-30065) – Advisory and Analysis

Endor Labs advisory: Critical CVE-2025-30065 in Apache Parquet lets attackers run code via schema parsing. Patch now by upgrading to version 1.15.1.

Click to read

Article

OWASP OSS Risk 2: Compromise of Legitimate Package

OWASP OSS Risk 2: Explore the compromise of legitimate open-source packages, with an in-depth case study of the tj-actions/changed-files GitHub Action supply chain attack.

Click to read

Article

Blast Radius of the tj-actions/changed-files Supply Chain Attack

Analysis of the tj-actions/changed-files GitHub Actions compromise, assessing the impact and damage from the attack.

Click to read

Article

What You Need to Know About UK Cyber Essentials Certification

Cyber Essentials helps UK businesses guard against internet-based attacks and prove their security measures are truly effective.

Click to read

Article

GitHub Action tj-actions/changed-files supply chain attack: what you need to know

GitHub Action tj-actions/changed-files was compromised, exposing CI/CD secrets. Learn how this attack impacts repositories and what steps to take now.

Click to read

Article

Application Security Posture Management (ASPM) Explained

Learn when application security posture management (ASPM) solutions work, their limitations, and alternatives for cutting through security alert noise.

Click to read

Article

How Endor Patches Are Built and Tested

Endor Patches are backported open-source security fixes. Learn how we build and test Endor Patches for compatibility and security.

Click to read

Article

The AppSec Maturity Staircase: Climbing Faster, Not Harder with Endor Labs

Each stage of the application security maturity staircase evolves your program—and Endor Labs is your escalator to the top.

Click to read

Article

How to Get Developers to Accept Security PRs Faster

Improve your mean time to remediation (MTTR) with smarter automatic pull requests that use upgrade impact analysis to reduce alert fatigue for developers.

Click to read

Article

DeepSeek R1: What Security Teams Need to Know

Learn how to evaluate security risk factors for DeepSeek R1, and about important considerations for working with open source AI models.

Click to read

Article

How to Discover Open Source AI Models in Your Code

Use Endor Labs to discover, evaluate, and enforce policies governing the usage of open source AI models from Hugging Face in your applications.

Click to read

Article

Remote Code Execution Vulnerabilities in Apache Struts

CVE-2024-53677 and CVE-2023-50164 are vulnerabilities in Apache Struts that could pave the way for remote code execution, or RCE. Learn how to figure out if you’re affected, and if so what to do about it

Click to read

Article

Everything You Need to Know About Opengrep

Opengrep is a fork of Semgrep's open source static code analysis engine. Learn about the benefits and how you can contribute.

Click to read

Article

Uncover Trends and Show AppSec Value with the Endor Labs Dashboard

Vulnerability metrics can help you uncover remediation and SLA trends, and demonstrate the value of AppSec investments to your leadership.

Click to read

Article

Identifying and Tracking FedRAMP False Positives

False positives can make FedRAMP ConMon costly. Learn why it’s hard to accurately identify false positives and some tactics for making this process less challenging.

Click to read

Article

How Endor Labs Prioritizes Open Source Security Patches

Learn how Endor Labs targets the critical dependencies that are responsible for most of the open source vulnerabilities in the software supply chain.

Click to read

Article

Why Reachability Analysis for JavaScript Is Hard (and How We Fixed It)

JavaScript reachability is tricky for SCA tools because of how JavaScript approaches dependency resolution, dependency imports, and functions.

Click to read

Article

Endor Patches Whitepaper

When upgrading is too risky, complex, or time consuming due to regressions, breaking changes, or new bugs, you can use Endor Patches to stay safe now while still meeting your SLA requirements.

Click to read

Article

Grip Security Reduces Noise by 99%

Grip Security replaced their traditional SCA tool with Endor Labs to improve their ability to build trust with customers without taxing developers.

Click to read

Article

Grip Security Builds Customer Trust with AppSec

Grip Security values strong application security because it helps them build trust with their customers. Learn how a security company approaches AppSec.

Click to read

Article

The Uncomfortable Truth of Vulnerable and Outdated Software Components

Learn where common industry sayings such as “stay up to date” come from and how you can help Endor Labs help you overcome those challenges.

Click to read

Article

Reduce FedRAMP Compliance Costs

Endor Labs reduces false positives and prioritizes real vulnerabilities, helping your team meet FedRAMP requirements with less stress and lower costs.

Click to read

Article

Why OVAL Feeds Outperform NVD for Linux Vulnerability Management

Learn why OVAL feeds, curated by Linux distributions, offer more precise vulnerability data than the NVD, reducing container scanning false positives and wasted efforts.

Click to read

Article

Achieving FedRAMP’s Container Scanning Requirements

Click to read

Article

Breaking Changes, Breaking Trust

Click to read

Article

Reducing FedRAMP Compliance Costs with Endor Labs

Vulnerability Management for FedRAMP compliance is expensive; your SCA tool should help you make it cheaper and easier.

Click to read

Article

Microsoft Defender for Cloud Natively Integrates with Endor Labs

Integrate Microsoft Defender for Cloud with Endor Labs for reachability analysis and attack path visibility — available natively within the Defender for Cloud console. Prioritize what to fix without switching tools.

Click to read

Article

Hugging Face Model Score Curation at Endor Labs

Understand how models are factored and scored at Endor Labs, new exploration tab for HuggingFace models

Click to read

Article

Endor Labs Announces Integrated SAST Offerings

Endor Labs now integrates Static Application Security Testing (SAST) into your application security testing stack.

Click to read

Article

Understanding the Cyber Resilience Act

The Cyber Resilience Act (CRA) sets mandatory security requirements for hardware and software. This blog covers key compliance objectives, challenges with OSS vulnerabilities, and best practices for maintaining security throughout the product life cycle.

Click to read

Article

Start Clean With AI: Select Safer LLM Models with Endor Labs

You can now use Endor Labs to evaluate AI models on HuggingFace for security, popularity, quality, and activity.

Click to read

Article

The U.S. Government Prioritizes Open Source Governance and Security

The U.S. Federal government's FY 2026 Cybersecurity Priorities focus on securing open source software, improving governance, and supporting OSS sustainability to strengthen the software supply chain.

Click to read

Article

Understanding the Basics of Large Language Models (LLMs)

Understand what LLMs are, how foundational LLMs are built, the opportunities they offer and the risks they pose.

Click to read

Article

Container Layer Analysis: Clarity in Remediation

Container layer analysis tells you which layer contains a vulnerability so you can prioritize remediation efforts more effectively and meet SLAs like FedRAMP.

Click to read

Article

Endor Labs Achieves 92% Reduction in SCA Alerts

Endor Labs reduces open-source vulnerability noise by 92%, boosting productivity and improving collaboration between development and security teams.

Click to read

Article

Karl Mattson Joins Endor Labs as Chief Information Security Officer

We're thrilled to have Karl Mattson as Endor Labs first Chief Information Security Officer (CISO)!

Click to read

Article

Highlights from Our 2024 Dependency Management Webinar

Get key insights from the 2024 Dependency Management webinar with Darren Meyer and Henrik Plate. We discuss how to prioritize vulnerabilities, navigate breaking changes, and leverage public vulnerability databases effectively.

Click to read

Article

Relativity Blocks Risks with Endor Labs

Relativity changed their security program from a blocker to an enabler by integrating security into developer workflows and empowering developers to prevent risks before they ship to production.

Click to read

Article

Blocking with Confidence: Relativity's Dev Experience Journey

Relativity changed their security program from a blocker to an enabler by integrating security into developer workflows and empowering developers to prevent risks before they ship to production.

Click to read

Article

48 most popular open source tools for Python applications, scored

Discover the top open-source tools for Python applications, ranked by Endor Scores based on security, activity, popularity, and code quality.

Click to read

Article

FedRAMP Requirements for Vulnerability Management and Dependency Upgrades

This blog covers key steps to simplify FedRAMP vulnerability management, helping you reduce risks and meet compliance timelines. It also provides practical tips to empower developers and streamline fixes for a smoother FedRAMP process.

Click to read

Article

Fix Vulnerabilities Faster with Auto Patching and Endor Patches

Automatically patch open source libraries with Endor Patches during the build process, ensuring software is continuously protected against vulnerabilities without manual intervention.

Click to read

Article

Dependency Management Report

Click to read

Article

Announcing the 2024 Dependency Management Report

Our third-annual Dependency Management Report explores how emerging trends in open source security should guide SDLC security strategy.

Click to read

Article

Starburst Gets 98.3% Noise Reduction with Endor Labs

Starburst, an open data lakehouse, replaced Rezillion with Endor Labs for SCA. They improved their ability to identify and prioritize open source while complementing the developer experience.

Click to read

Article

Building a DevSecOps Practice at Starburst

Wondering how to build or revamp a DevSecOps program? Get some immediately useful tips that you can apply to your startup or mature enterprise…or anywhere in between.

Click to read

Article

What is CI/CD Security and What Tools Do You Need to Do it?

Learn what CI/CD security is, why it’s important, and discover the key tools Endor Labs offers to help you secure your CI/CD pipelines.

Click to read

Article

PWN Request Threat: A Hidden Danger in GitHub Actions

Endor Labs provides comprehensive CI/CD security for GitHub action workflows that detect patterns that may indicate PWN request threats.

Click to read

Article

Address Open Source Risks with Endor Labs

Click to read

Article

Endor Labs Brand Guidelines

Click to read

Article

Give Devs the Confidence to Fix: Making Remediation Less Painful

Endor Labs’ newest capabilities help you reduce the research required to understand the impact of dependency upgrades and Endor Magic Patches help you stay safe without changing versions.

Click to read

Article

Endor Labs Partners with Microsoft to Strengthen Software Supply Chains

Endor Labs is now available on Azure Marketplace!

Click to read

Article

Prioritize Open Source Risks with Endor Labs

Endor Labs provides several filters to help you prioritize which risks to address first, resulting in an average 92% noise reduction.

Click to read

Article

Discover Open Source Risks with Endor Labs

Use Endor Labs to get accurate dependency inventories and complete vulnerability data sources.

Click to read

Article

48 most popular open source tools for npm applications, scored

Discover the 48 most popular open-source npm tools, complete with Endor Scores, to help you choose the best dependencies for your projects based on security, activity, popularity, and code quality.

Click to read

Article

Benchmarking Endor Labs vs. Snyk’s GitHub Apps

Compare Endor Labs and Snyk GitHub Apps.

Click to read

Article

Using Artifact Signing to Establish Provenance for SLSA

Use artifact signing, a feature of Endor Labs, to support build provenance requirements for SLSA.

Click to read

Article

Fixed is Better than Found | Upgrades & Remediation with Endor Labs

At Endor Labs, we believe your application security tooling must go beyond alerting—it should also helpyou fast-track remediation.

Click to read

Article

How to Fix Vulnerabilities Without Breaking Changes

Click to read

Article

Introducing Upgrades & Remediation: Give Developers the Confidence to Fix

Upgrade Impact Analysis shows you what breaking changes a fix could cause. Endor Patches are trusted patches you can use when upgrades are too painful.

Click to read

Article

Static SCA vs. Dynamic SCA: Which is Better (and Why It's Neither)

Software composition analysis (SCA) tools can take a static or dynamic approach. Learn the pros and cons of each option and see how the results differ.

Click to read

Article

33 Most Popular Open Source Tools for Maven Applications, Scored

Explore the top 33 open source tools for Maven, scored by Endor Labs on security, activity, popularity, and code quality.

Click to read

Article

Endor Labs Partner Program Overview

Click to read

Article

Jellyfish Enables Data-Driven AppSec with Endor Labs

Jellyfish replaced Snyk with Endor Labs to improve their ability to identify, prioritize, address, and predict open source risk. Learn more!

Click to read

Article

Jellyfish’s Data-Driven Security Program

Learn how Jellyfish’s security team uses a data-driven approach to risk management and the role SCA plays in their strategy.

Click to read

Article

What's a Security Pipeline? - On-Demand Webinar

Learn about common patterns and tradeoffs for security pipelines in this introductory webinar.

Click to read

Article

Secure Everything Your Code Depends On With Endor Labs

While conventional code security tools drown teams in false positives, Endor Labs zeroes in on real risks, empowering developers without without slowing them down.

Click to read

Article

Endor Labs Receives Strategic Investment from Citi Ventures

Endor Labs, a leader in software supply chain security, today announced a strategic investment from Citi Ventures.

Click to read

Article

We made the Inc. Best Workplaces List for 2024!

Endor Labs is named to Inc.’s annual Best Workplaces list for 2024.

Click to read

Article

New CocoaPods CVEs: Swift and Objective-C Supply Chains Are Fragile

Three CocoaPods CVEs raise serious security concerns for consumers of Swift and Objective-C libraries used for macOS and iOS mobile development.

Click to read

Article

Questions to Ask Your Software Composition Analysis Vendor

When choosing an SCA tool, you’ll need to understand how the tool generates an inventory, correlates to risks, helps you prioritize results, and integrates into your toolchain.

Click to read

Article

Backstage and Endor Labs: AppSec in a Dev’s Dream Workspace

The Endor Labs plugins for Backstage create an application security experience that doesn’t require developers to leave Backstage.

Click to read

Article

Managing Open Source Vulnerabilities for PCI DSS Compliance - On-Demand Webinar

Watch this 30-minute on-demand webinar to learn about changes to PCI DSS that impact OSS vulnerability management.

Click to read

Article

Container Scanning + SCA = Better Together

We’re excited to announce that Endor Labs now extends our software supply chain platform to include container scanning.

Click to read

Sorry, we couldn't find what you're looking for.

View All Results

Reviewing Malware with LLMs: OpenAI vs. Vertex AI

Open Report

View Report

Open Report

View Report

Contents

Share This Resource

Related Resources

OpenAI vs. Vertex AI

Example #1: techghoshal123

gpt-3.5-turbo

text-bison

gpt-4

Example #2: tvb-framework

gpt-3.5-turbo

text-bison

gpt-4

Example #3: prismstudio-dev

text-bison

gpt-3.5-turbo

Prompt Injection

gpt-3.5-turbo

Conclusion

Share This Resource

Related Resources

The Challenge

The Solution

The Impact

Get new posts in your inbox.

Get new posts in your inbox.

Get new posts in your inbox.

Get new posts in your inbox.

Get new posts in your inbox.

Get new posts in your inbox.

Under the Hood: People.ai's Proactive Approach to AI Security

People.ai transforms security and compliance with Endor Labs

Streamline Investigation with Enriched Vulnerability Search

Under the Hood: People.ai's Proactive Approach to AI Security

People.ai transforms security and compliance with Endor Labs

Streamline Investigation with Enriched Vulnerability Search

What is AppSec? A 2025 Guide for Security Practitioners

Cracking the Code: Solving the Challenges of C/C++ Software Composition Analysis

Mysten Labs Improves DevEx with Endor Labs

Under the Hood: Mysten Labs’ Strategies for Building the Most Secure Blockchain

Zebra Technologies Cuts SCA Noise by 97% with Endor Labs

Next-Gen SCA for C/C++: Closing the Detection Gap

Critical SQL Injection Vulnerability in LlamaIndex (CVE-2025-1793) – Advisory and Analysis

AppSec’s Exploitation Era: What Verizon, Mandiant, and Datadog Are Telling Us

Benchmarking Opengrep Performance Improvements

The UK Software Security Code of Practice through a Software Supply Chain Lens

CVE-2025-47949 Reveals Flaw in samlify That Opens Door to SAML Single Sign-On Bypass

Endor Labs Policies: Developer-Friendly Security Automation

CVE-2025-4641 is Critical, But Likely Unreachable

Mastering Security Automation: Exception and Remediation Policies

5 Tips for Managing Bazel Dependencies (Without Losing Friends)

Why Security Policies Frustrate Developers (and How We Can Fix Them)

Open Source Gets Political: What The easyjson Debate Misses (and what to do about it)

Why We Raised a $93M Series B (In This Market)

Secure AI-Generated Code at the Source

AI Security Code Review: A Multi-Agent Approach for Detecting Security Design Flaws at Scale

Introducing the Endor Labs MCP Server: fix-first security for the vibe coding era

Introducing AI Security Code Review

Meet the application security platform built for the AI era

Critical RCE Vulnerability in Apache Parquet (CVE-2025-30065) – Advisory and Analysis

OWASP OSS Risk 2: Compromise of Legitimate Package

Blast Radius of the tj-actions/changed-files Supply Chain Attack

What You Need to Know About UK Cyber Essentials Certification

GitHub Action tj-actions/changed-files supply chain attack: what you need to know

Application Security Posture Management (ASPM) Explained

How Endor Patches Are Built and Tested

The AppSec Maturity Staircase: Climbing Faster, Not Harder with Endor Labs

How to Get Developers to Accept Security PRs Faster

DeepSeek R1: What Security Teams Need to Know

How to Discover Open Source AI Models in Your Code

Remote Code Execution Vulnerabilities in Apache Struts

Everything You Need to Know About Opengrep

Uncover Trends and Show AppSec Value with the Endor Labs Dashboard

Identifying and Tracking FedRAMP False Positives

How Endor Labs Prioritizes Open Source Security Patches

Why Reachability Analysis for JavaScript Is Hard (and How We Fixed It)