Data sources

CVE Database Comparison: NVD vs OSV vs GHSA vs Snyk Intel (2026)

Published April 21, 2026

Why the Database Matters More Than the Scanner

Every vulnerability scanner is only as good as the data it queries. Two scanners using identical detection logic will produce wildly different results depending on which vulnerability database they consult. A scanner checking only the NVD will miss ecosystem-specific advisories. One checking only GHSA will miss binary-level CVEs without package manager metadata.

The vulnerability data landscape in 2026 is fragmented. NIST's NVD experienced significant enrichment backlogs in 2024-2025, pushing many organizations to supplement with alternative sources. Google's OSV database emerged as the de facto standard for open source package vulnerabilities. GitHub Security Advisories (GHSA) became the primary source for many ecosystem-specific tools. Meanwhile, commercial databases like Snyk Intel offer proprietary research at a premium.

This guide compares the five most important vulnerability databases available in 2026, explains their strengths and weaknesses, and shows how multi-source scanners like ScanRook combine them for maximum accuracy.

Head-to-Head Comparison

FeatureNVDOSVGHSASnyk IntelRed Hat OVAL
Maintained byNISTGoogleGitHubSnykRed Hat
Total entries250,000+100,000+50,000+ProprietaryRHEL only
Update speed24-48hReal-timeReal-timeReal-timeDaily
API accessYes (rate limited)Yes (free)Yes (token)Paid onlyDownload only
Package matchingCPE (ambiguous)Exact (ecosystem)ExactExactRPM-specific
ScoringCVSS v3.1/v4.0Via aliasesCVSSProprietary + CVSSSeverity
CostFreeFreeFreePaidFree
Best forBinary/CPEPackage managersGitHub reposEnterpriseRHEL/CentOS

Deep Dive: National Vulnerability Database (NVD)

The NVD, maintained by NIST since 2005, remains the canonical reference for CVE identifiers. Every CVE ID issued globally eventually appears in the NVD with enrichment data including CVSS scores, CWE classifications, and CPE (Common Platform Enumeration) affected product strings. With over 250,000 entries, it provides the broadest coverage of any single public database.

The NVD's primary weakness is its CPE-based matching system. CPE strings are manually assigned by NIST analysts, creating a bottleneck. In 2024, the NVD experienced a months-long enrichment backlog where thousands of CVEs existed without CPE data, making them effectively invisible to scanners relying solely on NVD. The matching itself is imprecise — CPE vendor/product strings are ambiguous and version ranges often over-match.

The NVD API provides free access with rate limiting (5 requests/30s without a key, 50/30s with one). For high-volume scanning, most tools maintain a local mirror that syncs via the CVE Change History API. The NVD remains essential for binary scanning where no package manager metadata exists — when all you have is a product name and version string, CPE matching is often the only option.

Deep Dive: Open Source Vulnerabilities (OSV)

Google's OSV database, launched in 2021, solved the package matching problem that plagues the NVD. Instead of mapping CVEs to CPE strings, OSV uses exact ecosystem identifiers — the same package names and version specifiers used by package managers like npm, PyPI, Maven, and Go modules. When you query OSV with a package name and version, you get a precise answer: affected or not.

OSV aggregates advisories from ecosystem-specific sources including GHSA, RustSec, PyPA (Python), Go vulnerability database, and Linux distribution advisories. This aggregation means a single OSV query covers multiple upstream sources. The API is free, unrestricted, and supports batch queries — making it ideal for scanners processing hundreds of packages per image.

The limitation of OSV is coverage scope. It focuses exclusively on open source packages with known ecosystem identifiers. Binary artifacts without package metadata, proprietary software, and hardware CVEs are not covered. For container scanning, OSV excels at matching installed packages detected via installed-state scanning but cannot identify vulnerabilities in statically-linked binaries without SBOM data.

Deep Dive: GitHub Security Advisories (GHSA)

GHSA is GitHub's curated vulnerability database covering packages across all major ecosystems. Unlike the NVD which relies on CPE, GHSA maps vulnerabilities directly to package ecosystem identifiers with precise affected version ranges. Each advisory is reviewed by GitHub's security team and often includes remediation guidance and patch references.

GHSA serves as an upstream source for OSV, meaning most GHSA entries also appear in OSV queries. However, GHSA provides additional value through its Dependabot integration — GitHub repositories automatically receive pull requests when a dependency has a known GHSA entry. The API requires a GitHub token but is free for all authenticated users.

GHSA's coverage is strongest for the npm, Maven, PyPI, and RubyGems ecosystems where GitHub hosts the majority of source code. Coverage for system-level packages (apt, apk, rpm) is thinner since those ecosystems have their own advisory infrastructure. For teams already using GitHub for source code, GHSA provides the tightest integration between code, dependencies, and vulnerability data.

Deep Dive: Snyk Intel

Snyk's proprietary vulnerability database combines public CVE data with original research from Snyk's security team. Snyk claims to discover vulnerabilities an average of 46 days before they appear in the NVD, providing an early-warning advantage for paying customers. The database also includes proprietary severity scoring that accounts for exploitability context beyond standard CVSS.

The primary advantage of Snyk Intel is its remediation guidance. Each entry includes not just affected versions but specific upgrade paths, function-level reachability data, and exploit maturity assessments. This additional context helps security teams prioritize remediation more effectively than raw CVSS scores alone.

The obvious limitation is cost. Snyk Intel is only accessible through Snyk's commercial products. Organizations using alternative scanners cannot query Snyk's database directly. For teams that can justify the expense, the earlier detection and richer context can reduce mean time to remediation. For cost-sensitive teams, combining free sources (NVD + OSV + GHSA) provides comparable detection rates with slightly delayed coverage.

Deep Dive: Red Hat OVAL

Red Hat publishes OVAL (Open Vulnerability and Assessment Language) data that reflects the actual patch status of packages in RHEL, CentOS, and Fedora. This data is critical because Red Hat backports security fixes into older package versions without changing the upstream version number. A package showing version 1.2.3 on RHEL might contain patches for CVEs fixed upstream in version 1.2.5 — but the NVD only knows about the upstream version range.

Without OVAL data, scanners produce massive false-positive counts on RHEL-based images. Every backported fix appears as an unpatched vulnerability when matched against NVD version ranges. Red Hat's OVAL definitions authoritatively state which RHEL package versions are affected, resolving this ambiguity. For detailed coverage of this topic, see our guide on Red Hat backporting.

OVAL data is distributed as XML files downloadable from Red Hat's security data repository. There is no real-time API — scanners must download and parse the full OVAL definitions periodically. Updates are published daily. The data is strictly limited to Red Hat ecosystem packages and cannot be used for other distributions or ecosystems.

When to Use Which: Decision Tree

Choosing the right database depends on what you are scanning and what metadata is available:

Scanning application dependencies (npm, pip, Maven, Go)?

Use OSV or GHSA. They provide exact package-version matching with zero ambiguity.

Scanning compiled binaries without package metadata?

Use NVD. CPE matching is imprecise but it is the only option when package manager data is unavailable.

Scanning RHEL/CentOS container images?

Use Red Hat OVAL as the primary source. NVD will produce false positives due to backporting.

Need earliest possible detection of new CVEs?

Snyk Intel (paid) or GHSA (free). Both publish faster than the NVD enrichment pipeline.

Need maximum coverage across all target types?

Combine all available sources. No single database covers every scenario.

How ScanRook Combines Multiple Databases

ScanRook queries multiple vulnerability databases in a single scan to maximize detection accuracy while minimizing false positives. The approach is target-aware: the scanner first identifies what type of artifact it is analyzing, then selects the appropriate data sources.

For container images, ScanRook reads the installed package state from the image's package manager databases (dpkg, rpm, apk). These exact package names and versions are queried against OSV for ecosystem-wide coverage. If the image is RHEL-based, Red Hat OVAL definitions are consulted to eliminate backporting false positives. Binary files without package metadata are fingerprinted and checked against the NVD via CPE matching.

This multi-source approach means ScanRook detects vulnerabilities that single-database scanners miss. A scanner using only OSV would miss binary-level CVEs. One using only NVD would produce false positives on RHEL and miss ecosystem-specific advisories. By combining sources intelligently based on context, ScanRook provides comprehensive coverage with high precision. See our benchmark results for detection rate comparisons.

The 2024-2025 NVD Enrichment Crisis

In February 2024, NIST dramatically reduced its NVD enrichment output, creating a backlog of thousands of CVEs without CPE data, CVSS scores, or CWE classifications. This event exposed the fragility of the security ecosystem's reliance on a single government-funded database.

Organizations that relied solely on NVD-based scanners experienced significant blind spots during this period. CVEs existed in the database but lacked the CPE mappings needed for automated matching. This accelerated adoption of alternative sources — particularly OSV, which was unaffected since it uses ecosystem-native identifiers rather than NVD-assigned CPE strings.

The crisis reinforced a critical principle: vulnerability intelligence should never depend on a single source. Organizations that had already adopted multi-source scanning were largely unaffected. Those locked into NVD-only tooling scrambled to add supplementary data sources.

Emerging Standards: VEX and CSAF

Beyond vulnerability databases, two emerging standards are changing how vulnerability data is communicated. VEX (Vulnerability Exploitability eXchange) documents allow software producers to state whether a CVE actually affects their product — providing official "not affected" or "fixed" status that scanners can consume to suppress false positives.

CSAF (Common Security Advisory Framework) is the successor to Red Hat's OVAL format, providing machine-readable security advisories in JSON format. Red Hat, Cisco, and other major vendors now publish CSAF documents alongside traditional advisories. Scanners that consume CSAF data can make more precise affected determinations than those relying on CPE version ranges alone.

ScanRook's architecture is designed to consume both traditional database APIs and newer VEX/CSAF documents as they become more widely available. This forward-looking approach ensures that as the vulnerability data ecosystem evolves, scan accuracy improves without requiring fundamental architectural changes.

Data Freshness and Caching Strategies

Vulnerability data changes constantly. New CVEs are published daily, existing entries are revised with updated severity scores, and affected version ranges are corrected. Scanners must balance data freshness against performance — querying APIs for every package in real-time is too slow for large-scale scanning.

ScanRook uses a tiered caching strategy. OVAL definitions are synced daily since they change infrequently. OSV data is queried in batch with short-lived caches (hours) to capture rapid ecosystem updates. NVD data uses a local mirror synced via the change history API. This combination ensures scans complete in seconds while maintaining data freshness within acceptable bounds.

For air-gapped or on-premise deployments, ScanRook supports fully offline scanning using pre-downloaded database snapshots. See our guide on on-prem vs SaaS scanning for deployment architecture options.

Frequently Asked Questions

What is the most comprehensive CVE database?

The NVD is the most comprehensive single source with over 250,000 entries. However, combining multiple databases provides better overall coverage since each source has different strengths.

How fast does the NVD update after a CVE is published?

Typically 24-48 hours for enrichment (CVSS, CPE). During backlog periods, enrichment can take weeks. The CVE ID itself appears quickly, but without enrichment data, automated matching does not work.

Is OSV better than NVD for package vulnerability scanning?

For package manager dependencies, yes. OSV uses exact ecosystem identifiers providing precise matching. For binaries without package metadata, NVD's CPE system is still necessary.

Can I use the NVD API for free?

Yes. Without a key: 5 requests per 30 seconds. With a free API key: 50 per 30 seconds. Sufficient for individual use but inadequate for high-volume scanning without local caching.

What is CPE matching and why does it cause false positives?

CPE uses vendor:product:version strings to identify affected software. These strings are manually assigned and often ambiguous. Different products may share similar names, and version ranges can over-match packages that are not actually vulnerable.

Does GHSA only cover GitHub-hosted projects?

No. GHSA covers vulnerabilities in packages from all major ecosystems regardless of where the source code is hosted. The name refers to the maintainer, not the coverage scope.

How does Red Hat OVAL differ from NVD data?

OVAL reflects actual RHEL patch status accounting for backported fixes. NVD tracks upstream versions only, causing false positives on RHEL systems where fixes are backported without version bumps.

Why do scanners need multiple vulnerability databases?

No single database covers all scenarios. NVD handles binaries, OSV handles packages, GHSA provides curated assessments, and OVAL eliminates RHEL false positives. Multi-source scanning maximizes both detection and precision.

Scan with All Five Databases at Once

ScanRook queries NVD, OSV, GHSA, and Red Hat OVAL in a single scan. Upload a container image, source archive, or binary and get comprehensive vulnerability results in seconds.

Related Posts

More on this topic.