signzy

API Marketplace

downArrow
Logo
Responsive
NAICS Code Lookup by Company Name: 4 Methods Ranked by What Actually Works

NAICS Code Lookup by Company Name: 4 Methods Ranked by What Actually Works

5 Minutes
Key Highlights
  • A Critical Risk Signal, Not a Label: In a KYB (Know Your Business) context, a NAICS code is a frontline compliance signal. Misclassifying a high-risk entity (like an obscure holding company) as a standard consulting firm creates immediate security blind spots.
  • The Flaw in Government Data: Official government registries and census databases are fundamentally unreliable for specific company lookups because they depend entirely on self-reported, stale data that carries no penalties for inaccuracies.
  • Automated Platforms Rank Highest: Modern KYB platforms provide the most accurate lookups by using machine learning to cross-reference multiple live sources—including actual website content—instead of relying on single government forms.
  • The Power of Mismatch Detection: Defensible compliance requires a system that doesn't just pull a code, but actively flags inconsistencies when a company's real-world operational footprint contradicts its registered paperwork.

If you have ever tried to find a NAICS code for a specific business you are about to onboard, you already know the frustration.

The Census Bureau's official NAICS site is excellent at explaining what a six-digit code means. It is terrible at telling you which code Apex Horizon Capital LLC actually uses. Government databases list codes that companies self-reported in a census survey, often years ago. Industry keyword searches hand you a list of possibilities, not a verified answer.

This matters more than most people think. When you are onboarding a business client — a borrower, a marketplace vendor, a payment processing customer — their NAICS code is not just a classification label. It is one of the first risk signals your compliance team acts on. Get it wrong, and a high-risk entity clears onboarding with the wrong risk score attached to it.

This guide ranks four ways to look up a NAICS code by company name, from most reliable to least, based on one question: can you use this method to get the NAICS code of a specific business with enough confidence to make a compliance decision?

Why the NAICS Code Is a Risk Signal, Not Just a Label

NAICS codes are six-digit numbers that classify businesses by their primary economic activity across the US, Canada, and Mexico. Each digit narrows the classification further:

DigitsWhat They RepresentExample
1-2Economic sector55 = Management of Companies
1-3Subsector551 = Management of Companies and Enterprises
1-4Industry group5511 = Management of Companies
1-5NAICS industry55111 = Management of Companies
All 6National industry (full code)551112 = Offices of Other Holding Companies

That last row is worth pausing on. Offices of Other Holding Companies (551112) describes entities that hold equity interests in other companies without actively managing them. It is a legitimate structure for private equity firms and family offices. It is also the most common legal vehicle used in layered ownership arrangements designed to obscure who ultimately controls a business. When a KYB system encounters 551112, the correct response is to automatically flag it for enhanced due diligence and initiate a UBO trace.

When a manual process misclassifies that same entity as 541611 (Management Consulting), nothing happens. Nobody flags it. Nobody traces the ownership. The business onboards cleanly.

That gap is the problem this article is about.

The Reliability Problem with Self-Reported Codes

Before you trust any government database for a company's NAICS code, there is something you need to understand about how those codes get there.

A 2022 peer-reviewed study in Statistics and Public Policy, conducted by IRS researchers analyzing tax filings, found that NAICS codes are self-reported on tax forms and that reporting incorrect codes or even leaving the code blank has no tax consequences. The researchers developed predictive ML models specifically because self-reported codes were so frequently unusable.

The study found that a meaningful share of businesses in government databases either have the wrong code or no code at all. Their models could correctly identify the sector for over half of businesses that had reported no informative code whatsoever.

When Middesk was building its industry classification product in 2019, they found coverage gaps as high as 60% when sampling industry codes from incumbent data vendors. Their conclusion at the time: without a dedicated classification layer, operations teams end up reviewing websites, Yelp, and Google in a best-effort guess to determine the industry of new clients — a problem that has only grown as the volume of business onboarding has increased.

This is the data quality problem that all four methods below are trying to solve, with very different levels of success.

Method 1: Automated KYB and Business Verification Platforms

Reliability for company-name lookup: Highest

Modern KYB platforms classify a business's industry based on what it actually does, not what it reported in a government survey. You submit a company name, registered address, and optionally a website URL. The platform cross-references multiple sources simultaneously — state business registries, Secretary of State filings, commercial data, website content analysis, and ML models trained on millions of business profiles. A classification comes back in seconds.

This is meaningfully different from a government lookup in one specific way: the best platforms detect mismatches. If the website of a company registered as Management Consulting describes international payment processing services for overseas merchants, the system surfaces that inconsistency before the application proceeds. A Census Bureau keyword search would have returned whatever the company self-reported and moved on.

Arva AI, a Y Combinator-backed KYB startup, described the downstream consequence from observing hundreds of fintech onboarding processes: human analysts end up in manual review over 40% of the time because existing data sources cannot provide a complete picture of the business. When industry classification is unclear, the back-and-forth with the applicant can stretch for weeks. In some cases, onboarding drop-off can reach 50% before a business ever completes the process.

Signzy's KYB platform addresses this directly. It screens businesses across all 50 US states against company databases and registries, verifies EINs against official records, analyzes beneficial ownership chains using AI-powered document forensics, and cross-references multiple databases in real time. When a business submits for onboarding, Signzy surfaces company registration status, ownership structure, director information, and risk signals — including industry context — through a unified API, without requiring weeks of back-and-forth document collection.

This gives compliance teams what government databases cannot: a multi-source picture of what the business actually is, not just what it says it is.

What to look for when evaluating any automated KYB platform for NAICS classification:

What to EvaluateThe Non-Obvious Question to AskRed Flag Answer
Data sourcesDo they cross-reference web content against registry data, or just query a single source?"We pull from one data partner"
Confidence scoringDo they return a confidence score alongside the code, or just a code?No confidence signal at all
Mismatch detectionWhat happens when a company's website describes different services than its registered classification?No mismatch logic; they return the registry code by default
Private company coverageWhat percentage of LLCs and sole proprietors are classifiable vs. returned as null?"Our coverage is best for incorporated businesses"
Data freshnessHow often is classification updated and what triggers a refresh?"We refresh annually" or no clear policy
SIC alongside NAICSDo they return both codes simultaneously?NAICS only, requiring a separate lookup for SIC-dependent systems
API batch throughputWhat is the rate limit and latency at scale?No documented SLA or per-request limits that bottleneck onboarding volume

Where this method does not fit: Cost and setup time make it the wrong tool if you need to look up one company once. Methods 3 and 4 below cover that case.

Method 2: Direct Registry and Government Database Search (Manual)

Reliability for company-name lookup: Moderate, with important caveats

This is what most compliance analysts do without an automated platform: manually search government registries and business databases to find self-reported codes. The most useful sources, ranked by reliability:

SEC EDGAR — for public companies and registered investment vehicles. Public companies include SIC codes in their 10-K filings. Search efts.sec.gov by company name, find the SIC code in the filing header, then crosswalk it to NAICS. Reliable for public companies. Useless for the private LLCs that make up the vast majority of business onboarding queues.

SAM.gov — for federal contractors. If a business has ever contracted with the federal government, it has a self-reported NAICS code in SAM.gov. Works well for contractors and professional services firms. Does not help for businesses with no government exposure.

State Secretary of State portals — every US state has one. They confirm legal name, entity type, registered agent, and formation date but do not show NAICS codes directly. Delaware is the most searched because so many businesses incorporate there regardless of where they actually operate.

Census Bureau NAICS search at census.gov/naics — lets you search by industry keyword, not by company name. You are guessing what code a company might use based on their described activities. Useful for narrowing possibilities; not a verification method.

The honest limitation: Even when you find a self-reported code, it may be wrong. A holding company that classifies itself as Management Consulting (541611) may genuinely believe that is accurate because it provides advisory services to its subsidiaries. But 541611 does not trigger the same compliance review that 551112 does. The downstream consequences are real.

Reliability for company-name lookup: Low — right tool for self-classification only

The NAICS Association's drill-down table lets you navigate from the two-digit sector level down to a full six-digit code by progressively choosing subcategories. It is well-built for one specific purpose: helping a business owner figure out where they fit in the classification hierarchy.

Where it genuinely works:

  • A new fintech deciding whether they are 522390 (Other Activities Related to Credit Intermediation) or 523910 (Miscellaneous Intermediation) before registering
  • A compliance officer mapping the full range of codes within a sector to build an internal prohibited business list

Where it breaks down: You cannot search for "Apex Horizon Capital LLC" and get a result. There is no company lookup. You end up with a best guess based on described activities, which as the IRS research above shows, frequently does not match actual classification.

The NAICS Association's Company Lookup Tool covers over 26 million US companies with free search and per-record purchase at $9.95. Worth trying for a one-off lookup. It is not built for volume, and data freshness is not guaranteed.

Method 4: SIC-to-NAICS Crosswalk

Reliability for company-name lookup: Moderate — only useful when you already have a SIC code

SIC codes predate NAICS and still appear in SEC filings and some state-level licensing databases. If you have already found a company's SIC code and need the NAICS equivalent, the crosswalk at naics.com is a legitimate bridge.

The problem is many-to-one mapping. SIC code 6726 (Investment Offices, Not Elsewhere Classified) maps to multiple NAICS codes including 523910, 525910, and 551112. Without additional context about the specific company, you cannot determine which applies. You end up with the same problem as Method 3: an educated guess rather than a verified answer.

Use this method when you are working with legacy data that already has SIC codes and need NAICS for a newer system. Do not use it as a primary lookup method for fresh onboarding data.

Side-by-Side: Which Method for Which Situation

Your SituationBest MethodWhy
Onboarding 200+ businesses per monthAutomated KYB platformOnly option that scales without adding headcount or review time
Verifying a specific private LLC or partnershipAutomated KYB platformGovernment databases have poor coverage of private entities
Looking up a public company quicklyManual — SEC EDGARFree, fast, reliable for SEC-registered companies
Finding your own business's NAICS codeNAICS drill-down tableBuilt for self-classification; no third-party data needed
You have a SIC code and need NAICSSIC-to-NAICS crosswalkDirect translation — but verify the output before acting on it
Company is a federal contractorManual — SAM.govSelf-reported codes are present and accessible for free
Company incorporated in Delaware but operates elsewhereAutomated KYB platformState registry confirms formation only, not classification
You suspect the self-reported code understates riskAutomated KYB platformMismatch detection is only possible with multi-source cross-referencing

The Codes That Should Pause Any Onboarding Flow

A lookup that returns a code is only valuable if your system knows what to do with it. Here are the NAICS codes that should trigger a pause — along with what they are actually signaling beyond a generic "high risk" label:

NAICS CodeIndustry LabelWhat It Actually Signals in a KYB Context
551112Offices of Other Holding CompaniesPassive holding structures are the most common legal vehicle for obscuring UBO chains. FATF guidance specifically flags these as high-risk entity types.
531390Other Activities Related to Real EstateReal estate is one of the most consistently flagged sectors for money laundering risk in FATF guidance, which dedicated a full report to the sector's vulnerabilities. "Other activities" within it is the hardest sub-classification to verify accurately.
713210Casinos (except Casino Hotels)High cash volume by definition. State-level gaming regulations vary significantly; many compliance programs carry outright prohibitions.
522390Other Activities Related to Credit IntermediationIncludes check cashers, some payday lenders, and money service business overlap. The "other" label masks significant variation in actual activity.
523130Commodity Contracts DealingCovers crypto asset dealing under several state frameworks. Classification is frequently stale given how fast the sector evolves.
812990All Other Personal ServicesA catch-all used when nothing else fits cleanly. Its presence should prompt a question about why a more specific code was not applied.
453998All Other Miscellaneous Store RetailersAnother catch-all. Appears in front business structures because it is difficult to challenge without additional investigation.

Seeing any of these does not mean you decline the business. It means you apply enhanced due diligence — additional document requests, UBO verification, source of funds questions. The NAICS code is a trigger for a process, not a verdict.

What Happens When the Code Is Wrong

The practical consequence of a misclassification is not just a data quality issue. It is a compliance gap that sits undetected until something goes wrong downstream.

A Bureau of Labor Statistics research paper on NAICS accuracy noted that, based on reports from state agencies, respondents are often confused by NAICS descriptions — particularly the "Does Not Include" components — and report incorrectly as a result. Most NAICS misclassification is not deliberate. It is businesses genuinely not knowing which code applies, or choosing one that was accurate at registration and has since drifted as the business evolved.

The consequence for the institution onboarding them: a real estate holding company sits in the portfolio classified as a consulting firm. A high-volume cash operation is scored at standard risk. An entity with complex beneficial ownership clears onboarding because the structure that should have flagged it was never surfaced.

TrueBiz, a Y Combinator-backed KYB startup, described this as the core problem they built around: financial services companies onboarding business accounts often revert to manual review because existing data sources cannot provide a complete picture of the business. When that picture is incomplete, the compliance team either slows down to investigate or accepts the risk and moves on. Neither is a good outcome at scale.

What the End-to-End Flow Looks Like When It Works

A well-designed KYB onboarding process treats NAICS classification as one input in a broader risk signal, not a standalone checkbox:

  1. Business submits name and address at onboarding — website URL optional but increases classification accuracy
  2. KYB platform retrieves and validates company registration, EIN, beneficial ownership chain, and industry classification from multiple sources simultaneously
  3. Risk engine scores the result against a pre-built risk matrix — prohibited list, EDD triggers, standard review
  4. Mismatch detection runs — if the website describes services inconsistent with the registered classification, a flag surfaces before the application proceeds
  5. Compliance team sees a clear outcome — standard onboarding, or "EDD required: NAICS 551112, holding company structure, UBO trace initiated"
  6. Ongoing monitoring enrolls — if the business changes its primary activity later, the system detects the shift and re-scores automatically

Step 2 is where the lookup method determines everything that follows. A Census keyword search feeding a manual spreadsheet will not produce the inputs that steps 3 through 6 require. A multi-source automated platform that validates ownership chains, cross-references business registries across all 50 states, and surfaces classification signals in real time will.

This is what Signzy's KYB infrastructure is built around. Trusted by 500+ global companies, Signzy handles everything from US LLC formations to complex international corporate structures — surfacing the risk signals that manual processes and single-source databases routinely miss. If your onboarding process is still relying on what businesses self-reported to the government, it is worth seeing what a complete verification layer looks like in practice.

Book a demo with Signzy to see how it works in your onboarding flow.

Bottom Line

Looking up a NAICS code by company name sounds like a data task. It is a compliance decision in disguise.

The Census Bureau's tools were built for economic statistics, not business verification. Self-reported codes are frequently inaccurate, stale, or missing. For compliance teams onboarding businesses at any meaningful scale, the only method that produces a defensible classification is one that cross-references what a company says it does with what it actually does — pulling from registries, documents, web presence, and ownership data simultaneously.

For one-off lookups on public companies, SEC EDGAR and SAM.gov get you close. For self-classification, the NAICS drill-down is the right tool. But for any volume-driven onboarding workflow where a wrong code has real consequences downstream, you need a system that does not rely on what a business reported to the government years ago.

Spread the knowledge!

Found this useful ? Share what you learned!

XLinkedIn
Saurin Parikh

Saurin Parikh

Saurin is a Sales & Growth Leader at Signzy with deep expertise in digital onboarding, KYC/KYB, crypto compliance, and RegTech. With over a decade of professional experience across sales, strategy, and operations, he’s known for driving global expansions, building strategic partnerships, and leading cross-functional teams to scale secure, AI-powered fintech infrastructure.

Onboard User

Websites can't replace conversations. Let's talk?

We're just one call away, ready to answer all your queries and provide the perfect solution for your business needs.