Auto-classification scans your column names and metadata to identify potentially sensitive data like emails, phone numbers, and SSNs. This helps you track PII for compliance without manual tagging.Documentation Index
Fetch the complete documentation index at: https://docs.anomalyarmor.ai/llms.txt
Use this file to discover all available pages before exploring further.
How It Works
Auto-classification runs automatically during discovery:- Discovery scans your database metadata
- Pattern matching checks column names against known PII patterns
- Tags applied automatically to matching columns
- Results visible in the asset detail page
Auto-classification analyzes column names and types only, never actual data values. A column named
email gets tagged, but we never read the emails inside it.Detected Patterns
| Pattern | Example Column Names | Tag Applied |
|---|---|---|
email, user_email, email_address | pii:email | |
| Phone | phone, phone_number, mobile | pii:phone |
| SSN | ssn, social_security, tax_id | pii:ssn |
| Credit Card | cc_number, card_number, credit_card | pii:credit-card |
| IP Address | ip, ip_address, client_ip | pii:ip-address |
| Address | address, street, zip_code | pii:address |
| Name | first_name, last_name, full_name | pii:name |
Viewing Classified Data
To see which columns have been auto-classified:- Go to Assets in the sidebar
- Click on any asset to view details
- Look for Classification tags on columns
- Or filter: Assets → Filter → Classification → select a PII type
Enabling Auto-Classification
Auto-classification is enabled by default for all data sources. To verify or change:- Go to Settings → Data Classification
- Check that Auto-classification is toggled ON
- Select which patterns to detect (all enabled by default)
Accuracy and Limitations
Auto-classification uses pattern matching on column names, which means: It will catch:- Columns with obvious names (
email,phone_number,ssn) - Common naming conventions (
user_email,billing_address)
- Abbreviated names (
em,ph) - Non-English names
- Ambiguous names (
data,value,field1)
email_sent_countmight be tagged as email (it’s not PII)- Review classifications and remove incorrect tags manually
Customizing Detection
To adjust which patterns are detected:- Go to Settings → Data Classification
- Under Auto-classification Patterns, toggle patterns on/off
- Changes apply to the next discovery run
Manual Override
If auto-classification gets it wrong: To remove an incorrect tag:- Go to the asset detail page
- Find the column with the incorrect tag
- Click the tag → Remove
- Go to the asset detail page
- Click Add Tag on the column
- Select the classification that matches the data type (e.g.,
pii:emailfor email addresses,pii:phonefor phone numbers)
Verification
After discovery runs, verify auto-classification is working:- Go to Assets → Filter → Classification → Has any
- You should see assets with classified columns
- Click into an asset to see which columns were tagged
- Verify auto-classification is enabled in Settings
- Check that discovery has run recently
- Your column names may not match detection patterns
Common Questions
Does auto-classification read my actual data?
No. Auto-classification looks only at column names, types, and metadata. Column values are never read or transmitted. A column calledemail is tagged based on its name, not on inspecting what’s inside.
What PII types get detected out of the box?
Email, phone, SSN, credit card, IP address, physical address, and name. Detection runs automatically during discovery and tags matching columns with values likepii:email or pii:phone.
Why did a non-PII column like email_sent_count get tagged?
Pattern matching is name-based, so anything with email in the name matches. Open the column, click the tag, and remove it manually. Manual overrides persist across future discovery runs.
Will it miss PII in abbreviated or non-English column names?
Yes, it’s likely to miss short names likeem or ph, non-English names, and ambiguous ones like data or value. Add the appropriate pii:* tag manually from the asset detail page for those columns.
Is auto-classification on by default?
Yes, for all data sources. You can toggle individual patterns on or off in Settings → Data Classification. Changes take effect on the next discovery run.Next Steps
Custom Tags
Add your own classification tags
Data Handling
How we protect your data
