Amazon Macie - Overview.
Scope:
- Intro,
- Core Capabilities,
- Management and Automation,
- Pricing and Availability,
- Architecture Flow,
- Architecture Flow (Data Sources),
- Architecture Flow (Discovery & Classification Engine),
- Architecture Flow (Findings & Evaluation),
- Architecture Flow (Integrations & Automation),
- Architecture Flow (Data Security Lifecycle),
- Architecture Flow (Security & Compliance),
- Visual Architecture Layout.
- Amazon Macie is a fully managed data security and privacy service from Amazon Web Services (AWS).
- Amazon Macie uses machine learning (ML) and pattern matching to:
- Discover,
- Classify,
- Protect sensitive data in Amazon S3.
- Sensitive Data Discovery: Automatically identifies sensitive information such as:
- Personally identifiable information (PII),
- Financial records (e.g., credit card numbers),
- Intellectual property using machine learning.
- Automated S3 Inventory: Continually evaluates twtech S3 bucket inventory to:
- Monitor security
- Access controls,
- Identifying risks like unencrypted
- Publicly accessible buckets.
- Classification & Risk Visibility: Assigns business value to data items and provides an interactive data map to visualize where sensitive data resides across twtech AWS environment.
- Custom Identifiers: Allows users to define custom data types using regular expressions that detect patterns specific to their business, such as internal employee ID formats.
- Integration: Findings are sent to Amazon EventBridge and can be published to AWS Security Hub to trigger automated remediation workflows.
- Discovery Jobs: twtech-admin (Users) can run:
- One-time,
- Daily,
- Weekly,
- Monthly discovery jobs that scan all or a subset of objects in the S3 buckets.
- Multi-Account Support: Supports centralized monitoring across multiple AWS accounts through integration with AWS Organizations.
- Allow Lists: Enables users to specify text or patterns like sample test data, that Macie would ignore during scans process to reduce false positives.
- According to AWS documentation , Macie pricing is based on three main dimensions:
- S3 Bucket Assessment: Charged per bucket per month (e.g., $0.10) to monitor encryption and public status after a 30-day free trial.
- Automated Discovery: Charged based on the number of objects evaluated for sampling (e.g., $0.01 per 100,000 objects).
- Sensitive Data Discovery: Charged per GB of data actually scanned, with volume discounts starting after the first 50 TB (e.g., $1.00 per GB for the first 50 TB, then dropping to $0.50 per GB).
- Free Trial: New users can benefit from a 30-day free trial for automated sensitive data discovery and bucket evaluation.
The concept of
- Amazon Macie is a fully managed data security and privacy service.
- Amazon Macie uses machine learning (ML) and pattern matching to discover and protect sensitive data in Amazon S3.
- Amazon Macie helps twtech to:
- Identify personally identifiable information (PII),
- Financial data,
- Credentials,
- Custom-sensitive data types, for automated
compliance and security workflows.
Architecture Flow
1. Architecture Flow (Data Sources)
- Macie primarily operates on Amazon S3 buckets, continuously or
on-demand analyzing:
- S3 objects and metadata
- Bucket-level
configurations (e.g.,
public access, encryption)
- Object contents (text, structured/unstructured data)
Additional input signals include:
- AWS
Organizations (for multi-account management)
- AWS Config and CloudTrail (context
for resource inventory and activity)
2. Architecture Flow (Discovery & Classification Engine)
- At the heart of Macie is the Discovery Engine that uses:
- Machine Learning Models to
classify sensitive data (e.g.,
names, addresses, credit card numbers)
- Pattern Matching via predefined and custom data identifiers
- Sampling and content analysis for large datasets
Steps:
- Inventory Discovery –
Scans S3 buckets to map data assets.
- Classification Jobs – Evaluates object contents to detect sensitive data.
- ML-based Categorization – Labels files with detected data types (PII, credentials, etc.).
- Risk Scoring – Generates severity based on exposure (public, shared, unencrypted).
3. Architecture Flow (Findings & Evaluation)
- Results are generated as Macie Findings, categorized into:
- Policy Findings –
Misconfigurations (e.g., publicly
accessible S3 buckets).
- Sensitive Data Findings – Actual detection of sensitive data (e.g., PII in objects).
Each finding includes:
- Bucket/object
details
- Detected data types and count
- Sensitivity score
- Exposure level (public/private)
4. Architecture Flow (Integrations & Automation)
- Macie finds integrate natively with AWS services for remediation and alerting:
|
Integration |
Purpose |
|
Amazon EventBridge |
Triggers automated workflows (e.g., Lambda remediation, SNS alerts). |
|
AWS Security Hub |
Centralized view of findings with
other AWS security tools. |
|
AWS Organizations |
Centralized Macie management
across accounts. |
|
AWS CloudWatch |
Monitors classification job
metrics and performance. |
|
AWS Lambda / Step
Functions |
Custom remediation workflows (e.g., encrypt or quarantine sensitive
data). |
5. Architecture Flow (Data Security Lifecycle)
- Data flow in Macie aligns with a continuous data protection cycle:
- Discover – Identify where data resides in S3.
- Classify – Detect and label sensitive data.
- Evaluate – Assess configuration risks and data exposure.
- Act – Trigger alerts and automate remediation.
- Monitor – Continuously evaluate new or modified objects.
6. Architecture Flow (Security & Compliance)
- Macie supports multiple compliance frameworks:
- GDPR,
HIPAA, PCI-DSS, ISO 27001
- Encryption via AWS KMS
- Access control via IAM policies
Visual Architecture
Flow (Layout)
S3 Data Sources → Discovery & Classification Engine → Findings → Integrations
No comments:
Post a Comment