https://docs.aws.amazon.com/wellarchitected/latest/framework/wellarchitected-framework.pdf#welcome
Pillars
O - S - R - P -C - S
Operational Excellence -
- ability to support
- gain insights
- Protect Data
- Systems
Reliability
- Perform its indented function when it's expected to
- ability to use computing resources efficiently
- Lowest price point
- improve sustainability impacts by reducing energy consumption
- increasing efficiency by provisioned resources
Terms :
component
code/configuration and resources - deliver against requirement
Workload
set of component that deliver business value
Architecture
how component work together in a workload
Milestone
key changes in architecture as it evolves thru the product lifecycle
Technology portfolio
collection of workload to operate business
On Architecture : central team - not recommended by AWS
- Technical Architect (infrastructure),
- Solutions Architect (software),
- Data Architect,
- Networking Architect,
- Security Architect
Instead :
AWS Distribute the capabilities rather than having central team
- Have practices (ways of doing things, process, standard) - put experts
- implement mechanisms that carry out automated checks to ensure standards are being met
“Good intentions never work, you need good mechanisms to make anything happen” — Jeff Bezos.
Leadership principles : https://www.amazon.jobs/en/principles?ref=wellarchitected-wp
Principles
- Stop guessing your capacity needs
- Test systems at production scale:
- Automate to make architectural experimentation easier
- Allow for evolutionary architectures
- Drive architectures using data
- Improve through game days:
Operational Excellence
- Perform operations as code
- implement your operations procedures as code and automate their execution by triggering them in response to events
- Make frequent, small, reversible change
- Organization • Prepare • Operate • Evolve
Security
- Implement a strong identity foundation
- Enable traceability
- Apply security at all layers
- Automate security best practices:
- Protect data in transit and at rest
- Keep people away from data
- Prepare for security events
- Security
- Identity and Access Management
- Detection
- Infrastructure Protection
- Data Protection
- Incident Response
Reliability
- Automatically recover from failure:
- Test recovery procedures
- Scale horizontally to increase aggregate workload availability
- Replace one large resource with multiple small resources to reduce the impact of a single failure
- Stop guessing capacity
- Manage change in automation
Performance Efficiency
- Compute - application design, usage patterns, and configuration settings
- Storage - (block, file, or object), patterns of access (random or sequential), required throughput, frequency of access (online, offline, archival), frequency of update (WORM, dynamic)
- Database - availability, consistency, partition tolerance, latency, durability, scalability, and query capability
- Network -Latency, throughput requirements, jitter, and bandwidth
Cost Optimization
- Practice Cloud Financial Management
- Expenditure and usage awareness
- Cost-effective resources
- Manage demand and supply resources
- Optimize over time
Sustainability
- Region selection
- User behavior patterns
- Software and architecture patterns
- Data patterns
- Hardware patterns
- Development and deployment process