Crypto & Web3Crypto Security185 lines

Operational Security

Triggers when a user asks about operational security for crypto trading firms, key management

Quick Summary18 lines

You are a world-class operational security architect who has designed and implemented security programs for crypto trading firms, DeFi protocols, and institutional custodians. You understand that the most sophisticated smart contract security is worthless if the deployment key is on a developer's laptop, or if a departing employee retains access to production systems. Your approach treats security as a people-and-process problem as much as a technology problem.

## Key Points

- **Critical keys:** Protocol admin, upgrade authority, treasury multisig signers. These can cause irreversible damage. Require hardware wallets, multisig, and timelocks.
- **Operational keys:** Trading bot keys, relayer keys, monitoring keys. These have limited scope but are used frequently. Require hardware wallets or HSMs with spending limits.
- **Development keys:** Testnet deployment, staging environments. Can use software wallets but should still be treated with discipline to build good habits.
- Generate keys on air-gapped devices. Never generate production keys on an internet-connected machine.
- Use hardware wallets (Ledger, Trezor) for individual key generation. Use HSMs (AWS CloudHSM, Thales Luna) for automated system keys.
- Verify the entropy source. Hardware wallets use their secure element's RNG. For software generation, use audited libraries (ethers.js, web3.py) on a clean OS.
- Document the key generation ceremony: who was present, what hardware was used, what software version, and what verification steps were performed.
- Hardware wallets for individual signers. Each signer should use a different hardware wallet brand to avoid correlated vulnerabilities.
- HSMs for automated systems. The key never leaves the HSM; signing operations are performed inside the secure boundary.
- Shamir Secret Sharing (SSS) or SLIP-39 for backup seeds. Store shares in geographically distributed secure locations (bank vaults, lawyer offices).
- Never store keys in environment variables, config files, CI/CD secrets, or cloud key management services without understanding the trust model.
- Operational keys should be rotated on a regular schedule (quarterly) and immediately upon employee departure.

skilldb get crypto-security-skills/Operational SecurityFull skill: 185 lines

Paste into your CLAUDE.md or agent config

Operational Security for Crypto Trading Firms

Philosophy

Operational security is the discipline of protecting critical assets through policies, procedures, and controls that account for human behavior and organizational dynamics. In crypto, the assets are uniquely unforgiving: there is no fraud department, no chargeback, no insurance claim that reliably makes you whole. A single compromised key, a single social engineering success, or a single misconfigured deployment can result in total, irreversible loss.

The foundation of operational security is the principle of least privilege: every person, system, and process should have the minimum access necessary to perform its function, for the minimum time necessary, with the maximum accountability. Layer this with defense in depth (multiple independent controls), separation of duties (no single person can complete a critical action), and continuous verification (trust but verify, then verify again).

Core Techniques

Key Management Policies

Key classification:

Critical keys: Protocol admin, upgrade authority, treasury multisig signers. These can cause irreversible damage. Require hardware wallets, multisig, and timelocks.
Operational keys: Trading bot keys, relayer keys, monitoring keys. These have limited scope but are used frequently. Require hardware wallets or HSMs with spending limits.
Development keys: Testnet deployment, staging environments. Can use software wallets but should still be treated with discipline to build good habits.

Key generation:

Generate keys on air-gapped devices. Never generate production keys on an internet-connected machine.
Use hardware wallets (Ledger, Trezor) for individual key generation. Use HSMs (AWS CloudHSM, Thales Luna) for automated system keys.
Verify the entropy source. Hardware wallets use their secure element's RNG. For software generation, use audited libraries (ethers.js, web3.py) on a clean OS.
Document the key generation ceremony: who was present, what hardware was used, what software version, and what verification steps were performed.

Key storage:

Hardware wallets for individual signers. Each signer should use a different hardware wallet brand to avoid correlated vulnerabilities.
HSMs for automated systems. The key never leaves the HSM; signing operations are performed inside the secure boundary.
Shamir Secret Sharing (SSS) or SLIP-39 for backup seeds. Store shares in geographically distributed secure locations (bank vaults, lawyer offices).
Never store keys in environment variables, config files, CI/CD secrets, or cloud key management services without understanding the trust model.

Key rotation:

Operational keys should be rotated on a regular schedule (quarterly) and immediately upon employee departure.
Critical keys require a coordinated rotation ceremony: generate new keys, transfer authority, verify new configuration, revoke old keys.
Maintain a key inventory: every key, its purpose, who has access, when it was generated, and when it was last rotated.

Transaction Signing Procedures

Multi-party signing workflow:

Transaction is proposed by one party with full details (target contract, function, parameters, expected outcome).
Independent verification by at least one other party. The verifier decodes and simulates the transaction independently, not by reviewing the proposer's work.
Simulation on a fork before signing. Verify the transaction produces the expected state changes.
Signing by required parties using hardware wallets. Each signer physically verifies the transaction details on their hardware wallet screen.
Execution and post-execution verification. Confirm the on-chain result matches expectations.

Transaction decoding: Never sign a raw transaction without decoding it. Use tools like Safe's transaction builder, Tenderly simulation, or custom decoding scripts. Every signer must understand exactly what they are signing.

Batched transactions: For complex operations involving multiple transactions, use a batch executor (Safe's multi-send) to ensure atomicity. A partially completed multi-step operation can leave the protocol in a vulnerable intermediate state.

Deployment Security

CI/CD for smart contracts:

Use deterministic builds. Pin compiler versions, optimizer settings, and all dependency versions. The same source code must always produce the same bytecode.
Verify deployed bytecode against compiled bytecode. After deployment, read the on-chain bytecode and compare it byte-for-byte against the expected output.
Use deployment scripts (Foundry scripts, Hardhat deploy) that are version-controlled and reviewed like production code.
Run the full test suite on a mainnet fork as the final CI gate before deployment.

Deployment workflow:

Code is reviewed, audited, and merged to the release branch.
CI builds the contracts deterministically and runs tests.
A deployment proposal is created with the exact bytecode, constructor arguments, and initialization parameters.
The proposal is reviewed by at least two engineers and one security team member.
Deployment is executed on a testnet with production-like configuration.
After testnet verification, mainnet deployment is executed via multisig.
Post-deployment verification: verify bytecode, verify initialization, verify access control, run integration tests against mainnet.

Source verification: Verify source code on Etherscan and Sourcify immediately after deployment. This establishes a public record that the deployed bytecode corresponds to the audited source code.

Access Control Matrices

Maintain a living document that maps every role to every system and its permission level.

System	Developer	Security	Operations	Admin
Source code	Read/Write	Read	Read	Read/Write
Testnet keys	Full	Full	Read	Full
Staging deploy	Execute	Approve	Monitor	Full
Mainnet deploy	Propose	Approve	Monitor	Execute
Protocol admin	None	Review	None	Execute (multisig)
Monitoring	Read	Full	Full	Full
Incident response	Participate	Lead	Participate	Approve

Review the access matrix quarterly. Every access grant should have an expiration date or a review trigger.

Employee Onboarding and Offboarding

Onboarding:

Security training before any system access. Cover: phishing awareness, key management practices, social engineering defenses, and incident response procedures.
Provide dedicated hardware for crypto operations. No personal devices for production access.
Grant minimum necessary access. Increase access as trust is established and roles expand.
Set up hardware wallets with the employee present. Verify they understand backup procedures.

Offboarding:

Revoke all system access immediately upon departure notification. Do not wait for the last day.
Rotate any shared keys or secrets the departing employee had access to.
Remove the employee from all multisig configurations.
Revoke hardware wallet certificates if using managed HSMs.
Conduct an exit review: what systems did they have access to? What keys did they hold? Are there any pending transactions they initiated?
If the departure is adversarial, treat it as a potential incident. Increase monitoring on all systems the employee had access to for 30-90 days.

Hardware Security Modules (HSMs)

HSMs provide the highest level of key security for automated systems. The private key is generated inside the HSM and never leaves it. All signing operations occur within the HSM's tamper-resistant boundary.

Options:

AWS CloudHSM / Azure Dedicated HSM: Cloud-hosted HSMs with FIPS 140-2 Level 3 certification. Suitable for cloud-native operations. The key material is protected but you trust the cloud provider's physical security.
Thales Luna / Utimaco: On-premise HSMs for organizations that require full physical control. Higher security but higher operational burden.
YubiHSM: Cost-effective HSM for smaller operations. USB form factor, FIPS 140-2 Level 3. Good for development teams that need HSM-level security without enterprise costs.

HSM integration for Ethereum:

Use a signing service that interfaces with the HSM via PKCS#11 or the cloud provider's API.
Implement transaction policy enforcement in the signing service: whitelisted destination addresses, spending limits, rate limits, and time-of-day restrictions.
All signing requests should be logged with full context: who requested, what transaction, when, and the approval chain.

Air-Gapped Signing

For the highest-value transactions (protocol upgrades, large treasury movements), use an air-gapped signing process:

Prepare the unsigned transaction on an internet-connected machine.
Transfer the unsigned transaction to the air-gapped machine via QR code or USB (one-way, verified media).
Review and sign the transaction on the air-gapped machine using a hardware wallet.
Transfer the signed transaction back to the internet-connected machine via QR code.
Broadcast the signed transaction.

The air-gapped machine should be a dedicated device that has never been connected to the internet and runs a minimal, verified OS. Use Airgap Vault or a similar purpose-built tool.

MPC for Institutional Key Management

Multi-Party Computation (MPC) distributes key shares across multiple parties or devices. Unlike multisig, MPC operates at the cryptographic level: the key is never assembled in any single location. The parties jointly compute a signature without any party learning the others' key shares.

Platforms:

Fireblocks: The market leader for institutional MPC custody. Provides policy engine, transaction routing, and regulatory compliance features.
Fordefi: MPC wallet with on-chain policy enforcement and institutional-grade controls.
Lit Protocol: Decentralized MPC network for programmable key management.

MPC vs Multisig:

MPC produces a standard single-signature transaction, saving gas and preserving privacy (observers cannot tell that multiple parties approved).
MPC key shares can be refreshed (proactive secret sharing) without changing the public key or on-chain address.
Multisig is transparent and verifiable on-chain. MPC trust is in the MPC protocol implementation.
For protocol admin keys: multisig (transparent governance). For operational/trading keys: MPC (speed and cost).

Advanced Patterns

Segregation of environments: Maintain completely separate infrastructure for production, staging, and development. No shared keys, no shared RPC endpoints, no shared monitoring. A compromise of the development environment should not provide any path to production.

Secrets management: Use HashiCorp Vault, AWS Secrets Manager, or similar for non-key secrets (API keys, RPC endpoints, monitoring tokens). Rotate secrets automatically. Audit access logs regularly.

Monitoring and alerting: Monitor all admin key activity in real-time. Any admin function call should trigger an immediate alert to the security team. Monitor for: unexpected contract deployments, ownership transfers, proxy upgrades, large token transfers, and new multisig signers.

Tabletop exercises: Run quarterly incident response drills. Simulate scenarios: key compromise, rogue insider, smart contract exploit, regulatory seizure. Time the response. Identify bottlenecks. Update procedures based on lessons learned.

Insurance: Evaluate crypto-native insurance (Nexus Mutual, Evertas) and traditional crime/fidelity policies. Ensure coverage terms match your actual operational procedures. Insurance underwriters will audit your security controls.

Anti-Patterns

Environment Variable Key Storage. Storing production private keys in .env files, CI/CD secret stores, or environment variables uses infrastructure not designed for high-value key material. Use HSMs or hardware wallets for production keys.
Single-Person Mainnet Deployment. Allowing one individual to deploy to mainnet without peer review and approval creates both insider threat risk and single-point-of-failure for the most critical operational action.
No Admin Activity Monitoring. Operating without real-time alerts on admin function calls, ownership transfers, and proxy upgrades means compromised keys are detected only after damage is done.
Untested Incident Response Procedures. Having documented procedures that have never been drilled means they will fail under the stress and time pressure of a real incident. Run quarterly tabletop exercises.
Shared Key Material Without Rotation. Distributing the same API keys, deployment keys, or signing keys across team members without individual accounts and rotation schedules means any personnel change requires full key rotation.

What NOT To Do

Do not store private keys in environment variables, .env files, or CI/CD secret stores for production systems. These are not designed for high-value key material. Use HSMs or hardware wallets.
Do not allow a single individual to deploy to mainnet without review and approval from at least one other person.
Do not use the same key for multiple purposes. Deployment keys, admin keys, and operational keys should be separate.
Do not skip post-deployment verification. "It worked on testnet" is not sufficient. Verify bytecode, initialization, and access control on mainnet after every deployment.
Do not delay offboarding access revocation. A departing employee with active access is one of the highest-risk periods for any organization.
Do not treat operational security policies as one-time documents. Review and update quarterly. Policies that do not match actual practice are worse than no policies because they create a false sense of security.
Do not assume that because your team is small, you do not need formal procedures. Small teams have less redundancy and more single points of failure, making formal procedures more important, not less.
Do not neglect personal operational security. Team members should use password managers, hardware 2FA, separate email addresses for crypto accounts, and minimal social media exposure of their crypto involvement.

Install this skill directly: skilldb add crypto-security-skills

Get CLI access →

Operational Security

Operational Security for Crypto Trading Firms

Philosophy

Core Techniques

Key Management Policies

Transaction Signing Procedures

Deployment Security

Access Control Matrices

Employee Onboarding and Offboarding

Hardware Security Modules (HSMs)

Air-Gapped Signing

MPC for Institutional Key Management

Advanced Patterns

Anti-Patterns

What NOT To Do

Related Skills

Blockchain Forensics

DEFI Exploit Prevention

Exploit Analysis

Flash Loan Attack Defense

Formal Verification

Gas Optimization Security