What kinds of things are revealed?
- Lock contention
- Inefficient algorithms like zlib inflate
- MSRs / slow CPU features
- Serialization through a CPU
- Copy to/from user
- Page faults
- Excessive mmap/munmap (from memory allocations)
- Opportunities to short-cut through the stack