- clarify how data flows between components
- identify system entities and their interactions
- this is where you will get information to make informed
decisions on data management: storage, transportation,
encryption, partitioning
-Twitter example:
- User: userId, Name, Email, LastLogin, etc
- tweet: tweetID, content, numLikes, timestamp, etc
- userFollow: userID1, userID2, etc
- relational or non-relational database?
- what kind of block storage?
- where do you store which type of data?