There is a pattern in microservices architecture: Command and Query Responsibility Segregation (CQRS). This pattern helps to design multi-purpose data lake.
Read moreTag: design
Book notes – Release It! Second Edition
You can buy this book by amazon.com.
Read moreMeta data service and schema registry in data lake
Maintaining data description is useful feature. There are some ideas, how to implement this.
Read moreORM (object-relational mapping) analog for data in data lake
We start saving data in HDFS using avro format. In previous post we have discussed about forward and backward compatibility of avro schemas. How to use this concept?
Read moreBook notes – Microservice patterns: with examples in java
Two infrastructure layers for distributed systems
It looks like, that separation between two infrastructure layers is increasing.
Read more
Why contracts are important in data intensive applications with microservices
Main purpose of using microservices architecture is to increase velocity of development and reduce system complexity.
Read more
Book notes – Kubernetes: Up and Running: Dive into the Future of Infrastructure
Hybrid cloud architecture for data lake applications
Big data technologies nowadays are very mature. Typically you use HDFS, or another distributed file systems, like S3, for storing data, Spark as a processor engine, and YARN as a resource manager. Next steps, wich you probably would like to achieve, are implement CI/CD (continuous integration and delivery) and move workload on demand in cloud.
Read more