A practice for Apache families (Hadoop / HDFS / Yarn / Spark / Kafka / Zookeeper/Flink)
Recently I have been frequently using the data processing frameworks from Apache (Hadoop/HDFS/Yarn/Spark/Kafka/ZooKeeper). And I designed a comprehensive project that uses all of them in one project and the purpose is to analyze the crime distribution in LA. I would like to have a brief summary here and move on to other techs. I think the most fundamental and earliest framework is Hadoop. Hadoop... 1935
The steps you take don't need to be big; they just need to take you in the right direction
My Github name has been changed to `Shaofanl`! Goodbye `piscesdream` and fight on!
Ready to build something cool at Google!!!
Feel free to discuss anything related to the posts. If you successfully implement some of my ideas, please notify me. It is not obligated, but it can help me further extending those ideas.