big data, hadoop, kafka, hdfs, confluent, mongodb, hbase, cassandra, java, scala, python, aws, azure