This is a group for anyone interested data engineering, data driven development, data processing, applied data science and building data driven applications.
A lot of good open source tools and techniques are continually be developed and improved. It is important to keep on learning about these tools.
We started this group to share knowledge and experience using these tools & frameworks. Our experience lies in building big data pipelines with Apache Spark in Scala and building applications connecting to these API's. We are not bound to specific tools, but our focus will be on tools like Spark, Hadoop, Airflow, Elasticsearch, Solr, Kafka, Flink, Storm, Cassandra, MLlib, Neo4j, etc, etc.