News

Spark SQL, part of Apache Spark, is used for structured data processing by running SQL queries on Spark data. Srini Penchikala discusses the Spark SQL module and how it simplifies data analytics using SQL.
Apache Flink has included SQL functionality since version 1.1, which introduced a SQL API based on Apache Calcite as well as a Table API. While the combined SQL and Table API today provides ...
Fast, flexible, and developer-friendly, Apache Spark is the leading platform for large-scale SQL, batch processing, stream processing, and machine learning.
Apache Spark has released version 1.3. The main improvements are the addition of the DataFrames API, a more mature Spark SQL, as well as a number of new methods added to ...
The Apache Software Foundation extends the reach of Drill, its tool for launching SQL queries, to include a wider range of platforms.
Databricks, the primary commercial steward behind the popular open source Apache Spark project, published a new report indicating the technology is still red-hot, driven by more use of SQL, streaming ...
Apache Phoenix is a relatively new open source Java project that provides a JDBC driver and SQL access to Hadoop’s NoSQL database: HBase. It was created as an internal project at Salesforce ...
Apache Hive's Stinger Initiative is a three-phase project aimed at improving the SQL interface to Hadoop.
The Apache Drill framework aims to provide just such a SQL engine. Drill can operate across multiple distributed data stores such as HDFS or Amazon S3, relational databases that support JDBC or ODBC, ...
Apache Kafka is a key component in data pipeline architectures when it comes to ingesting data. Confluent, the commercial entity behind Kafka, wants to leverage this position to become a platform ...