Course Outline
Introduction
- Apache Arrow vs Parquet
Installing and Configuring Apache Arrow
Overview of Apache Arrow Features and Architecture
Exploring Data with Pandas and Apache Arrow
Exploring Data with Spark and Apache Arrow
Exploring Data with R and Apache Arrow
Exploring Data with MapD and Apache Arrow
Other Data Analysis Integrations
- PySpark, Parquet files on S3, Oracle tables, and Elasticsearch indices
Troubleshooting
Summary and Conclusion
Requirements
- A basic understanding of SQL
- Familiarity with Python or R
- Some familiarity with Apache Spark
Testimonials (5)
All the topics he covered, including the examples. He also explained how they are helpful in our daily job.
madduri madduri - Boskalis Singapore Pte Ltd
Course - QGIS for Geographic Information System
I liked Pablo's style, the fact that he covered a lot of subjects, from report design and customization with HTML to implementing simple ML algorithms. Good balance of theoretical information and exercises. Pablo really covered all the topics I was interested in and gave comprehensive answers to my questions.
Cristian Tudose - SC Automobile Dacia SA
Course - Advanced Data Analysis with TIBCO Spotfire
Actual application of Spotfire and all basic functions.
Michael Capili - STMicroelectronics, Inc.
Course - Introduction to Spotfire
The thing I liked the most about the training was the organization and the location.
Hamid Tuama - Ability with Innovation General Contracting (DMCC Branch)
Course - ArcGIS for Spatial Analysis
I genuinely enjoyed the many labs and practice exercises.