{{theTime}}

Search This Blog

Total Pageviews

What is Apache Spark?

- Fast, Scalable distributed processing engine
- Provides high level API for in memory processing and significant performance improvements over Hadoop MapReduce
- Spark SQL integrates with structued or tabular data
- Steams for processing streaming data in real time
- MLlib machine learning and GraphX for processing graphs
- It's written in Scala (JVM language)
- It supports Java, Python and Ruby

No comments:

gradlew : The term 'gradlew' is not recognized as the name of a cmdlet, function, script file , or operable program

The command gradlew was not found, but does exist in the current location.  Windows PowerShell does not load commands from the current locat...