Skip to main content
  1. Data Science, AI & Machine Learning/

Big Data

·153 words·1 min· loading · ·
ML Courses Machine Learning ML Courses

Big Data

Big Data Analytics
#

Big Data Systems
#

  1. What is Big Data
  2. Data Warehouse, Data Lakes
  3. Hadoop – Components
  4. Storage – HDFS, Hbase
  5. Resource Manager (MapReduce, YARN)
  6. Types of data formats (JSON, ORC, Parquet, AVRO)
  7. Scripting  (Hive, Pig)
  8. Stream Processing
  9. Massive Parallel Processing (Spark, Imapala, Mahout)
  10. RDDs in Spark
  11. Data Migration (Scoop/ Flume)
  12. Schedular (Oozie)
  13. Resource Negotiator (Zookeeper)
  14. RDBMS Database
  15. Columnar Database
  16. Multimodel Database
  17. NoSQL (HBase, Cassandra, MongoDB, DynamoDB)
  18. RDBMS (MySQL, PostgreSQL)
  19. CosmoDB
  20. In memory database (Redis)
  21. Spark SQL
  22. Case Study

Stream Processing & Analytics
#

  1. Real Time Streaming Architecture
  2. Service Configuration and Coordination
  3. Data Flow Management, Storing and Processing Streaming Data
  4. Visualization Techniques for Real Time Streaming Data
  5. Aggregation (Timed Counting, Multi Resolution Time Series Aggregation)
  6. Statistical Approximation
  7. Approximating with sketches

PySpark
#

  1. Overview & Installation.
  2. RDD
  3. Dataframe.
  4. Architecture.
  5. MLLib
  6. NLP
  7. Linear regression
  8. Logistic regression
  9. Decision tree
  10. Naive Bayes
  11. XGBoost
  12. Timeseries
  13. Spark Job automation with Scheduler
  14. NYC Parking Case Study: Apache Spark

Related

AI Agent Building
·801 words·4 mins· loading
ML Courses AI Agents Large Language Models LangChain Production AI Agentic AI
AI Agent Building Course # Everyone can say “we should build AI agents.” Very few teams can define …
Technology Board Advisory
·746 words·4 mins· loading
Board Advisory Technology Governance Independent Director
Technology Board Advisory Service # Boards do not need another slide on “what is generative AI.” …
AI for Prospective Email Writing
·491 words·3 mins· loading
ML Courses TensorFlow Lite Android Development
AI for Prospective Email Writing # Course Objective # Equip participants with the skills to draft …
GenAI for Cybersecurity
·483 words·3 mins· loading
ML Courses TensorFlow Lite Android Development
Generative AI for Cybersecurity # Course Overview: This 3–4 day hands-on workshop introduces how …
Train Tensorflow Lite Models for Android
·850 words·4 mins· loading
ML Courses TensorFlow Lite Android Development
Developing Solutions with Agentic AI # Course Outline # Module 1: Introduction to Agentic AI # 1.1 …