1. Home
  2. Disclaimer
  3. Required Tools
  4. Setup Workspace
  5. 1. Big Data Overview
    1. 1.1. Introduction
    2. 1.2. Job Opportunities
    3. 1.3. What is Data?
    4. 1.4. How does it help?
    5. 1.5. Types of Data
    6. 1.6. The Big V's
      1. 1.6.1. Variety
      2. 1.6.2. Volume
      3. 1.6.3. Velocity
      4. 1.6.4. Veracity
      5. 1.6.5. Other V's
    7. 1.7. Trending Technologies
    8. 1.8. Big Data Concerns
    9. 1.9. Big Data Challenges
    10. 1.10. Data Integration
    11. 1.11. Scaling
    12. 1.12. CAP Theorem
    13. 1.13. PACELC Theorem
    14. 1.14. Optimistic Concurrency
    15. 1.15. Eventual Consistency
    16. 1.16. Concurrent vs Parallel
    17. 1.17. GPL
    18. 1.18. DSL
    19. 1.19. Big Data Tools
    20. 1.20. NO Sql Databases
    21. 1.21. Learning Big Data means?
  6. 2. Developer Tools
    1. 2.1. Introduction
    2. 2.2. UV
    3. 2.3. Other Python Tools
  7. 3. Data Format
    1. 3.1. Introduction
    2. 3.2. CSV-TSV
    3. 3.3. JSON
    4. 3.4. Parquet
    5. 3.5. Arrow
    6. 3.6. Avro
    7. 3.7. YAML
    8. 3.8. Duck DB
  8. 4. Protocol
    1. 4.1. Introduction
    2. 4.2. HTTP
    3. 4.3. Monolithic Architecture
    4. 4.4. Statefulness
    5. 4.5. Microservices
    6. 4.6. Statelessness
    7. 4.7. Idempotency
    8. 4.8. REST API
    9. 4.9. API Performance
    10. 4.10. API in Big Data world
  9. 5. Advanced Python
    1. 5.1. Data Frames
    2. 5.2. Decorator
    3. 5.3. Unit Testing
    4. 5.4. Error Handling
    5. 5.5. Logging
  10. 6. Containers
    1. 6.1. CPU Architecture Fundamentals
    2. 6.2. Introduction
    3. 6.3. VMs or Containers
    4. 6.4. What Container does
    5. 6.5. Docker
    6. 6.6. Docker Examples
  11. 7. CICD
    1. 7.1. Introduction
    2. 7.2. CICD Tools
    3. 7.3. CI Yaml
    4. 7.4. CD Yaml
  12. 8. Data Engineering
    1. 8.1. Introduction
    2. 8.2. Batch vs Streaming
    3. 8.3. Kafka
      1. 8.3.1. Kafka use cases
      2. 8.3.2. Kafka Software
      3. 8.3.3. Python Scripts
      4. 8.3.4. Different types of streaming
    4. 8.4. Quality & Governance
    5. 8.5. Medallion Architecture
    6. 8.6. Data Engineering Model
    7. 8.7. Data Mesh
  13. 9. Cloud Computing
    1. 9.1. Introduction
    2. 9.2. Types of Cloud Services
    3. 9.3. Challenges of Cloud Computing
    4. 9.4. High Availability
    5. 9.5. Azure Cloud
      1. 9.5.1. Services
      2. 9.5.2. Storages
      3. 9.5.3. Demo
    6. 9.6. Terraform
  14. 10. CLI Tools
    1. 10.1. Introduction
    2. 10.2. Linux Commands 01
    3. 10.3. Linux Commands 02
    4. 10.4. AWK
    5. 10.5. CSV SQL
    6. 10.6. JQ
    7. 10.7. YQ
  15. 11. Miscellaneous
    1. 11.1. Additional Reading
    2. 11.2. Good Reads
    3. 11.3. Roadmap Data Engineer
    4. 11.4. Notebooks vs IDE
  16. Tags

Big Data Tools & Techniques

[Avg. reading time: 0 minutes]

Continuous Integration Continuous DeploymentVer 6.0.18