| S.No | Topic | Sub-Topics |
| --- | --- | --- |
| 1 | Apache Iceberg | Data Lake vs Data Warehouse, Table Format Concept, Iceberg Architecture Overview, Iceberg vs Hive Tables |
| 2 | Iceberg Ecosystem | Iceberg with Apache Spark, Apache Flink, Trino/Presto, Hive, Databricks |
| 4 | Iceberg Table Concepts | Table Metadata, Data Files, Manifest Files, Manifest Lists, Snapshot Concept |
| 5 | Catalogs in Iceberg | Hadoop Catalog, Hive Metastore Catalog, REST Catalog, Glue Catalog, Catalog Configuration |
| 6 | Creating Iceberg Tables | SQL Table Creation, Spark API Table Creation, Table Properties, Table Location, Schema Definition |
| 7 | Schema Evolution | Add Columns, Rename Columns, Delete Columns, Change Column Types, Backward Compatibility |
| 8 | Partitioning | Hidden Partitioning, Partition Transforms, Partition Spec Evolution, Partition Pruning, Best Practices |
| 9 | Data Operations | Insert Data, Overwrite Data, Append Mode, Batch Writes, Streaming Writes |
| 10 | Querying Iceberg Tables | SQL Queries, Spark SQL Queries, Filtering Data, Aggregation Queries, Query Optimization |
| 11 | Time Travel | Snapshot IDs, Querying Past Data, Snapshot Retention, Use Cases, SQL Examples |
| 12 | Snapshots | Snapshot Creation, Snapshot Metadata, Snapshot Expiration, Snapshot Rollback, Snapshot History |
| 13 | Data Versioning | Versioned Tables, Commit Operations, Metadata Tracking, Branching Concepts, Tagging Data Versions |
| 14 | ACID Transactions | Atomic Operations, Consistency Guarantees, Isolation Levels, Concurrent Writes, Failure Recovery |
| 15 | File Formats | Parquet Support, ORC Support, Avro Support, Compression Options, File Size Optimization |
| 16 | Manifest Files | Manifest Structure, Metadata Tracking, File Listings, Performance Benefits, Manifest Merging |
| 17 | Metadata Management | Metadata Files, Metadata Evolution, Metadata Size Optimization, Metadata Cleanup, Metadata Storage |
| 18 | Data Compaction | Small File Problem, Compaction Strategies, Rewrite Data Files, Optimize Table, Scheduling Compaction |
| 19 | Delete Operations | Equality Deletes, Position Deletes, Delete Files, Row-Level Deletes, Delete Performance |
| 20 | Update Operations | Row Updates, Update Strategies, Merge Statements, Update Performance, Handling Conflicts |
| 21 | Merge Operations | MERGE INTO Syntax, Upserts, CDC Processing, Incremental Updates, Merge Optimization |
| 22 | Streaming Integration | Spark Structured Streaming, Flink Streaming, Kafka Integration, Streaming Writes, Streaming Reads |
| 23 | Performance Optimization | Partition Design, File Size Tuning, Metadata Optimization, Query Planning, Caching Strategies |
| 24 | Security | Access Control, Authentication, Authorization, Data Encryption, Governance Policies |
| 25 | Monitoring | Query Monitoring, Table Metrics, Logging, Alerts Setup, Performance Tracking |
| 26 | Iceberg Maintenance | Expire Snapshots, Remove Orphan Files, Rewrite Manifests, Table Optimization, Maintenance Scheduling |
| 27 | Integration with Data Lakes | AWS S3 Storage, Azure Data Lake Storage, Google Cloud Storage, HDFS Storage, Hybrid Storage |
| 28 | Comparison | Iceberg vs Delta Lake, Iceberg vs Apache Hudi, Performance Comparison, Feature Comparison, Use Case Differences |
| 29 | Production Best Practices | Data Layout Design, Partition Strategies, Governance, Performance Monitoring, Disaster Recovery |
| 30 | Real World Use Cases | Data Lakehouse Architecture, Incremental Data Pipelines, CDC Data Processing, Data Warehousing, Machine Learning Data Pipelines |