| Both sides previous revision Previous revision | |
| en:praktyka [2026/01/23 20:05] – przemyslaw_baczek | en:praktyka [2026/01/23 20:07] (current) – przemyslaw_baczek |
|---|
| **Table 1.** Overview of the most popular KMS systems for Big Data and their features. | **Table 1.** Overview of the most popular KMS systems for Big Data and their features. |
| ^ KMS ^ Type ^ Features ^ | ^ KMS ^ Type ^ Features ^ |
| |Hadoop with ecosystem (HDFS, Hive, Spark)|Open Source|<WRAP justify>It stores and processes vast amounts of data, enabling analysis and knowledge extraction from logs, IoT data, and social media.</WRAP>| | |Hadoop with ecosystem (HDFS, Hive, Spark)|Open Source|<WRAP justify>Stores and processes vast amounts of data, enabling analysis and knowledge extraction from logs, IoT data, and social media.</WRAP>| |
| |Apache Solr|Open Source|<WRAP justify>It indexes vast collections of data and documents, supports NLP, classification, and recommendations. It creates something like Google within the organization. It is a tool for content searching, not for management.</WRAP>| | |Apache Solr|Open Source|<WRAP justify>Indexes vast collections of data and documents, supports NLP, classification, and recommendations. Also creates something like Google within the organization. It's a tool for content searching, not for management.</WRAP>| |
| |Apache Atlas|Open Source|<WRAP justify>Data management within the organization, audits, documenting data flows, and data cataloging. It enables integration with Hadoop (storage), Hive (searching), Kafka (streaming), and Spark (processing). It is a tool for data knowledge.</WRAP>| | |Apache Atlas|Open Source|<WRAP justify>Data management within the organization, audits, documenting data flows, and data cataloging. It enables integration with Hadoop (storage), Hive (searching), Kafka (streaming), and Spark (processing). It's a tool for data knowledge.</WRAP>| |
| |Elastic Stack|Open Source|<WRAP justify> A highly advanced knowledge search engine, text analysis, logs, documents, and knowledge dashboards. It enables fast searching and visualization of knowledge.</WRAP>| | |Elastic Stack|Open Source|<WRAP justify> A highly advanced knowledge search engine, text analysis, logs, documents, and knowledge dashboards. Enables fast searching and visualization of knowledge.</WRAP>| |
| |Databricks|Commercial|<WRAP justify> Data warehouse, supports ML, NLP, and predictive analysis. Enables generating knowledge from data.</WRAP>| | |Databricks|Commercial|<WRAP justify> Data warehouse, supports ML, NLP, and predictive analysis. Enables generating knowledge from data.</WRAP>| |
| |Microsoft Fabric/Azure Synapse|Commercial, Cloud|<WRAP justify>It integrates data, analytics, ML, and reporting. It allows building knowledge models and automatically generates knowledge from operational data.</WRAP>| | |Microsoft Fabric/Azure Synapse|Commercial, Cloud|<WRAP justify>Integrates data, analytics, ML, and reporting. It allows building knowledge models and automatically generates knowledge from operational data.</WRAP>| |
| |Google BigQuery|Commercial|<WRAP justify> Very fast searching across petabytes of data. Characterized by extremely rapid knowledge generation from Big Data.</WRAP>| | |Google BigQuery|Commercial|<WRAP justify> Very fast searching across petabytes of data. Characterized by extremely rapid knowledge generation from Big Data.</WRAP>| |
| |IBM Watson Knowledge Catalog|Commercial|<WRAP justify> It catalogs knowledge and data, automatically tags and classifies them. It is integrated with AI and ML. </WRAP>| | |IBM Watson Knowledge Catalog|Commercial|<WRAP justify> Catalogs knowledge and data, automatically tags and classifies them. It's integrated with AI and ML. </WRAP>| |
| |
| |