ai_data_monitoring_reporing
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| ai_data_monitoring_reporing [2025/05/25 16:46] – [Purpose] eagleeyenebula | ai_data_monitoring_reporing [2025/05/25 16:50] (current) – [1. Monitoring Data Quality] eagleeyenebula | ||
|---|---|---|---|
| Line 45: | Line 45: | ||
| ===== Key Features ===== | ===== Key Features ===== | ||
| + | |||
| The **DataMonitoringReporting** module includes the following core features: | The **DataMonitoringReporting** module includes the following core features: | ||
| - | * **Data Monitoring Tools:** | + | * **Data Monitoring Tools: |
| 1. Detect missing values and calculate dataset coverage (% completeness). | 1. Detect missing values and calculate dataset coverage (% completeness). | ||
| - | * **Flexible Report Generation: | + | * **Flexible Report Generation: |
| 2. Automated string-based summary reports for processed datasets or workflows. | 2. Automated string-based summary reports for processed datasets or workflows. | ||
| - | * **Detailed Logging:** | + | * **Detailed Logging: |
| 3. Logs all actions, including data quality checks and report generation results, for thorough traceability. | 3. Logs all actions, including data quality checks and report generation results, for thorough traceability. | ||
| - | * **Integration-Ready: | + | * **Integration-Ready: |
| 4. Easily integrates into existing pipelines as a monitoring or reporting component. | 4. Easily integrates into existing pipelines as a monitoring or reporting component. | ||
| - | * **Customizable Reporting Templates: | + | * **Customizable Reporting Templates: |
| 5. Can be extended to generate reports in various formats like JSON, HTML, or Markdown. | 5. Can be extended to generate reports in various formats like JSON, HTML, or Markdown. | ||
| Line 68: | Line 69: | ||
| The **DataMonitoringReporting** class provides two core methods: | The **DataMonitoringReporting** class provides two core methods: | ||
| - | 1. **monitor_data_quality(data): | + | * **monitor_data_quality(data): |
| - | + | | |
| - | This monitors | + | |
| - | + | ||
| - | 2. **generate_report(data): | + | |
| - | This generates | + | * **generate_report(data): |
| + | Generates | ||
| - | The workflow is as follows: | + | **The workflow is as follows:** |
| - | * Pass data into | + | |
| - | * **monitor_data_quality** | + | |
| - | To receive a structured dictionary containing monitored results (e.g., missing value count, coverage percentage). Use **generate_report** to create a human-readable string report based on the findings or processed data. | + | * Pass data into **monitor_data_quality** to receive a structured dictionary containing monitored results |
| + | * Use **generate_report** to create a human-readable string report based on the findings or processed data. | ||
| ==== 1. Monitoring Data Quality ==== | ==== 1. Monitoring Data Quality ==== | ||
| Line 86: | Line 84: | ||
| * **Missing Data:** Identifies **None** or **NaN** values in the dataset. | * **Missing Data:** Identifies **None** or **NaN** values in the dataset. | ||
| * **Total Data Points:** Counts the overall size of the dataset. | * **Total Data Points:** Counts the overall size of the dataset. | ||
| - | * **Coverage Percentage: | + | * **Coverage Percentage: |
| The output is a dictionary summarizing quality statistics: | The output is a dictionary summarizing quality statistics: | ||
ai_data_monitoring_reporing.1748191606.txt.gz · Last modified: 2025/05/25 16:46 by eagleeyenebula
