Key Steps of Data Ingestion
data ingestion involves the following key functions:
- Data collection: Before performing any analysis, you have to first collect the required data. This is the stage at which you analyse the sources from where the data has to be imported based on your requirement.
- Data validation: Once you prioritise the resources and collect the data, you have to validate it to filter out unwanted data.
- Data routing: Next, you have to route the validated data to its particular destination, such as HBase, HDFS, Hive or some other system, where it will be further analysed.
Only after the completion of these steps, can you say that data ingestion is complete. After these steps, you can use your BI (business intelligence) and BA (business analytics) tools to derive insights or simply use other data processing tools to get the desired output.