(1).png)
IRIS is an excellent option for machine learning projects that involve massive data operations, for the following reasons:
1. Supports sharding to scale the data repository horizontally, just like MongoDB.
2. Supports the creation of analytical cubes, which, combined with sharding, lets you handle large data volumes with good performance.
3. Supports collecting data on a scheduled basis or in real time with a variety of data adapter options.
4. Allows the entire deduplication process to be automated using logic written in Python or ObjectScript.
5.
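
As a rough illustration of the deduplication logic mentioned in item 4, here is a minimal sketch in plain Python. It is not an IRIS API: the `normalize` and `deduplicate` helpers and the field names are hypothetical, and the approach (exact match on normalized key fields, keeping the first occurrence) is just one simple strategy that could be adapted to run inside IRIS.

```python
import re
import unicodedata


def normalize(value: str) -> str:
    """Lowercase, strip accents, and collapse whitespace so near-identical values compare equal."""
    decomposed = unicodedata.normalize("NFKD", value)
    without_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return re.sub(r"\s+", " ", without_accents).strip().lower()


def deduplicate(records, key_fields):
    """Return records with duplicates removed, keeping the first occurrence.

    Two records count as duplicates when all of their key fields match
    after normalization. Both helpers are hypothetical, for illustration only.
    """
    seen = set()
    unique = []
    for record in records:
        key = tuple(normalize(str(record.get(field, ""))) for field in key_fields)
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique


# Sample data (invented for this sketch)
records = [
    {"name": "José  Silva", "email": "JOSE@example.com"},
    {"name": "jose silva", "email": "jose@example.com "},
    {"name": "Ana", "email": "ana@example.com"},
]
unique_records = deduplicate(records, ["name", "email"])
```

Exact matching on normalized keys is cheap and predictable; fuzzy matching (for example, edit distance on names) would catch more duplicates at the cost of more computation and possible false positives.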




