Vector Search | InterSystems Developer Community

Article

Renato Banzai · Jul 17, 2020 3m read

Using Machine Learning to Organize the Community - 2

This is the second post of a series explaining how to create an end-to-end Machine Learning system.

Exploring Data

The InterSystems IRIS already has what we need to explore the data: an SQL Engine! For people who used to explore data in
csv or text files this could help to accelerate this step. Basically we explore all the data to understand the intersection
(joins) which should help to create a dataset prepared to be used by a machine learning algorithm.

#IntegratedML #Machine Learning (ML) #Python #Unstructured Data #Vector Search #InterSystems IRIS

Open Exchange app

1 0

1 361

Article

Niyaz Khafizov · Oct 8, 2018 16m read

Record linkage using InterSystems IRIS, Apache Zeppelin, and Apache Spark

Hi all. We are going to find duplicates in a dataset using Apache Spark Machine Learning algorithms.

Note: I have done the following on Ubuntu 18.04, Python 3.6.5, Zeppelin 0.8.0, Spark 2.1.1

Introduction

In previous articles we have done the following:

#Artificial Intelligence (AI) #Analytics #Beginner #Machine Learning (ML) #Python #Vector Search #InterSystems IRIS

0 0

1 770

Article

Niyaz Khafizov · Jul 19, 2018 4m read

K-Means clustering of the IRIS Dataset

Hi all. Today we are going to use k-means algorithm on the Iris Dataset.

Note: I have done the following on Ubuntu 18.04, Apache Zeppelin 0.8.0, python 3.6.5.

#Artificial Intelligence (AI) #API #Beginner #Machine Learning (ML) #Python #Vector Search #InterSystems IRIS

6 0

3 9.8K

Article

David E Nelson · Mar 9, 2017 9m read

Machine Learning with Spark and Caché

Apache Spark has rapidly become one of the most exciting technologies for big data analytics and machine learning. Spark is a general data processing engine created for use in clustered computing environments. Its heart is the Resilient Distributed Dataset (RDD) which represents a distributed, fault tolerant, collection of data that can be operated on in parallel across the nodes of a cluster. Spark is implemented using a combination of Java and Scala and so comes as a library that can run on any JVM.

#Artificial Intelligence (AI) #Analytics #Big Data #JDBC #Machine Learning (ML) #Python #Vector Search #Caché

11 5

1 2.8K

Question

Benjamin Eriksson · Mar 14, 2016

[Research] iKnow and algorithms.

Hello!

My group and I are currently doing a research project on natural language processing and iKnow plays a big role in this project. I am aware that the algorithms iKnow use aren't public, and I respect that.

My question is, are there any public documents/research that explains, at least part of, the algorthims iKnow uses and the motivations for using them?

#Analytics #Vector Search #InterSystems Natural Language Processing (NLP, iKnow)

1 2

0 464