1. Intro
1.3. Predictive Modeling Functionalities
Anatella provides advanced text mining capabilities. Using the Anatella text mining operators, you can automatically:
- correct spelling mistakes (in your “address” fields, for example);
- translate text from one language to another;
- do “fuzzy matching”, for example to join two tables (based on multilingual sound encoding);
- classify texts (in combination with TIM).

These operators apply the classical “bag of words” technique to raw, unstructured text data to produce many new columns and variables that are directly exploitable inside TIM or Stardust for predictive analytics. You can easily enrich your datasets with unstructured data to obtain the highest predictive modeling accuracy. The Anatella predictive text mining functionalities are unique.

Graph mining or social network analysis (SNA)

This set of operators is mainly useful for telecommunication companies and banks, to create churn, cross-selling and up-selling predictive models, to estimate the share of wallet, etc. The objective of these operators is to extract valuable social metrics from the “phone communication network”. Typically, the “phone communication network” is defined in this way:
- each individual is a node;
- an “arc” of the network between two individuals A and B represents the relation “A called B”.

The social metrics that can be extracted from the network include:
- the best connected individual;
- the individual who plays the most important role in any group;
- the groups of friends;
- the proximity to a churner;
- the number of churners in the “neighborhood” of an individual.

Those metrics can improve the accuracy of your predictive models.

Operational research (OR) optimization toolbox

In particular, Anatella integrates:
- an efficient multi-threaded LP/IP solver that allows you to solve large-scale optimization problems. The LP solver handles millions of constraints and several thousands of variables;
- an efficient solver for the GAP (generalized assignment problem).

Typical GAP problems include: “Which product do I have to offer to which customer?” when you have the following business constraints:
- the stock of each product is limited;
- each selected customer receives a folder with N offers (no less and no more than N offers);
- the margin on each product is different.

The GAP solver included inside Anatella handles campaigns with several millions of customers and thousands of products. The Anatella optimization plugin is typically used for operational research (OR), sales & profit optimization, stock optimization, etc.

Modeling factory for large-scale predictive datamining projects

Some of our customers need to rebuild from scratch several thousands of predictive models every day or week. This can easily be accomplished using TIMi (as the analytical engine) and Anatella to supervise and manage the whole procedure in a 100% automated way. The multithreading capabilities of Anatella allow you to exploit all the CPUs in your server to deliver very high computing power.
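To make the "which product for which customer" problem from the optimization toolbox more concrete, here is a minimal Python sketch of its constraints (limited stock, exactly N offers per selected customer, differing margins). It is not Anatella's GAP solver and does not use any Anatella API: it is a brute-force search that only works on toy data. The function name `best_campaign`, the customer and product names, and the per-customer expected-margin table `value` (for example, a product margin weighted by a purchase-propensity score) are all assumptions made for illustration.

```python
from itertools import combinations, product as cartesian


def best_campaign(value, stock, n_offers):
    """Exhaustively search for the offer plan with the highest total value.

    value[c][p]: hypothetical expected margin of offering product p to customer c
    stock[p]:    maximum number of times product p can be offered
    n_offers:    a selected customer receives exactly this many distinct products
    """
    customers = sorted(value)
    products = sorted(stock)
    # A customer is either left out (empty tuple) or gets exactly n_offers products.
    options = [()] + list(combinations(products, n_offers))

    best_total = 0
    best_plan = {c: () for c in customers}
    for plan in cartesian(options, repeat=len(customers)):
        # Count how often each product is used and reject plans that exceed stock.
        used = {p: 0 for p in products}
        for offers in plan:
            for p in offers:
                used[p] += 1
        if any(used[p] > stock[p] for p in products):
            continue
        total = sum(value[c][p] for c, offers in zip(customers, plan) for p in offers)
        if total > best_total:
            best_total, best_plan = total, dict(zip(customers, plan))
    return best_total, best_plan


# Toy data (assumed): three customers, three products, folders of N = 2 offers.
value = {
    "alice": {"A": 10, "B": 4, "C": 3},
    "bob": {"A": 8, "B": 5, "C": 1},
    "carol": {"A": 2, "B": 6, "C": 7},
}
stock = {"A": 1, "B": 2, "C": 2}

total, plan = best_campaign(value, stock, n_offers=2)
print(total)
print(plan)
```

With only five items in stock and two offers per folder, at most two customers can be selected, so the search leaves one customer without a folder. The number of plans grows exponentially with the number of customers, which is why campaigns with millions of customers need a dedicated solver rather than enumeration.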
