Symbolic data analysis and the sodas software

While in data mining and classical statistics the data to be analyzed usually presents one single value for each variable, that is no longer the case when the entities under analysis are not single elements, but groups gathered on the basis of some given criteria. We define symbolic data analysis sda as the extension of standard data analysis to symbolic data tables as input in order to find symbolic objects as. Standard statistical methods do not have the power or flexibility to make sense of very large datasets, and symbolic data analysis te. This is shown by several applications in official statistics.

An introduction to symbolic data analysis and the sodas software edwin diday university paris 9 dauphine ceremade. Pdf an introduction to symbolic data analysis and its. Sodas software based on symbolic data analysis was extensively described. Knowledge discovery from symbolic data and the sodas software. Finally, we introduce the software prototype, developed by 17 teams from nine countries involved in the sodas eurostat project. Raymond bisdorff crpgl, luxembourg the development of the sodas software based on symbolic data analysis was extensively described in the previous chapters of this book. Symbolic data analysis and the sodas software is primarily aimed at practitioners of symbolic data analysis, such as statisticians and economists, within both the public and private sectors. Symbolic linear regression methodology filipe afonso, lynne billard, edwin diday and mehdi limam. Symbolic object and symbolic description, complex mathematical entity, qualitative original variables, symbolic multivalued categorical variables, breakdown or drilldown process, symbolic descriptions generalization, geographical statistical units, categorical single and multivalued variables, socioeconomic data. The symbolic data analysis theory is now enhanced by a new software tool called sodas which results from the effort of 17 european teams sponsored by eurostat.

Symbolic data analysis sda is an extension of standard data analysis where symbolic data tables are used as input and symbolic objects are made output as a result. The sodas 2 software is the result of the european project asso analysis system of symbolic official data 20012004. Symbolic data analysis and the sodas software wiley online books. Provides a supplementary website featuring links to download the sodas software developed exclusively for symbolic data analysis, data sets, and further material. It was accompanied by a series of benchmark activities involving some official statistical institutes throughout europe. Pdf an introduction to symbolic data analysis and the sodas. Symbolic data analysis and the sodas software wiley online. Standard statistical methods do not have the power or flexibility to make sense of very large datasets, and symbolic data analysis techniques have been developed in order to. Diday university of paris ix dauphine and inria aim from hudge data in an economic wayextract new knowledgesummarize concatenatesolve confidentialityexplain correlation how. There is also much of interest to postgraduate students and researchers within web mining, text mining, and bioengineering. Symbolic data analysis and the sodas software ebook, 2008. Based on these four spaces, new problems appear such as the quality, robustness and reliability of the approximation of a concept by a symbolic object, the symbolic description of a class, the consensus between symbolic descriptions etc in this paper we give an overview on recent development in sda.

Contents contributors ix foreword xiii preface xv asso partners xvii introduction 1 1 the state of the art in symbolic data analysis. The electronic journal of symbolic data analysis vol. Jan 01, 2003 read an introduction to symbolic data analysis and the sodas software, intelligent data analysis on deepdyve, the largest online rental service for scholarly research with thousands of academic publications available at your fingertips. Sodas symbolic official data analysis system is a modular software in which. An introduction to symbolic data analysis and the sodas. New advances in symbolic data analysis and spatial. Symbolic data analysis and the sodas software core. Moreover, they can be used to define queries of a relational data base and propagate concepts between data bases.

Finally, we introduce the software prototype, developed by 17 teams from nine countries involved in the sodas eurostat. The author gives a short introduction to the sodas software. Sodas symbolic official data analysis system is a modular software in which each statistical method symbolic objects data base, distance matrix for symbolic objects, divisible classification of symbolic data, symbolic kernel discriminant analysis, symbolic description of groups, factorial discriminant analysis, principal component. Pdf the data descriptions of the units are called symbolic when they are more complex than standard ones due to the fact that they contain. An introduction to symbolic data analysis and the sodas software, journal journal of. The data descriptions of the units are called symbolic when they are more complex than standard ones due to the fact that they contain internal. They model concepts and constitute an explanatory output for data analysis. Symbolic data arise from many sources, for instance when summarizing huge relational data bases by their underlying concepts. Symbolic data analysis and the sodas software in official. Features a supporting website hosting the software, and user manual.

Mar 21, 2008 symbolic data analysis and the sodas software is primarily aimed at practitioners of symbolic data analysis, such as statisticians and economists, within both the public and private sectors. Symbolic data analysis by lynne billard overdrive rakuten. The data units are called symbolic since they are more complex than standard ones, as they not only contain values or categories, but also include internal variation and structure. Introduces the sodas software, which is complementary to existing data analysis software e. We define symbolic data analysis sda as the extension of standard data analysis to such tables.

An introduction to symbolic data analysis and the sodas software article pdf available in intelligent data analysis 76. Symbolic data analysis sda is a new statistical approach invented by edwin diday. Pdf an introduction to symbolic data analysis and the. An introduction to symbolic data analysis and the sodas software. Symbolic data analysis workshop 2018 estgipvc portugal. Symbolic data analysis the two levels of statistical units. Induces, exports, and compares knowledge from one database to another. Symbolic data analysis is a relatively new field that provides a range of methods for analyzing complex datasets. Symbolic data analysis and the sodas software wiley.

Sas, spss, spad that are unable to work on symbolic data. Primarily aimed at statisticians and data analysts, symbolic data analysis is also ideal for scientists working on problems involving large volumes of data from a range of disciplines. Symbolic data analysis and the sodas software guide books. Features exercises at the end of each chapter, enabling the reader to develop their understanding of the theory.

Spatial classification symbolic data analysis software. The general aim of sodas can be stated in the following way. An introduction to symbolic data analysis and its application to the sodas project. It supports the analysis of multidimensional complex data numerical and non numerical coming from databases mainly in statistical offices and administration using symbolic data analysis. Symbolic data analysis and the sodas software mathematical.

We briefly describe some sda tools and methods and, in particular, we describe some dissimilarity methods for symbolic objects which are central to the majority of symbolic data analysis methods. Standard statistical methods do not have the power or flexibility to make sense of very large datasets, and symbolic data analysis techniques have been developed in order to extract knowledge from such data. Moreover they can be used in order to define queries of a relational data base and propagate concepts between data bases. We define symbolic data analysis sda as the extension of standard data analysis to symbolic data tables as input in order to find symbolic objects as output. Exploratory methods for extracting statistical information from complex data. An introduction to symbolic data analysis and the sodas software 2003. Multilayer perceptrons and symbolic data fabrice rossi and brieuc conanguez. Symbolic data analysis sda provides a framework for the representation and analysis of data that comprehends inherent variability. By working on higher level units called concepts necessary described by more complex data extending data mining to. Symbolic data analysis and the sodas software by edwin diday. We define symbolic data analysis sda as the extension of standard data analysis to symbolic data tables.

1511 1089 1446 229 418 522 625 73 1278 994 78 1548 94 133 9 1166 1009 825 1466 1549 1411 530 1481 1107 595 991 1301 955 1140 135 756 733 268 700 19 165 520 944 133 1391 1279 1317 1370 1074 403 318