Girish Mahajan (Editor)

Information algebra

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

The term "information algebra" refers to mathematical techniques of information processing. Classical information theory goes back to Claude Shannon. It is a theory of information transmission, looking at communication and storage. However, it has not been considered so far that information comes from different sources and that it is therefore usually combined. It has furthermore been neglected in classical information theory that one wants to extract those parts out of a piece of information that are relevant to specific questions.

Contents

A mathematical phrasing of these operations leads to an algebra of information, describing basic modes of information processing. Such an algebra involves several formalisms of computer science, which seem to be different on the surface: relational databases, multiple systems of formal logic or numerical problems of linear algebra. It allows the development of generic procedures of information processing and thus a unification of basic methods of computer science, in particular of distributed information processing.

Information relates to precise questions, comes from different sources, must be aggregated, and can be focused on questions of interest. Starting from these considerations, information algebras (Kohlas 2003) are two-sorted algebras ( Φ , D ) , where Φ is a semigroup, representing combination or aggregation of information, D is a lattice of domains (related to questions) whose partial order reflects the granularity of the domain or the question, and a mixed operation representing focusing or extraction of information.

Information and its operations

More precisely, in the two-sorted algebra ( Φ , D ) , the following operations are defined

Additionally, in D the usual lattice operations (meet and join) are defined.

Axioms and definition

The axioms of the two-sorted algebra ( Φ , D ) , in addition to the axioms of the lattice D :

A two-sorted algebra ( Φ , D ) satisfying these axioms is called an Information Algebra.

Order of information

A partial order of information can be introduced by defining ϕ ψ if ϕ ψ = ψ . This means that ϕ is less informative than ψ if it adds no new information to ψ . The semigroup Φ is a semilattice relative to this order, i.e. ϕ ψ = ϕ ψ . Relative to any domain (question) x D a partial order can be introduced by defining ϕ x ψ if ϕ x ψ x . It represents the order of information content of ϕ and ψ relative to the domain (question) x .

Labeled information algebra

The pairs ( ϕ , x )   , where ϕ Φ and x D such that ϕ x = ϕ form a labeled Information Algebra. More precisely, in the two-sorted algebra ( Φ , D )   , the following operations are defined

Models of information algebras

Here follows an incomplete list of instances of information algebras:

  • Relational algebra: The reduct of a relational algebra with natural join as combination and the usual projection is a labeled information algebra, see Example.
  • Constraint systems: Constraints form an information algebra (Jaffar & Maher 1994).
  • Semiring valued algebras: C-Semirings induce information algebras (Bistarelli & Montanari Rossi1997);(Bistarelli et al. Schiex);(Kohlas & Wilson 2006).
  • Logic: Many logic systems induce information algebras (Wilson & Mengin 1999). Reducts of cylindric algebras (Henkin, Monk & Tarski 1971) or polyadic algebras are information algebras related to predicate logic (Halmos 2000).
  • Module algebras: (Bergstra, Heering & Klint 1990);(de Lavalette 1992).
  • Linear systems: Systems of linear equations or linear inequalities induce information algebras (Kohlas 2003).
  • Worked-out example: relational algebra

    Let A be a set of symbols, called attributes (or column names). For each α A let U α be a non-empty set, the set of all possible values of the attribute α . For example, if A = { name , age , income } , then U name could be the set of strings, whereas U age and U income are both the set of non-negative integers.

    Let x A . An x -tuple is a function f so that dom ( f ) = x and f ( α ) U α for each α x The set of all x -tuples is denoted by E x . For an x -tuple f and a subset y x the restriction f [ y ] is defined to be the y -tuple g so that g ( α ) = f ( α ) for all α y .

    A relation R over x is a set of x -tuples, i.e. a subset of E x . The set of attributes x is called the domain of R and denoted by d ( R ) . For y d ( R ) the projection of R onto y is defined as follows:

    π y ( R ) := { f [ y ] f R } .

    The join of a relation R over x and a relation S over y is defined as follows:

    R S := { f f ( x y ) -tuple , f [ x ] R , f [ y ] S } .

    As an example, let R and S be the following relations:

    R = name age A 34 B 47 S = name income A 20'000 B 32'000

    Then the join of R and S is:

    R S = name age income A 34 20'000 B 47 32'000

    A relational database with natural join as combination and the usual projection π is an information algebra. The operations are well defined since

  • d ( R S ) = d ( R ) d ( S )
  • If x d ( R ) , then d ( π x ( R ) ) = x .
  • It is easy to see that relational databases satisfy the axioms of a labeled information algebra:

    semigroup 
    ( R 1 R 2 ) R 3 = R 1 ( R 2 R 3 ) and R S = S R
    transitivity 
    If x y d ( R ) , then π x ( π y ( R ) ) = π x ( R ) .
    combination 
    If d ( R ) = x and d ( S ) = y , then π x ( R S ) = R π x y ( S ) .
    idempotency 
    If x d ( R ) , then R π x ( R ) = R .
    support 
    If x = d ( R ) , then π x ( R ) = R .

    Connections

    Valuation algebras 
    Dropping the idempotency axiom leads to valuation algebras. These axioms have been introduced by (Shenoy & Shafer 1990) to generalize local computation schemes (Lauritzen & Spiegelhalter 1988) from Bayesian networks to more general formalisms, including belief function, possibility potentials, etc. (Kohlas & Shenoy 2000). For a book-length exposition on the topic see Pouly & Kohlas (2011).
    Domains and information systems
    Compact Information Algebras (Kohlas 2003) are related to Scott domains and Scott information systems (Scott 1970);(Scott 1982);(Larsen & Winskel 1984).
    Uncertain information 
    Random variables with values in information algebras represent probabilistic argumentation systems (Haenni, Kohlas & Lehmann 2000).
    Semantic information 
    Information algebras introduce semantics by relating information to questions through focusing and combination (Groenendijk & Stokhof 1984);(Floridi 2004).
    Information flow 
    Information algebras are related to information flow, in particular classifications (Barwise & Seligman 1997).
    Tree decomposition 
    ...
    Semigroup theory 
    ...

    Historical Roots

    The axioms for information algebras are derived from the axiom system proposed in (Shenoy and Shafer, 1990), see also (Shafer, 1991).

    References

    Information algebra Wikipedia


    Similar Topics