Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata”...

33
Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy

Transcript of Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata”...

Page 1: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Computing languages by

(bounded) local sets

Dora GiammarresiUniversità di Roma “Tor Vergata”

Italy

Page 2: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Summary of the talk

Put in a unique framework some know (disjoint) results and get a:

Characterization of Chomsky’s hierarchy by

local sets + alphabetic projections

1.

Insert new families into Chomsky’s hierarchy by introducing new types of local sets:

Bounded-Grids Context-Sensitive languages

2.

Page 3: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Definition: String language L is local if all substrings of length 2 are in a finite set Θ. (L=L(Θ) )

Local (string) languages…

w with border

# #0 001 1

string w over Γ= {0, 1}

w= 0 001 1

Θ =finite set of strings of length 2

over Γ #…allowed substrings

0 0 0 0

0

1 1

1# #

Page 4: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

… to define Regular languages

Theorem : L is regular local set of strings S such that L=π(S).

, Γ alphabets π: Γ alphabetic

projection

Local set

+

projection

= Finite automaton

Proof:

local alphabet Γ = edges of automaton

set Θ = pairs of consecutive edges

π gives labels of the edges

Page 5: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Local Picture Languages [GR’94]

picture p over Γ= {0, 1}

Definition: Picture language L is local if all subpictures of size 2x2 are in a finite set Θ . (L=L(Θ) )

p=

p with border# # # # # # ##### # # #

###

# # #

0 0 00 0 0 0

0 0

11

1

1 1 1

Θ =finite set of 2x2 pictures

over Γ #…allowed subpictures

00

000 0

00# #

#0# #

11 1 1

11

0 0 00 0 0 0

0 0

11

1

1 1 1

Page 6: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

…to define Context-sensitive languages

Theorem [F69 – LS97]: L is context-sensitive local set of pictures S such that L=π(fr(S)).

Proof: () given an accepting run of an LBA, take all instantaneous configurations and write them in order, one above the others. This gives a local picture.

() given a local set of 2x2 squares, define corresponding context-sensitive grammar rules such that derivations correspond to the local pictures.

p=

fr(p) = frontier of p

# # # # # # ##### # # #

###

# # #0

00

00

1 011

122

222

Page 7: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Local sets of binary trees…

tree t with border

Definition: Tree set L is local if all 3-vertices sub-trees belong to a finite set Θ . (L=L(Θ) )

Θ =finite set of 3-vertices trees

over Γ #… allowed sub-trees

1 0

1

0 0

0

1 0

0

1 0

0

# #

1

1

0

# #

1

# #

1

# #

0

0

# #

1

# #

1

$

Γ= {0, 1, $}

Page 8: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

…to define Context-free languages

Theorem [MW67]: L is context-free local set of binary trees S such that L=π(fr(S)).

Proof: Notice that a derivation tree of a context free grammar in Chomsky's Normal Form is actually a local binary tree (possibly after some minor modifications).

fr(t) = frontier of t

$

t=

# # # # # # # # # # # # # #

Page 9: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Look at them all together…

● Local sets of binary trees elementary binary tree: x y

z

● Local sets of lines elementary line:

x y x y

● Local sets of grids elementary grid:

x yz t

x y

z t

Page 10: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

look at them all together…

● Sets of line graphs Γ= {0,1}

● Sets of binary trees

● Sets of grid graphs

frontier = all non-border vertices

frontier = vertices adjacent

to the leaves

frontier =

lowest

non-border row

# # # # # # # #

## # # # #

## # # # #

0 0 0 0

#

#

#

0 0 0 0

0 0 0 0

#

#

#

0 ## 0 0 0

Page 11: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Chomsky’s hierarchy by local sets

Proposition: Let L be a (string) language. Then:

3. L is context-sensitive

L is projection of the frontier of a local set of grids;

2. L is context-free

L is projection of the frontier of a local set of binary trees;

1. L is regular

L is projection of the frontier of a local set of lines;

Page 12: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Local sets as computations…

#

# # # # # # # # ## ## #####

##

##

######### # # # # # #

# # # # # # # # # # # # # #

# #

Regular

Context-free

Context-sensitiveLocal machines!

Page 13: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Remark on local computations

Size of local graph is measure of TIME of computation

NOT measure of SPACE!

No need to keep the all graph space:

• Lines left-to-right # #

• Trees leaves-to-root

$

# # # # # # # # # # # # # #• Grids bottom-to-top

# # # # # # # # ######

##

#

##

##

######### # # # # # #

Page 14: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

New families into Chomsky’s hierarchy?

Define a new type of local sets ….

….. and get a new family of string languages!

# ## # # # # # # # ######

##

#

##

##

######### # # # # # #

$

# # # # # # # # # # # # # #

Regular

Context-free

Context-sensitive

Page 15: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

What are the differences?

# ## # # # # # # # ######

##

#

##

##

######### # # # # # #

$

# # # # # # # # # # # # # #

Regular

Context-free

Context-sensitive

degree ≤ 2 ≤ 3 ≤ 4

≤ 4n O(n) not bounded by nsize n+2 O(1)BIG GAP!

Page 16: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Bounded Grids Local Sets Computations

Definition: grids with 2-sides frontiers

# # # # # # # ######

###### # ####

0 00 01 1

0

11

1 000

00

00

1

1111

1 1

0 00 01 1

0

11

1 1 0

length of w

n

Exact size depends on the position

where we turn the string!

size of local grid for w,

≤ (n+2)2

2

Page 17: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Bounded Grids Local Sets

Need to exploit geometric local properties of patterns defined in the pictures….

No more space for istantaneous configurations of

a run of a LBA automaton!

Use local picture languages theory

techniques [GR’94]

Page 18: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

An example

L= anbn | n≥0 }

0 0 0 0 00 0 0 0

0 0 00 0

0

11 11 1 11 1 1 1

4 4 4 4 4# # # # #

# # # # #

########

########

π: Γ → projection

4 → a

0 → b

= {a,b}

Γ= {0,1,4}

Page 19: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Another example (use same idea!)

L= anbncn | n≥0 }

0 0 0 0 00 0 0 0

0 0 00 0

0

11 11 1 11 1 1 1

2 2 2 2 22 2 2 2

2 2 22 2

2

33 333

33

33 3

4 4 4 4 4 5 5 5 5 5 π: Γ → projection

4 → a

5 → b

2 → c

= {a,b,c}

Γ= {0,1,4,2,3,5}

Page 20: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Another example

L= wwR | w*} palindromes

a0a0

a0

a0

a0

a0

a0

a0

a0

a0

a0

a0

a0

a1 a1 a1a1 a1 a1 a1

a1a1a1a1a1a1

a1

a0b0b0

b0

b0b0

b0

b0

b1 b1 b1 b1 b1

b1 b1 π: Γ → projection

a0, a1 → a

b0, b1 → b

= {a,b}

Γ= {a0, b0, a1, b1}

Page 21: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Bounded Grids Computations

Bounded-grids context sensitive

(Bgrid-CS).

Theorem:

If L is a projection of the 2-sides frontier of a local picture language, then L is context-sensitive.

Page 22: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Proof:

Idea: define a LBA that “behaves” as a local machine:

all the writing operations effectively build the picture!

• non deterministically rewrite w=x1x2....xn , π(xi)=ai

• put w as frontier of a picture (non deterministically choose i

and put xi in the BR-corner of a picture)

• check that all bottom and right border subpictures are in Θ

Let w=a1a2....an

# #0

# # # # # #######

# 0 0 01 1111

1110

00

• finish to build the picture by elements of Θ

Page 23: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

More examples

L= anb2nc(n+3) | n≥0 }

L= an | n≥0 }; L= a2 | n≥0 }

L= ap | p prime}; L= af | f not prime}

L= ww | w* }

L= w|w| | |w|>1}

2 n

Page 24: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Closure properties

Theorem: Bgrid-CS languages are closed under concatenation and Kleene's star.

Theorem: Bgrid-CS languages are closed under Boolean union and intersection.

Page 25: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Open problem

● Bgrid-CS languages = CS languages?

Remark 2: [ Gladkij]

there are CS-languages with no linear bounded derivations

Remark 1: [ R.Book71]

In 1971 R. Book defined infinite hierarchy of subfamilies of context-sensitive languages corresponding to different time bounding functions leaving open question whether hierarchy collaps.

Page 26: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Open problem

…recall that CS languages are closed under complement.

● Are Bgrid-CS languages closed under complement?

What about deterministic versions?

Page 27: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Deterministic Local B-grid (machines)

Open question: are deterministic B-grid CS languages equivalent to non-deterministic ones?

x1,x2,x3 Γ {#} there is at most one

Definition:

Set of 2x2 grids Θ is deterministic when,

yx1 x2

x3 Θ

Page 28: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Remark

● Bgrid-CS languages = CS languages

● Bgrid-CS languages are “deterministic”

DSPACE(n)=NSPACE(n)

Page 29: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Open problem (the last one!)

…turning back to characterization for the

Chomsky's hierarchy …….

● Can we define a “local set” to characterize recursive languages?

Page 30: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

The end

Page 31: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Proof (by example) 1

1

0

0

p q

p0p0

p0 p1

p1

p1

q1

q0

q0

q0 q0

q1

= {0, 1}

p0q1

p1q1

Γ = {p0, p1, q0, q1 }

# p0

# p1

#

#q1

p0

p

pq

qπ: p0 , q0

p1 , q10

1

Page 32: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Look at them more generally…

Γ alphabet #1,…, #k Γ border symbol

• embedded labeled graph over Γ {#1,…, #k }

• frontier = (labels of) a path of non-border vertices

• border vertices = vertices carryng #1,…, #k

(string over Γ)• elementary graph = “small” graph shape

• local set of graphs = (labels of) a path of non-border vertices

Page 33: Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.

Local sets

Definition:

A set of graphs S over Γ {#} is local if there exists a finite set of elementary graphs Θ over Γ {#} such that, for all s S, every subgraph of elementary size belongs to Θ. We write S= L (Θ).

Given a typology of graphs,

fix “shape” of elementary graph