Regular Languages

(1)

Properties of Regular Languages

(2)

Standard Representations of Regular Languages

A language L over an alphabet S is regular

iff

• It is accepted by FA (DFA, NFA, or NFA-^e).

• It is described by a regular expression.

• It is generated by regular grammar (Left-linear or Right-linear).

(3)

Standard Representations of Regular Languages

Regular Languages

Regular

Expressions

Regular Grammars (Left/Right Linear) FA

(DFA/NFA/NFA- e)

(4)

When we say: We are given a Regular Language

We mean:

L

Language is in a standard Representation (FA/RE/RG)

L

Standard Representations of Regular Languages

(5)

Closure Properties

• Recall a closure property is a statement that a certain operation on languages, when applied to languages in a class, produces a result that is also in that class.

• For regular languages, we can use any of its representations to prove a closure property.

Given two regular languages L

₁

and L

₂

, is their union is also regular ? If it is true for all regular

languages, then family of regular

languages is closed under union.

(6)

L 1 L ₂

Are regular Languages For regular languages and

we will prove that:

Properties of RLs

(7)

We say: The family of Regular languages are

closed under

(8)

L 1 Regular language

( ) ^M ₁ ^L ₁

L = M 1

Single final state

NFA M 2

L 2

Single final state

( ) ^M ₂ ^L ₂

L =

Regular language

NFA

(9)

Example

a

b

M 1

{ } ^ba

L ₂ = ^b ^a

M 2

L

₁

={a

ⁿ

b :n≥0}

(10)

Union

NFA for

M 1

M 2

!

_"

∪ !

_$

l l l

l

(11)

Example

a

b

b a

l l l

l

}

1 { a b L = ⁿ

}

2 { ba L =

} {

}

2 {

1 L a b ba

L È = ⁿ È

NFA for

(12)

Concatenation

NFA for L 1 L 2

M 1 M ₂

l l

(13)

Example

NFA for

a

b _b a

}

1 { a b L = ⁿ

}

2 { ba L =

} {

} }{

2 {

1 L a b ba a bba

L = ⁿ = ⁿ

l l

³ 0

n

(14)

Star Operation

NFA for L ₁ *

M 1

l

1 * Î L

l

l l

(15)

Example

The string in L

₁

* is w NFA for

* } {

1 * a b L = ⁿ

a

b

}

1 { a b L = ⁿ l

l

1 2 1

L w

w w

i

k

Î

= !

(16)

Reversal

L ₁ R

M 1

NFA for 1 ¢ M

1. Reverse all transitions

2. Mark initial state as final state and vice versa

L 1

(17)

Example

}

1 { a b L = ⁿ

a

b

M 1

}

1 R { ba n

L =

a

b

1 ¢

M

(18)

Complement

1. Take the DFA that accepts L 1

M 1

L 1 ¢

M 1

L 1

2. Make final states non-final,

and vice-versa

(19)

Example

}

1 { a b L = ⁿ

a

b

M 1

b a,

} {

* } ,

1 { a b a b

L = - ⁿ a

b

1 ¢ M

b a,

b

a,

(20)

Intersection

DeMorgan’s Law: L 1 Ç L 2 = L 1 È L 2

2 1

, L

L regular

2 1

, L

L regular

2

1

L

L È regular

2

1

L

L È regular

2

1

L

L Ç regular

(21)

Example

}

1 { a b L = ⁿ

} ,

2 { ab ba L =

regular regular

}

2 {

1 L ab

L Ç =

regular

(22)

Example

a _b

b a

b a,

b

} a,

2 {

1 L ab

L Ç =

(23)

Intersection (Product)

L₁ = L(M₁), M₁ = (Q, ^S, ^d1, q0, F₁), ^{Q = {}q0, q1…... q_m,^} L₂ = L(M₂), M₂ = (P, ^S, ^d₂, p₀, F₂), Q = {p₀, p₁…... p_n,} M₁ and M₂ are DFAs.

Define new automation, M’ = (Q’, ^S, ^d’, (q₀, p₀), F’) Q’ = QXP = {(qi, p_j): qi Q, p_j P}

Define transition function ^d^{’ s.t.}

d’( (q_i, p_j), a ) = (q_k, p_l) If ^d₁(q_i, a ) = q_k

AND d₂(p_j, a ^{) =}p_l

Define transition for all states.

The initial state is (q₀, p₀),

F’ is the set of all states s.t. (q_i, p_j) s.t.

qi, F₁, p_j F₂

(24)

Example-1

(25)

Example-2

(26)

Find product

(27)

DIFFERENCE

L₁ = L(M₁), M₁ = (Q, ^S, ^d₁, q₀, F₁), Q = {q₀, q₁…... q_m,} L₂ = L(M₂), M₂ = (P, ^S, ^d₂, p₀, F₂), Q = {p₀, p₁…... p_n,} M₁ and M₂ are DFAs

!

_"

− !

_$

= !

_"

∩ !

_$

2 1

, L

L regular

regular regular

!

_"

, !

_$

!

_"

∩ !

_$

(28)

DIFFERENCE

If L₁ and L₂ are regular languages, then So is L₁ - L₂ L₁ - L₂consists of strings in L₁ but not L₂i.e.

!

_"

− !

_$

= !

_"

∩ !

_$

Let M₁ and M₂ be DFA’s whose languages are L₁ and L₂, respectively.

Construct C, the product automaton of M₁ and M₂. Make the final states of C be the pairs where M₁-state is final but M₂-state is not.

(29)

Example

(30)

Exercise

L

₁

= L ( (a+b)a*) L

₂

= L( baa*)

For given languages L

₁

and L

₂

, construct the FAs.

• Union

• Star closure

• Concatenation

• Complement

• Reversal

• Intersection

• Difference

(31)

Exercise

L₁ = L ( (a+b)a*) L₂ = L( baa*)

!

_"

⊝ !

_$

= {': ' ∈ !

_"

*+ ' ∈

!

_$

,-. ' /0 1. /1 ,.ℎ !

_"

314 !

_$

} 1*+ (!

_"

, !

_$

) = {': ' ∉ !

_"

314 ' ∉ !

_$

}

:*+ (!

_"

, !

_$

) = {' ∶ ' ∈ !

_"

*+ ' ∈ !

_$

}

! = {-<: - ∈ !, < ∈ !

⁼

}

(32)

Exercise

L₁ = {w ε {0, 1 }* : w consists of 0’s in multiple of 3}

L₂ = {w ε {0, 1 }* : w consists of odd number of 0’s}

!

_"

⊝ !

_$

= {': ' ∈ !

_"

*+ ' ∈

!

_$

,-. ' /0 1. /1 ,.ℎ !

_"

314 !

_$

} 1*+ (!

_"

, !

_$

) = {': ' ∉ !

_"

314 ' ∉ !

_$

}

:*+ (!

_"

, !

_$

) = {' ∶ ' ∈ !

_"

*+ ' ∈ !

_$

}

! = {-<: - ∈ !, < ∈ !

⁼

}

(33)

Exercise

L

₁

= {w ε {0, 1 }* : w consists of 0’s in multiple of 3}

L

₂

= {w ε {0, 1 }* : w consists of odd number of 0’s}

For given languages L

₁

and L

₂

, find FA for

• Union

• Star closure

• Concatenation

• Complement

• Reversal

• Intersection

• Difference

(34)

Reversal of Regular Expression

Basis: If r is a primitive RE (∅, λ or a ε ∑ ), then r^R= r.

Induction: If r₁ and r₂ are two REs (r₁ + r₂)^R = (r₁)^R + (r₂)^R

(r₁r₂)^R = (r₂)^R.(r₁)^R (r₁*)^R = ((r₁)^R)*

(35)

Reversal of Regular Expression: Example

Let r₁ = 01* , r₂ =10*.

(r₁ + r₂)^R = (01* + 10*)^R = (01*)^R+ (10*)^R

= (1*)^R0^R + (0*)^R1^R

= (1^R)*0 + (0^R)*1

= 1*0+ 0*1

(36)

Homomorphism

A homomorphism on an alphabet is a function that gives a string for each symbol in that alphabet.

h : ∑ Γ*

Where ∑ and Γ are alphabets.

Rules: h(a₁a₂ … a_n-1a_n) = h(a₁)h(a₂) …… h(a_n-1)h(a_n) h(a₁+ a₂) = h(a₁) + h(a₂)

h( (a₁)* ) = (h(a₁))*

Example: h(0) = ab; h(1) = λ

∑ = {0, 1} Γ ={a, b}

h(01010) = h(0)h(1)h(0)h(1)h(0)

= (ab)(λ)(ab)(λ)(ab)(λ)(ab) = ababab

(37)

Closure Under Homomorphism

If L is a regular language, and h is a homomorphism on the alphabet of language L.

Then h(L) = {h(w) | w εL} is also a regular language.

Proof: Let r be a regular expression for L.

Apply h to each symbol in r to form a new RE r’

The new RE r’ generates language h(L).

(38)

Closure Under Homomorphism : Example

Let h(0) =ab; h(1) = λ.

∑ = {0, 1} Γ ={a, b}

If L is the RL with regular expression 01* + 10*.

Then, h(L) is the homomorphism on the alphabet of language L.

The regular expression of language h(L) is r’ = h(01* +10*)

= h(0)(h(1))* + h(1) (h(0)*

= (ab)(λ)* + (λ)(ab)*

= (ab)λ + λ (ab)*

= (ab) + (ab)*

= (ab)*

(39)

Closure Under Homomorphism : Example

Let h(a) =dbcc; h(b) = bdc.

∑ = {a, b} Γ ={b, c, d}

If L is the regular language denoted by RE r = (a + b*) (aa*).

Show that h(L) is also regular language.

(40)

Right Quotient of Languages

Let L

₁

and L

₂

be languages on the same alphabet.

Then the right quotient of L

₁

with L

₂

is

L

₁

/ L

₂

= {x: xy L

₁

for some y L

₂

}

In other words, if the string in L

₁

has a

suffix from L

₂

, remove the suffix and the

resulting string is in L

₁

/L

₂

(41)

Right Quotient of Languages: Example

L₁ = {aⁿb^m : n ≥ 1, m ≥ 0} ∪ {ba}

L₁ = {ba, a, aa, aaa, …. ab, abb, …… aab, aabb……}

L₂ = {b^m : m ≥ 1}

L₂ = {b, bb, bbb, ….. }

Then the right quotient of L₁ with L₂ is L₁ / L₂ = {aⁿb^m : n ≥ 1, m ≥ 0}

(42)

Closure under right quotient

If L

₁

, L

₂

are regular then L

₁

/L

₂

is regular.

L

₁

is regular so it has a FA.

For each node in the FA of L

₁

, check if there is a walk from that node to a final node using a string in L

₂

.

If so, mark that node final.

So L

₁

/L

₂

is regular.

(43)

Example

L

₁

= L(abaa); L

₂

= L(ab*) DFA for L

₁

:

q₀ q1 q2

q3

a

b a

a

a, b

b b

(44)

Example

L

₁

= L(abaa); L

₂

= L(ab*) DFA for L

₂

:

L

₂

= {a, ab, abb … }

p0 p1

p2

b a

a

a, b

b

(45)

Example

L

₁

= L(abaa); L

₂

= L(ab*)

Remove final marking, remember final state is q2.

q₀ q1 q2

q3

a

b a a

a, b b b

L

₂

= {a, ab, abb … }

(46)

Example

L

₁

= L(abaa); L

₂

= L(ab*)

For each node, look for a walk on element of L₂ to q₂.

q₀ q1 q2

q3

a

b a a

a, b b b

L

₂

= {a, ab, abb … }

(47)

Example

L

₁

= L(abaa); L

₂

= L(ab*) DFA for L

₁

/L

₂

:

q0 q1 q2

q3

a

b a a

a,b

b b

(48)

Find Right Quotient

L

₁

= {a

ⁿ

b

^m

: n ≥1, m ≥0} ∪ {ba}

L

₂

= {b

^m

: m ≥ 1 }

L

₁

/L

₂

= ?

(49)

Exercise

L

₁

=L(abaa) L

₂

=L(aba*)

• Find L

₁

/L

₂

• If head of a language is defined as head(L) = {x : xy L for some y

**∑*}**

Find head(L

₁

) and head(L

₂

)

(50)

Elementary Questions about

Regular Languages

(51)

Membership Question

Question: Given regular language and string

how can we check if ?

L

L w Î

w

Answer: Take the DFA that accepts and check if is accepted

L

w

(52)

DFA

L w Î

DFA

L w Ï w

w

If there is a path from initial state to final state with labeled w; Accepted

If there is no path from initial state to final state with labeled w; Rejected

(53)

Given regular language how can we check

if is empty: ?

L L

Take the DFA that accepts

Check if there is any path from the initial state to a final state

L )

( L = Æ

Question:

Answer:

(54)

DFA

Æ

¹ L

DFA

Æ

=

L

(55)

Given regular language how can we check

if is finite?

L L

Take the DFA that accepts

Check if there is a walk with cycle from the initial state to a final state

L

Question:

Answer:

(56)

DFA

L is infinite

DFA

L ^{is finite}

(57)

Given regular languages and

how can we check if ? L 1 L ₂

2 1 L

L =

Question:

Æ

= Ç

È

Ç ) ( ) ( L ₁ L ₂ L ₁ L ₂

Find if

Answer:

(58)

Æ

= Ç

È

Ç ) ( ) ( L ₁ L ₂ L ₁ L ₂

Æ

= Ç ₂

1 L

L ^and L ₁ Ç L ₂ = Æ

2 1 L

L = L 1

L 2 L ₂ L ₁

2 1 L

L Í L ₂ Í L ₁

L 2 L ₁

(59)

Æ

¹ Ç

È

Ç ) ( ) ( L ₁ L ₂ L ₁ L ₂

Æ

¹ Ç ₂

1 L

L ^or L ₁ Ç L ₂ ¹ Æ L 1

L 2 L ₂ L ₁

2 1 L

L Ë L ₂ Ë L ₁

2 1 L

L ¹

(60)

Non-regular languages

(61)

Regular Languages

Every finite language is regular.

Some infinite languages are regular L

₁

={a

ⁿ

b : n >=0}

Regularity:

The language is regular if, in processing any string, the information that has to be

remembered at any stage is strictly limited.

Some infinite languages are not regular

L

₂

= {a

ⁿ

b

ⁿ

: n >=0}

(62)

How can we prove that a language L is regular?

Prove that there is DFA M that accepts L

Prove that there is RE r that generates L

Prove that there is RG G that generates L

(63)

Regular languages

b

a * b * c + a

...

etc

* ) ( a b c

b + +

Non-regular languages { a ⁿ b ⁿ : n ³ 0 }

}*}

, { :

{ vv ^R v Î a b

(64)

How can we prove that a language is not regular?

L

Prove that there is no DFA that accepts L

Problem: this is not easy to prove

Solution: the Pumping Lemma !!!

The pumping Lemma uses pigeonhole principle.

(65)

The Pigeonhole Principle

(66)

pigeons

pigeonholes

4

3

(67)

A pigeonhole must

contain at least two pigeons

(68)

...

pigeons

pigeonholes

n

m ⁿ ^> ^m

(69)

The Pigeonhole Principle

...

pigeons

pigeonholes

n m

m

n > There is a pigeonhole

with at least 2 pigeons

(70)

The Pigeonhole Principle

If we put n objects into m boxes and if n > m

• then at least one box must have more than

one object in it.

(71)

The Pigeonhole Principle and

DFAs

(72)

DFA with 4 states

q 1 a q ₂ q ₃ b

q 4

b

b b

b

a a

(73)

aab

q 1 a q ₂ q ₃ b

q 4

b

a a

a

In walks of strings: no state

is repeated

(74)

abab aabb aabbb abbabb

In walks of strings:

q 1 a q ₂ q ₃ b

q 4

b

a a

a

a state

is repeated

(75)

If string has length :

q 1 a q ₂ q ₃ b

q 4

b

a a

a

w ^| ^w ^| ^³ ⁴

Thus, a state must be repeated Then the transitions of string

are more than the states of the DFA

w

(76)

In general, for any DFA:

String has length number of states w ³

A state must be repeated in the walk of q w

... q ...

walk of w

Repeated state

(77)

In other words for a string : transitions are pigeons states are pigeonholes

q a

w

... q ...

walk of w

Repeated state

(78)

The Pumping Lemma

(79)

Take an infinite regular language L

There exists a DFA that accepts L

states m

(80)

Take string with w w Î L

There is a walk with label : w

...

walk w

(81)

If string has length w ^| ^w ^| ^³ ^m

^(number

of states of DFA)

then, from the pigeonhole principle:

a state is repeated in the walk w

... q ...

walk w

(82)

q

... q ...

walk w

Let be the first state repeated in the

walk of w

(83)

Write w = x y z

... q ...

x

y

z

(84)

... q ...

x

y

z

Observations: length | x y | £ m number of states of DFA

1 |

| y ³

length

(85)

The string is accepted.

z

Observation: x

... q ...

x

y

z

(86)

The string x y z

is accepted Observation:

... q ...

x

y

z

(87)

The string is accepted

z y y

Observation: x

... q ...

x

y

z

(88)

The string is accepted

z y y y

Observation: x

... q ...

x

y

z

(89)

The string is accepted

z y

x ⁱ

In General:

...

, 2 , 1 ,

= 0 i

... q ...

x

y

z

(90)

L z

y x ⁱ

In General: i = 0 , 1 , 2 , ...

... q ...

x

y

z

Language accepted by the DFA

(91)

In other words, we described:

The Pumping Lemma !!!

(92)

The Pumping Lemma:

• Given a infinite regular language L

• there exists an integer for any string m

with length

L

w Î ^| ^w ^| ^³ ^m

Then, we we can decompose w = x y z

with and | x y | £ m | y | ³ 1

such that: x y ⁱ z Î L i = 0 , 1 , 2 , ...

(93)

Applications of

the Pumping Lemma

(94)

Theorem: The language L = { a ⁿ b ⁿ : n ³ 0 }

is not regular

Proof: Use the Pumping Lemma

(95)

Assume for contradiction

that is a regular language L

Since is infinite.

we can apply the Pumping Lemma

L

} 0 :

{ ³

= a b n

L ⁿ ⁿ

(96)

Let be the integer in the Pumping Lemma

Pick a string such that: w _w _Î _L

m w | ³

length |

m m b a

w =

We pick

m

} 0 :

{ ³

= a b n

L ⁿ ⁿ

(97)

it must be that length

From the Pumping Lemma

1 |

| ,

|

| x y £ m y ³

b ab

aa aa

a b

a

xyz = ^m ^m = ... ... ... ...

1 , ³

= a k

y ^k

x y ^z

m m

Write: a ^m b ^m = x y z

Thus:

(98)

From the Pumping Lemma: x y ⁱ z Î L ...

, 2 , 1 ,

= 0 i

Thus:

m m b a

z y

x =

L z

y

x ² Î

1 , ³

= a k

y ^k

(99)

From the Pumping Lemma:

L b

ab aa

aa aa

a z

xy ² = ... ... ... ... ... Î

x y ^z

k

m + m

Thus:

L z

y

x ² Î

m m b a

z y

x = y = a ^k , k ³ 1

y

L b

a ^m ⁺ ^k ^m Î

(100)

L b

a ^m ⁺ ^k ^m Î

} 0 :

{ ³

= a b n L ⁿ ⁿ

BUT:

L b

a ^m ⁺ ^k ^m Ï

CONTRADICTION!!!

1 ≥

k

(101)

Our assumption that

is a regular language is not true

L

Conclusion: L is not a regular language

Therefore:

(102)

More Applications of

the Pumping Lemma

(103)

Theorem: The language is not regular

Proof: Use the Pumping Lemma

*}

:

{ Î S

= vv v

L ^R ^S ⁼ ^{ ^a ^, ^b ^}

(104)

Assume for contradiction

that is a regular language L

Since is infinite

we can apply the Pumping Lemma

L

*}

:

{ Î S

= vv v

L ^R

(105)

m m

m

m b b a a

w =

We pick

Let be the integer in the Pumping Lemma

Pick a string such that: w _w _Î _L

m w | ³

length | m

and

*}

:

{ Î S

= vv v

L ^R

(106)

Write a ^m b ^m b ^m a ^m = x y z

it must be that length

From the Pumping Lemma

a ba

bb ab

a aa

a

xyz = ... ... ... ... ... ...

x y _z

m m m m

1 |

| ,

|

| x y £ m y ³

1 , ³

= a k

y ^k

Thus:

(107)

From the Pumping Lemma: x y ⁱ z Î L ...

, 2 , 1 ,

= 0 i

Thus: x y ² z Î L

1 , ³

= a k

y ^k

m m

m

m b b a a

z y

x =

(108)

From the Pumping Lemma:

L a

ba bb

ab a

aa aa

a z

xy ² = ... ... ... ... ... ... ...

x y _z

k

m + m m m

1 , ³

= a k

y ^k

y

L z

y

x ² Î

Thus:

m m

m

m b b a a

z y

x =

L a

b b

a ^m ⁺ ^k ^m ^m ^m Î

(109)

L a

b b

a ^m ⁺ ^k ^m ^m ^m Î

L a

b b

a ^m ⁺ ^k ^m ^m ^m Ï

BUT:

CONTRADICTION!!!

³ 1 k

*}

:

{ Î S

= vv v

L ^R

(110)

Our assumption that

is a regular language is not true

L

Conclusion: L is not a regular language

Therefore:

(111)

Regular languages Non-regular languages

} 0 ,

:

{ ³

= a b c ⁺ n l

L ⁿ ^l ⁿ ^l

(112)

Theorem: The language is not regular

Proof: Use the Pumping Lemma

} 0 ,

:

{ ³

= a b c ⁺ n l

L ⁿ ^l ⁿ ^l

(113)

Assume for contradiction

that is a regular language L

Since is infinite

we can apply the Pumping Lemma

L

} 0 ,

:

{ ³

= a b c ⁺ n l

L ⁿ ^l ⁿ ^l

(114)

m m

m b c a

w = ²

We pick

Let be the integer in the Pumping Lemma

Pick a string such that: w _w _Î _L

m w | ³

length | m

} 0 ,

:

{ ³

= a b c ⁺ n l L ⁿ ^l ⁿ ^l

and

(115)

Write a ^m b ^m c ² ^m = x y z

it must be that length

From the Pumping Lemma

c cc

bc ab

aa aa

a

xyz = ... ... ... ... ... ...

x y _z

m m 2 m

1 |

| ,

|

| x y £ m y ³

1 , ³

= a k

y ^k

Thus:

(116)

From the Pumping Lemma: x y ⁱ z Î L ...

, 2 , 1 ,

= 0 i

Thus:

m m

m b c a

z y

x = ²

L xz

z y

x ⁰ =

1 , ³

= a k

y ^k

(117)

From the Pumping Lemma:

L c

cc bc

ab aa

a

xz = ... ... ... ... ... Î

x _z

k

m - m 2 m

m m

m b c a

z y

x = ² y = a ^k , k ³ 1 L

xz Î

Thus: a ^m ^- ^k b ^m c ² ^m Î L

(118)

L c

b

a ^m ^- ^k ^m ² ^m Î

L c

b

a ^m ^- ^k ^m ² ^m Ï

BUT:

CONTRADICTION!!!

} 0 ,

:

{ ³

= a b c ⁺ n l L ⁿ ^l ⁿ ^l

³ 1

k

(119)

Our assumption that

is a regular language is not true

L

Conclusion: L is not a regular language

Therefore:

(120)

Regular languages

Non-regular languages L = { a ⁿ ^! : n ³ 0 }

(121)

Theorem: The language

is not regular

Proof: Use the Pumping Lemma

} 0 :

{ ^! ³

= a n

L ⁿ

n n

n ! = 1 × 2 ! ( - 1 ) ×

(122)

Assume for contradiction

that is a regular language L

Since is infinite

we can apply the Pumping Lemma

L

} 0 :

{ ^! ³

= a n

L ⁿ

(123)

!

a m

w =

We pick

Let be the integer in the Pumping Lemma

Pick a string such that: w _w _Î _L

m w | ³

length | m

} 0 :

{ ^! ³

= a n

L ⁿ

(124)

Write a ^m ^! = x y z

it must be that length

From the Pumping Lemma

a aa

aa aa

aa a

a

xyz = ^m ^! = ... ... ... ... ...

x y _z

m m ! - m

1 |

| ,

|

| x y £ m y ³

m k

a

y = ^k , 1 £ £

Thus:

(125)

From the Pumping Lemma: x y ⁱ z Î L ...

, 2 , 1 ,

= 0 i

Thus:

!

a m

z y

x =

L z

y

x ² Î

m k

a

y = ^k , 1 £ £

(126)

From the Pumping Lemma:

L a

aa aa

aa a

z

xy ² = ... ... ... ... ... ... Î

x y _z

k

m + m ! - m

Thus:

!

a m

z y

x = y = a ^k , 1 £ k £ m L

z y

x ² Î

y

L

a ^m ^! ⁺ ^k Î

(127)

L a ^m ^! ⁺ ^k Î

!

! k p

m + =

} 0 :

{ ^! ³

= a n

L ⁿ

Since:

m k £

£ 1

There must exist such that: p

(128)

However:

)!

1 (

) 1 (

!

+

=

+

=

+

<

+

£

+

£

m

m m

m

m m

m k m

m ! + _for m > 1

)!

1 (

! + k < m + m

!

! k p

m + ¹ ^{for any} p

(129)

L a ^m ^! ⁺ ^k Î

L a ^m ^! ⁺ ^k Ï

BUT:

CONTRADICTION!!!

} 0 :

{ ^! ³

= a n

L ⁿ

m k £

£

1

(130)

Our assumption that

is a regular language is not true

L

Conclusion: L is not a regular language

Therefore:

(131)

Exercise

Prove that the following languages are not regular.

• L ={a

ⁿ

b

^l

a

^k

: k ≥ n+l }

• L ={a

ⁿ

b

ⁿ

: n ≥ 0 } ∪ {a

ⁿ

b

ⁿ⁺¹

: n ≥ 0 } ∪ {a

ⁿ

b

ⁿ⁺¹

: n ≥ 0 }

• L ={a

^n!

: n ≥ 1 }

Are the following languages are regular ?

• L ={w ε {a, b, c}* : |w| = 3n

_a

(w)}

• L ={w

₁

cw

₂

: w

₁

, w

₂

ε {a, b}*, w

₁

≠w

₂

}

(132)

Exercise

Let ^{L = {a}

ⁿ

^b

^m

: n ≥ 100, m ≤ 50}

• Can you use the pumping lemma to show that L is regular ?

• Can you use the pumping lemma to show

that L is regular ?

(133)

Lex

(134)

Lex: a lexical analyzer

• A Lex program recognizes strings

• For each kind of string found

the lex program takes an action

(135)

Var = 12 + 9;

if (test > 20) temp = 0;

else

while (a < 20) temp++;

Lex

program

Identifier: Var Operand: = Integer: 12 Operand: + Integer: 9

Semicolumn: ; Keyword: if

Parenthesis: ( Identifier: test ....

Input

Output

(136)

In Lex strings are described with regular expressions

“if”

“then”

“+”

“-”

“=“

/* operators */

/* keywords */

Lex program

Regular expressions

(137)

(0|1|2|3|4|5|6|7|8|9)+ /* integers */

/* identifiers */

Regular expressions

(a|b|..|z|A|B|...|Z)+

Lex program

(138)

integers

[0-9]+

(0|1|2|3|4|5|6|7|8|9)+

(139)

(a|b|..|z|A|B|...|Z)+ [a-zA-Z]+

identifiers

(140)

Each regular expression

has an associated action (in C code)

Examples:

\n

Regular expression Action

linenum++;

[a-zA-Z]+ printf(“identifier”);

[0-9]+ prinf(“integer”);

(141)

Default action: ECHO;

Prints the string identified

to the output

(142)

A small lex program

%%

[a-zA-Z]+ printf(“Identifier\n”);

[0-9]+ printf(“Integer\n”);

[ \t\n] ; /skip spaces/

(143)

1234 test

var 566 78 9800

Input Output

Integer

Identifier

Integer

(144)

%%

[a-zA-Z]+ printf(“Identifier\n”);

[0-9]+ prinf(“Integer\n”);

[ \t] ; /skip spaces/

. printf(“Error in line: %d\n”, linenum);

Another program

%{

int linenum = 1;

%}

\n linenum++;

(145)

1234 test

var 566 78 9800 +

temp

Input Output

Integer Identifier Identifier Integer Integer Integer

Error in line: 3

Identifier

(146)

Lex matches the longest input string

“if”

“ifend”

Regular Expressions

Input: ifend if Matches: “ifend” “if”

Example:

(147)

Internal Structure of Lex

Lex

Regular

expressions NFA DFA Minimal DFA

The final states of the DFA are

associated with actions

(148)

Regular Languages

Properties of Regular Languages

Standard Representations of Regular Languages

iff

Standard Representations of Regular Languages

Regular Languages

Regular

Expressions

Regular Grammars (Left/Right Linear) FA

(DFA/NFA/NFA- e)

When we say: We are given a Regular Language

We mean:

L

Language is in a standard Representation (FA/RE/RG)

L

Standard Representations of Regular Languages

Closure Properties

Given two regular languages L

and L

, is their union is also regular ? If it is true for all regular

languages, then family of regular

languages is closed under union.

L 1 L 2

Are regular Languages For regular languages and

we will prove that:

Properties of RLs

We say: The family of Regular languages are

closed under

L 1 Regular language

( ) M 1 L 1

L = M 1

Single final state

NFA M 2

L 2

Single final state

( ) M 2 L 2

L =

Regular language

NFA

Example

a

b

M 1

{ } ba

L 2 = b a

M 2

L

={a

b :n≥0}

Union

NFA for

M 1

M 2

!

∪ !

l l l

l

Example

a

b

b a

l l l

l

}

1 { a b L = n

}

2 { ba L =

} {

}

2 {

1 L a b ba

L È = n È

NFA for

Concatenation

NFA for L 1 L 2

M 1 M 2

l l

Example

NFA for

a

L 1 L ₂

( ) ^M ₁ ^L ₁

( ) ^M ₂ ^L ₂

{ } ^ba

L ₂ = ^b ^a

1 { a b L = ⁿ

L È = ⁿ È

M 1 M ₂

b _b a

1 { a b L = ⁿ

L = ⁿ = ⁿ

NFA for L ₁ *

1 * a b L = ⁿ

1 { a b L = ⁿ l

L ₁ R

1 { a b L = ⁿ

1 { a b L = ⁿ