To compute the sentences for different purposes in Natural Language Processing, there is a need to identify the complex sentences and make them simple. For grammar checking, machine translation, summarization etc, corpus with simple sentences is required. So, Simplification of complex sentences is a major task of NLP. There are different types of complex sentences with different features. This paper works on Punjabi language and informs about the different types of complex sentences and uses Conditional Random Field (CRF), a statistical approach to identify complex sentences from Punjabi corpus.
Natural Language Processing, Complex Sentences, CRF, Clauses