Content-based genre classification of large texts