• WEIDONG LIU School of Computer Engineering and Science, Shanghai University Shanghai, China
  • XIANGFENG LUO School of Computer Engineering and Science, Shanghai University Shanghai, China
  • JUNYU XUAN School of Computer Engineering and Science, Shanghai University Shanghai, China
  • DANDAN JIANG School of Computer Engineering and Science, Shanghai University Shanghai, China
  • ZHENG XU The Third Research Institute of Ministry of Public Security Shanghai, China


association link network, semantic coherence measurement, short text of web events


As novel web social Media emerges on the web, large-scale short texts are springing up. Although these massive short texts contain rich information, their disorder nature makes users dicult to obtain the desired knowledge from them, especially the semantic coherent knowledge. Dierent orders of these short texts often express dierent seman- tic coherence states. Therefore, how to automatically measure semantic coherence of short texts is a fundamental and signicant problem for web knowledge services. Ex- isting related works on the semantic coherence measurement of dierent orders of short texts/sentences seldom focus on graph structure of semantic link network for re ecting coherence change, measuring coherence by these graph-based features and discovering some interesting coherence patterns. In this paper, we propose an association link net- work based semantic coherence measurement for short texts of web events. Our method rstly construct an association link network from which some graph-based features are then extracted to measure semantic coherence of dierent orders and lastly some co- herence patterns are discovered for guiding automatically text ordering/generation. To validate correctness of our method, we conduct a series of experiments including sentence order permutation, sentence removal and adding/replacing sentence and compare with other two methods. The results show that our method can measure semantic coherence with higher accuracy and outperforms other methods in some experiments. Such method can be widely applied in web text automatic generation, web short text organization and web event summarization etc.


Download data is not yet available.


R. Beaugrande, "DE, DRESSLER W.(1981)," Introduction to text linguistics.

A. M. Collins and E. F. Loftus (1975), A spreading-activation theory of semantic processing Psy-

chological review, vol. 82, pp. 407-428.

A. C. Graesser, M. Singer, and T. Trabasso (1994), Constructing inferences during narrative text

comprehension Psychological review, vol. 101, p. 371.

M. Lapata and R. Barzilay(2005), Automatic evaluation of text coherence: Models and represen-

tations in International Joint Conference On Arti cial Intelligence, p. 1085.

R. Barzilay and M. Lapata (2008), Modeling local coherence: An entity-based approach Computa-

tional Linguistics, vol. 34, pp. 1-34.

Z. Lin, H. T. Ng, and M.-Y. Kan(2011), Automatically evaluating text coherence using discourse

relations in Proceedings of the 49th Annual Meeting of the Association for Computational Lin-

guistics: Human Language Technologies-Volume 1, pp. 997-1006.

D. Newman, J. H. Lau, K. Grieser, and T. Baldwin(2010), Automatic evaluation of topic coherence

in Human Language Technologies: The 2010 Annual Conference of the North American Chapter

of the Association for Computational Linguistics, pp. 100-108.

A. C. Graesser, D. S. McNamara, M. M. Louwerse, and Z. Cai(2004), Coh-Metrix: Analysis of

text on cohesion and language Behavior Research Methods, Instruments, & Computers, vol. 36,

pp. 193-202.

T. Nahnsen(2009), Domain-independent shallow sentence orderingin Proceedings of Human Lan-

guage Technologies: The 2009 Annual Conference of the North American Chapter of the As-

sociation for Computational Linguistics, Companion Volume: Student Research Workshop and

Doctoral Consortium, 2009, pp. 78-83.

D. Marcu(1997), From local to global coherence: A bottom-up approach to text planning in Pro-

ceedings of the National Conference on Arti cial Intelligence, pp. 629-636.

R. Zhang(2011), Sentence ordering driven by local and global coherence for summary generation

in Proceedings of the ACL, pp. 6-11.

M. Lapata(2003), Probabilistic text structuring: Experiments with sentence ordering in Proceedings

of the 41st Annual Meeting on Association for Computational Linguistics-Volume 1, pp. 545-552.

R. Barzilay and L. Lee(2004), Catching the drift: Probabilistic content models, with applications

to generation and summarization in Proceedings of HLT-NAACL.

R. Barzilay(2003), Information fusion for multidocument summarization: paraphrasing and gen-

eration Columbia University.

R. Soricut and D. Marcu(2006), Discourse generation using utility-trained coherence models in

Proceedings of the COLING/ACL on Main conference poster sessions, pp. 803-810.

M. Elsner, J. Austerweil, and E. Charniak(2007), A uni ed local and global model for discourse

coherence in Proceedings of NAACL/HLT.

N. Fang, X. Luo, and W. Xu(2009), Measuring textual context based on cognitive principles Inter-

national Journal of Software Science and Computational Intelligence (IJSSCI), vol. 1, pp. 61-89.

J.L. Austerweil, J.T. Abbott and T.L. Griths(2012), Human memory search as a random walk

in a semantic network//Advances in neural information processing systems.: 3041-3049.

T. Joachims(2002), Optimizing search engines using clickthrough data. In: Proceedings of the

Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp.


F. C. T. Chua and S. Asur (2013), Automatic summarization of events from social media, in:


L. Shou, Z.Wang, K. Chen and G. Chen(2013), Sumblr: continuous summarization 490 of evolving

tweet streams, in: Proceedings of the 36th international ACM SIGIR conference on Research and

development in information retrieval, ACM, pp. 533-542.

J. Leskovec, L. Backstrom and J. Kleinberg(2009), Meme-tracking and the dynamics of the news

cycle in: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery

and data mining, 495 ACM, pp. 497-506.

J. R. Quinlan(1986), Induction of Decision Trees. Mach. Learn. 1, 1, 81-106.

X. Luo, J. Xuan and H. Liu(2014), itWeb event state prediction model: combining prior knowledge

with real time data. Journal of Web Engineering 13.5-6. 483-506.