Skip to main content
Log in

Automatic transformation from TIDES to TimeML annotation

  • Original Paper
  • Published:
Language Resources and Evaluation Aims and scope Submit manuscript

Abstract

Until recently, most systems performing temporal extraction and reasoning from text have focused on recognizing and normalizing temporal expressions alone, for which the TIDES annotation scheme has been adopted. Temporal awareness of a text, however, involves not only identifying the temporal expressions, but the events which these expressions anchor, as well as other events which must be ordered relative to them. Because of these broader concerns, TimeML has been developed as an annotation specification that encompasses not only temporal expressions, but all temporally relevant aspects of a text. The annotation schemes, however, are not interchangeable, resulting in incompatible corpora and accompanying extraction algorithms for each standard. In this paper, we describe an automatic migration process from the TIMEX2 tags of TIDES to the TIMEX3 tags of TimeML. This transformation procedure has been implemented and evaluated with two different corpora, obtaining 93.3 and 89.2% overall F-Measure respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Notes

  1. http://fofoca.mitre.org/annotation_guidelines/2005_timex2_standard_v1.1.pdf.

  2. http://www.nist.gov/speech/tests/ace/.

  3. http://timexportal.wikidot.com/systems.

  4. Section 5 in http://fofoca.mitre.org/annotation_guidelines/2005_timex2_standard_v1.1.pdf.

  5. Section 2.2.1.2 in http://www.timeml.org/site/publications/timeMLdocs/annguide_1.2.1.pdf.

  6. ti = t1, t2, t3, ….

  7. A non-consuming TIMEX3 is added to capture the duration that such range expressions introduce.

  8. A POSTAGGER is used for obtaining the preposition.

  9. getVal(tid) gets the val attribute (ISO format) for the expression identifies as tid. getValTF (temporalFunction, val) applies the respective function resolving the expression using the ISO format date, returning an ISO format date also.

  10. According to TIDES 2005 guidelines AS_OF is only used with the PRESENT_REF token, therefore the transformation follows the same performance as WITHIN.

  11. Only the boundaries of the expression are shown since this type of expressions were not being resolved in previous annotation guidelines.

  12. The current specification will reflect this proposal regarding how temporal expressions in specifier (genitive) position should be annotated: namely, they will be marked as indicated in the example 25.

  13. Only the items of the annotation related to this problem are shown for clarity.

  14. More information: timex2.mitre.org/tern_2004/ferro2_TERN2004_annotation_sanitized.pdf page 10.

  15. http://www.timeml.org/site/tarsqi/modules/gutime/index.html.

  16. http://timex2.mitre.org/taggers/timex2_taggers.html.

  17. More information: http://www.timeml.org/site/timebank/documentation-1.2.html#iaa.

  18. http://fofoca.mitre.org/tern_2004/tern_evalplan-2004.29apr04.pdf.

  19. http://gplsi.dlsi.ua.es/demos/T2T3.

References

  • Ahn, D. (2006). The stages of event extraction. For computational linguistics. ARTE: Workshop of 44th annual meeting of the association for computational linguistics, Sydney, Australia (pp. 1–8).

  • Ahn, D., Adafre, S. F., & de Rijke, M. (2005). Towards task-based temporal extraction and recognition. In G. Katz, J. Pustejovsky, & F. Schilder (Eds.), Annotating, extracting and reasoning about time and events. Volume 05151 of Dagstuhl seminar proceedings. Internationales Begegnungs- und Forschungszentrum für Informatik (IBFI), Schloss Dagstuhl, Germany Internationales Begegnungs- und Forschungszentrum für Informatik (IBFI), Schloss Dagstuhl, Germany.

  • Allen, J. (1983). Maintaining knowledge about temporal intervals. Communications of the ACM, 26(11), 832–843.

    Article  Google Scholar 

  • Allen, J. (1984). Towards a general theory of action and time. Artificial Intelligence, 23, 123–154.

    Google Scholar 

  • Bethard, S., & Martin, J. (2007). CU-TMP: Temporal classification using syntactic and semantic features. In: Proceedings of the 4th international workshop of SemEval-2007 (pp. 129–132).

  • Boguraev, B., Pustejovsky, J., Ando, R., & Verhagen, M. (2007). TimeBank evolution as a community resource for TimeML parsing. Language Resources and Evaluation, 41, 91–115.

    Article  Google Scholar 

  • Carmona, J., Cervell, S., Márquez, L., Martí, M., Padró, L., Placer, R., et al. (1998) Morphosyntactic analysis and parsing of unrestricted spanish text. In LREC: Proceedings of first international conference on language resources and evaluation, LREC 1998, Granada, Spain.

  • Gerber, L., Ferro, L., Mani, I., Sundheim, B., Wilson, G., & Kozierok, R. (2002). Annotating temporal information: From theory to practice. In Proceedings of the 2002 conference on human language technology, San Diego, CA (pp. 226–230).

  • GUTime: Georgetown University. (2008). http://www.timeml.org/site/tarsqi/modules/gutime/index.html.

  • Hacioglu, K., Chen, Y., & Douglas, B. (2005). Automatic time expression labeling for english and chinese text. In A. F. Gelbukh (Ed.), CICLing. Volume 3406 of lecture notes in computer science (pp. 548–559). New York: Springer.

  • Lee, K., Boguaraev, B., Bunt, H., & Pustejovsky, J. (2007). ISO-TimeML and its applications. In: Proceedings of the 2007 conference for ISO technical committee 37.

  • Mani, I., Hitzeman, J., Richer, J., Harris, D., Quimby, R., & Wellner, B. (2008). SpatialML: Annotation scheme, corpora, and tools. In: Proceedings of LREC 2008.

  • Mani, I., Wilson, G., Sundheim, B., & Ferro, L. (2001). Guidelines for annotating temporal information. In J. Allan (Ed.), Proceedings of HLT 2001, first international conference on human language technology research (pp. 142–144). San Francisco: Morgan Kaufmann.

  • Mazur, P., & Dale, R. (2007). The dante temporal expression tagger. In: Proceedings of the 3rd language and technology conference.

  • MUC-6. (1995). Proceedings of the sixth message understanding conference In MUC-6: Proceedings of the sixth message understanding conference, defense advanced research projects agency. San Francisco: Morgan Kaufmann.

  • MUC-7. (1998). Proceedings of the seventh message understanding conference. In MUC-7: Proceedings of the seventh message understanding conference, defense advanced research projects agency.

  • Negri, M. (2007). Dealing with italian temporal expressions: The ita-chronos system. In Proceedings of EVALITA 2007, workshop held in conjunction with AI*IA.

  • Pustejovsky, J., Castaño, J., Ingria, R., Saurí, R., Gaizauskas, R., Setzer, A., et al. (2003a) TimeML: Robust specification of event and temporal expressions in text. In: Proceedings of the fifth international workshop on computational semantics (IWCS-5).

  • Pustejovsky, J., Castaño, J. M., Ingria, R., Sauri, R., Gaizauskas, R. J., Setzer, A., et al. (2003b). TimeML: Robust specification of event and temporal expressions in text. In New directions in question answering (pp. 28–34).

  • Pustejovsky, J., Hanks, P., Saurí, R., See, A., Gaizauskas, R., Setzer, A., et al. (2003) The timebank corpus. In: Proceedings of corpus linguistics (pp. 647–656), Lancaster.

  • Pustejovsky, J., Knippen, R., Littman, J., & Saurí, R. (2005). Temporal and event information in natural language text. Language Resources and Evaluation, 39, 123–164.

    Article  Google Scholar 

  • Pustejovsky, J., Sauri, R., Castaño, J. M., Radev, D. R., Gaizauskas, R. J., Setzer, A., et al. (2004). Representing temporal and event knowledge for qa systems. In: New directions in question answering (pp. 99–112).

  • Saquete, E., Ferrández, O., Ferrández, S., Martínez-Barco, P., & Muñoz, R. (2008). Combining automatic acquisition of knowledge with machine learning approaches for multilingual temporal recognition and normalization. Information Sciences, 178, 3319–3332.

    Google Scholar 

  • Saquete, E., Martínez-Barco, P., Muñoz, R., & Vicedo, J. (2004) Splitting complex temporal questions for question answering systems. In ACL: 42nd Annual meeting of the association for computational linguistics (pp. 566–573), Barcelona, España.

  • Saquete, E., Muñoz, R., & Martínez-Barco, P. (2006). Event ordering using terseo system. Data and Knowledge Engineering Journal, 58, 70–89.

    Google Scholar 

  • TempEx: MITRE Corporation. (2008). http://timex2.mitre.org/taggers/timex2_taggers.html.

  • Technical Committee ISO/TC 154. (2004). Processes, data elements and documents in commerce, industry and administration “ISO 8601:2004(E)”.

  • TERN: Time Expression Recognition and Normalization. (2004). http://timex2.mitre.org/tern.html.

  • Verhagen, M., Mani, I., Sauri, R., Littman, J., Knippen, R., Jang, S. B., et al. (2005). Automating temporal annotation with TARSQI. In: ACL, the association for computer linguistics.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Estela Saquete.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Saquete, E., Pustejovsky, J. Automatic transformation from TIDES to TimeML annotation. Lang Resources & Evaluation 45, 495–523 (2011). https://doi.org/10.1007/s10579-011-9147-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10579-011-9147-y

Keywords

Navigation