Proper parsing of the <DateLine> element (not to be confused by schema:contentLocation)
The <DateLine>
element corresponds to the date and the location where the press release has been written! This does not necessarily match when and where the event being reported in the articles took place.
This element is not yet parsed by the converter. The rnews:dateline
property should be used, the value being a simple string and corresponding to the sole 'location' where the article has been produced. Regex / substring should be done to remove the date and the source '(AFP)'.
At the moment, we also generate values for the property schema:contentLocation
which corresponds to The location depicted or described in the content. This property is populated using the <Location>
element which is correct.