In 7th post, we list some of the basic processing we do over the news for clustering them & adding media & other info (no semantic here)