You can now
corpus for offline use, including monthly
updates via a subscription. To date. this has added up to about
5.9 billion words of
data that you can have on your own machine. There's
nothing else like it.
corpus contains more than billion words of
text from online magazines and newspapers in 20
different English-speaking countries from 2010 to the
current time (see sources).
It is by far the largest corpus (of any language) that
is available in full-text format. Most importantly, the
corpus grows by 4-5 million words of data each day. This translates to about 100-120 million words each month and 1.2 billion
words each year. If you're interested in what's going on
in English up to and including right now, this is by far
the best corpus available.
When you purchase the
full-text data from NOW, you get all of the data up
through the month of purchase. You can also purchase an annual subscription, which will
give you another year's worth of data (typically about
1.5 billion words each year). For example, if you
purchase both datasets on September 15 2018, you would have
the data from Jan 2010 - August 2018 (which was released on
Sep 3; #1 below), and an annual subscription would give
you the data from Sep 2018 - Aug 2019 (#2 below).
If you purchase just #1
above, it would be the price of
one corpus, and there would be a
discount for purchasing both
corpora (#1 and #2) at the same time.
Note also that the monthly updates will be
released by the 3rd of the following month (at the
latest). You will be notified
as soon as the update is available, and you will have
ten days to download the data.