geminispace.info

gemini search engine
git clone https://git.clttr.info/geminispace.info.git

commit 670ac5a1f38565d9b0f36fc5758edde01ca3e4fd
parent 9de31b2aa50f5c35b13f33eb5506d560f180844b
Author: René Wagner <rwa@clttr.info>
Date:   Thu, 18 Aug 2022 10:57:23 +0200

news 2022-08-18

Diffstat:
M serve/templates/news.gmi | 6 ++++++
1 file changed, 6 insertions(+), 0 deletions(-)

diff --git a/serve/templates/news.gmi b/serve/templates/news.gmi
@@ -2,6 +2,12 @@
 ## News
+### 2022-08-18 duplicate results
+Due to a small glitch in the crawler we had duplicate results in the dataset for a few weeks.
+Thanks to the report of Acidus this has now been fixed and the duplicate entries were removed.
+
+Despite this, gemini keeps growing organically. The raw data known to geminispace.info at the moment exceeds 10 GB of data and we already exclude some high traffic capsules like news or wikipedia relays.
+
 ### 2022-07-21 crawling issues
 We had some crawling issues in the last days. In the end it turns out someone decided to serve huge video files over gemini. At the moment we process all files in memory, so the crawl simply got killed by the oom-killer once the downloaded video size hits the available memory.