Languages in the Stack v2

Derek <derek-nospam@shape-of-code.com>
Wed, 05 Jun 2024 00:43:38 +0100

          From comp.compilers

Related articles
Languages in the Stack v2 derek-nospam@shape-of-code.com (Derek) (2024-06-05)
| List of all articles for this month |

From: Derek <derek-nospam@shape-of-code.com>
Newsgroups: comp.compilers
Date: Wed, 05 Jun 2024 00:43:38 +0100
Organization: Compilers Central
Injection-Info: gal.iecc.com; posting-host="news.iecc.com:2001:470:1f07:1126:0:676f:7373:6970"; logging-data="29063"; mail-complaints-to="abuse@iecc.com"
Keywords: history
Posted-Date: 05 Jun 2024 03:40:56 EDT

All,


The quantity of source code present in version 2 of the Stack,
a public source code repo designed for training LLMS,
provides an interesting insight into long-term usage
of languages over time
https://huggingface.co/datasets/bigcode/the-stack-v2


Technical details here
https://arxiv.org/abs/2402.19173


Post a followup to this message

Return to the comp.compilers page.
Search the comp.compilers archives again.