What are ML artifacts?
Motivated by recent advances in large language models for Natural Language Processing (NLP), we design a time-series foundation model for forecasting whose out-of-the-box zero-shot performance on a variety of public datasets comes close to the accuracy of state-of-the-art supervised forecasting models for each individual dataset. Our model is based on pretraining a patched-decoder style attention model on a large time-series corpus, and can work well across different forecasting history lengths, prediction lengths and temporal granularities.
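A minimal sketch of the patching idea described above, in a toy PyTorch setup: the series is cut into fixed-size patches, each patch becomes one token for a causally-masked transformer, and each position predicts the next patch. The class name, patch size, and model dimensions are illustrative assumptions, not the paper's actual architecture.

```python
import torch
import torch.nn as nn

class PatchedDecoderForecaster(nn.Module):
    """Toy patched-decoder forecaster: split the series into fixed-size
    patches, embed each patch as a token, run a causally-masked
    transformer, and predict the next patch from each position."""
    def __init__(self, patch_len=32, d_model=128, n_heads=4, n_layers=2):
        super().__init__()
        self.patch_len = patch_len
        self.embed = nn.Linear(patch_len, d_model)      # patch -> token
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.decoder = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, patch_len)       # token -> next patch

    def forward(self, series):                          # series: (batch, time)
        b, t = series.shape
        patches = series[:, : t - t % self.patch_len]   # drop the ragged tail
        patches = patches.reshape(b, -1, self.patch_len)
        x = self.embed(patches)
        mask = nn.Transformer.generate_square_subsequent_mask(x.size(1))
        x = self.decoder(x, mask=mask)                  # causal attention
        return self.head(x)                             # next-patch prediction per position

model = PatchedDecoderForecaster()
history = torch.randn(8, 256)        # 8 synthetic series, 256 steps each
next_patches = model(history)        # shape (8, 8, 32)
```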
To better control for risk, we construct a novel machine-learning-based value factor and find that it outperforms existing value factors while earning less from risk and more from mispricings.
This work thus provides strong empirical evidence towards developing scaling laws for reinforcement learning.
We document return predictability from deep-learning models that cannot be explained by common risk factors or limits to arbitrage.
Statistical arbitrage portfolios with graph clustering algorithms
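One common recipe behind this idea, sketched below on synthetic data: build a graph whose nodes are assets and whose edges are strong return correlations, then let a community-detection algorithm carve out clusters to trade within. The correlation threshold, the greedy-modularity algorithm, and the fabricated returns are all my assumptions, not necessarily what the linked work uses.

```python
import numpy as np
import networkx as nx
from networkx.algorithms.community import greedy_modularity_communities

# Synthetic returns with 4 hidden "sectors" of 5 assets each.
rng = np.random.default_rng(0)
days, per_group = 250, 5
factors = rng.standard_normal((days, 4))            # latent sector factors
returns = np.hstack([
    factors[:, [g]] + 0.5 * rng.standard_normal((days, per_group))
    for g in range(4)
])                                                  # (250 days, 20 assets)

# Graph of assets, keeping only strongly co-moving pairs as edges.
corr = np.corrcoef(returns.T)
G = nx.Graph()
n = corr.shape[0]
G.add_nodes_from(range(n))
for i in range(n):
    for j in range(i + 1, n):
        if corr[i, j] > 0.5:
            G.add_edge(i, j, weight=corr[i, j])

# Community detection recovers the hidden groups.
clusters = greedy_modularity_communities(G, weight="weight")
for k, members in enumerate(clusters):
    print(f"cluster {k}: assets {sorted(members)}")
# A stat-arb pipeline would then fit spreads within each cluster
# and trade deviations from the cluster mean.
```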
Our online approach requires less memory because data is processed continuously rather than stored. Moreover, our network learns from each data sample only once, which substantially reduces energy use and makes training highly efficient.
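A minimal sketch of that single-pass online regime, using scikit-learn's `partial_fit` as a stand-in for the paper's network: each sample triggers exactly one update and is then discarded, so memory stays constant however long the stream runs. The linear model and hyperparameters are illustrative assumptions.

```python
import numpy as np
from sklearn.linear_model import SGDRegressor

# Single-pass online learning: one gradient step per incoming sample,
# and the sample is never stored or revisited.
model = SGDRegressor(learning_rate="constant", eta0=0.01)

rng = np.random.default_rng(0)
true_w = np.array([2.0, -1.0, 0.5])

for t in range(10_000):                        # simulated data stream
    x = rng.standard_normal(3)
    y = x @ true_w + 0.1 * rng.standard_normal()
    model.partial_fit(x.reshape(1, -1), [y])   # single update, constant memory

print(model.coef_)                             # approaches true_w after one pass
```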
"Unlike in CV and NLP, the field of time series lacks publicly accessible large-scale datasets."
The complaint lays out step by step why the plaintiffs believe the datasets have illicit origins: in a Meta paper detailing LLaMA, the company points to sources for its training datasets, one of which is The Pile, assembled by the research group EleutherAI. The Pile, the complaint points out, was described in an EleutherAI paper as being put together from “a copy of the contents of the Bibliotik private tracker.” Bibliotik and the other “shadow libraries” listed, says the lawsuit, are “flagrantly illegal.”
With a new Fill-in-the-Middle paradigm, GitHub engineers improved the way GitHub Copilot contextualizes your code. By continuing to develop and test advanced retrieval algorithms, they're working on making the AI tool even more capable.
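For intuition, a hypothetical sketch of what a fill-in-the-middle (FIM) prompt looks like. The sentinel tokens below follow the `<fim_*>` convention popularized by open code models; GitHub has not published Copilot's actual format, so treat the specifics as assumptions.

```python
# Hypothetical FIM prompt construction: the model conditions on code
# both before and after the cursor, then generates the missing middle.
def build_fim_prompt(prefix: str, suffix: str) -> str:
    return f"<fim_prefix>{prefix}<fim_suffix>{suffix}<fim_middle>"

prefix = "def mean(xs):\n    total = "
suffix = "\n    return total / len(xs)\n"
prompt = build_fim_prompt(prefix, suffix)
# A FIM-trained model's completion after <fim_middle> is the missing span,
# e.g. "sum(xs)", informed by the code *after* the insertion point.
print(prompt)
```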
Source: Latent Space Podcast Ep. 2: Why you are holding your GPUs wrong. OpenAI just rollicked the AI world yet again yesterday: while releasing the long-awaited ChatGPT API, they also priced it at $2 per million tokens generated, which is 90% cheaper than the text-davinci-003 pricing of the “GPT3.5” family. Their blogpost on how they did it is vague: Through a series…
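The 90% figure checks out against the published launch prices: gpt-3.5-turbo at $0.002 per 1K tokens versus text-davinci-003 at $0.02 per 1K.

```python
# Quick check of the quoted discount.
turbo = 0.002 * 1000      # $ per million tokens -> $2
davinci = 0.02 * 1000     # $ per million tokens -> $20
print(f"${turbo:.0f}/M vs ${davinci:.0f}/M -> {1 - turbo / davinci:.0%} cheaper")
# -> $2/M vs $20/M -> 90% cheaper
```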
A "Copilot for X" guide from the team that built the first real Copilot competitor!
The remarkable zero-shot learning capabilities demonstrated by large foundation models (LFMs) like ChatGPT and GPT-4 have sparked a question: Can these models autonomously supervise their own behavior or other models with minimal human intervention? To explore this, a team of Microsoft researchers introduces Orca, a 13-billion-parameter model that learns complex explanation traces and step-by-step thought processes from GPT-4. This approach significantly improves the performance of existing state-of-the-art instruction-tuned models, addressing challenges related to task diversity, query complexity, and data scaling. The researchers note that query-response pairs from GPT-4 can provide valuable guidance for student models.