The Bloomberg News Agency unveiled the BloombergGPT large language model, capable of answering questions from the finance and business sectors.
The 50-billion-parameter algorithm is built on the same technology as OpenAI’s GPT. According to the company, their neural network “with a significant lead” outperforms open models of similar size in financial and NLP tasks without sacrificing performance.
The specialists trained a LLM on a corpus of 363 billion tokens of English-language financial data collected by the company’s analysts over the last 40 years. They include Bloomberg’s internal information, securities documents, press releases, news, and publications from various outlets.
Experts also supplemented the training dataset with a general-purpose set of 345 billion tokens, containing information from GitHub, Wikipedia, YouTube and other sources.
In training the Bloomberg model, Nvidia A100 accelerators were used in the AWS cloud. The training process took about 1.3 million GPU-hours.
The model can determine whether news headlines are bearish or bullish for investors. It can replace company names with stock tickers, detect important names in documentation, and answer questions about business such as “who is the chief executive of the firm”.
The technology also has some generative AI capabilities. For example, it can write article headlines based on short annotations.
The agency said it does not plan to release the algorithm due to the risk of leaking confidential data.
The company also does not intend to compete with OpenAI’s ChatGPT. For now, Bloomberg is using AI to expand the functionality of its Terminal service for studying and analysing the financial market.
Back in January 2023, it emerged that the information portal CNET had used artificial intelligence to write a series of articles.
Subsequently, readers and editors discovered errors in the texts. As a result, the publication paused the release of new materials.
Nevertheless, two weeks later, CNET said that they would continue to write articles with the help of AI.
