Sigrid De los Santos committed on
Commit 9df4cc0 · 1 Parent(s): 342fd5f

Remove remaining binary file for Hugging Face

This view is limited to 50 files because it contains too many changes.

Files changed (50)
  1. ai_analysis/fin_sentiment.py +17 -0
  2. ai_analysis/fin_signal_tagging.py +15 -0
  3. data/ai_2025-06-03.md +109 -0
  4. data/ai_2025-07-04.md +78 -0
  5. data/ai_2025-07-04_1.md +104 -0
  6. data/combined_report.md +700 -0
  7. data/mining_2025-07-04.md +89 -0
  8. data/nuclear_energy_2025-06-03.md +130 -0
  9. data/nuclear_energy_2025-06-03_1.md +111 -0
  10. data/nuclear_energy_2025-07-02.md +133 -0
  11. data/nuclear_energy_2025-07-04.md +117 -0
  12. external/.DS_Store +0 -0
  13. external/FinGPT/.github/FUNDING.yml +12 -0
  14. external/FinGPT/.github/ISSUE_TEMPLATE/feature_request.md +20 -0
  15. external/FinGPT/.gitignore +141 -0
  16. external/FinGPT/.gitpod.yml +10 -0
  17. external/FinGPT/.idea/.gitignore +3 -0
  18. external/FinGPT/CODE_OF_CONDUCT.md +65 -0
  19. external/FinGPT/CONTRIBUTING.md +68 -0
  20. external/FinGPT/FinGPT_ Training with LoRA and Meta-Llama-3-8B.ipynb +0 -0
  21. external/FinGPT/FinGPT_Inference_Llama2_13B_falcon_7B_for_Beginners.ipynb +0 -0
  22. external/FinGPT/FinGPT_Training_LoRA_with_ChatGLM2_6B_for_Beginners_v2-2.ipynb +0 -0
  23. external/FinGPT/LICENSE +21 -0
  24. external/FinGPT/MANIFEST.in +1 -0
  25. external/FinGPT/README.md +384 -0
  26. external/FinGPT/fingpt/FinGPT_Benchmark/__init__.py +2 -0
  27. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/__init__.py +3 -0
  28. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/benchmarks.py +114 -0
  29. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/convfinqa.py +75 -0
  30. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/evaluate.sh +395 -0
  31. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/fineval.py +72 -0
  32. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/finred.py +150 -0
  33. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/fiqa.py +176 -0
  34. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/fpb.py +168 -0
  35. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/headline.py +84 -0
  36. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/ner.py +94 -0
  37. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/nwgi.py +86 -0
  38. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/sentiment_templates.txt +5 -0
  39. external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/tfns.py +82 -0
  40. external/FinGPT/fingpt/FinGPT_Benchmark/config.json +33 -0
  41. external/FinGPT/fingpt/FinGPT_Benchmark/config_hf.json +11 -0
  42. external/FinGPT/fingpt/FinGPT_Benchmark/config_new.json +35 -0
  43. external/FinGPT/fingpt/FinGPT_Benchmark/data/__init__.py +0 -0
  44. external/FinGPT/fingpt/FinGPT_Benchmark/data/download.py +41 -0
  45. external/FinGPT/fingpt/FinGPT_Benchmark/data/prepare_data.ipynb +0 -0
  46. external/FinGPT/fingpt/FinGPT_Benchmark/demo.ipynb +715 -0
  47. external/FinGPT/fingpt/FinGPT_Benchmark/readme.md +169 -0
  48. external/FinGPT/fingpt/FinGPT_Benchmark/train.sh +547 -0
  49. external/FinGPT/fingpt/FinGPT_Benchmark/train_lora.py +198 -0
  50. external/FinGPT/fingpt/FinGPT_Benchmark/utils.py +216 -0
ai_analysis/fin_sentiment.py ADDED
@@ -0,0 +1,17 @@
+ # Uses a Hugging Face pipeline to classify text sentiment (positive, neutral, negative) with FinBERT.
+
+ from transformers import pipeline
+
+ # Load the FinBERT financial sentiment pipeline once at module import.
+ # With top_k=None the pipeline returns scores for all labels, sorted by confidence.
+ sentiment_pipeline = pipeline(
+     "sentiment-analysis",
+     model="ProsusAI/finbert",
+     top_k=None
+ )
+
+ def analyze_sentiment(text):
+     try:
+         # Crude character-based cut to keep the input within the 512-token limit.
+         result = sentiment_pipeline(text[:512])[0]  # top-scoring label
+         return result["label"].lower(), round(result["score"], 3)
+     except Exception:
+         return "error", 0.0
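A minimal usage sketch (the headline is made up, and the import path assumes the repo's `ai_analysis` package is importable; the first call downloads the model):

```python
from ai_analysis.fin_sentiment import analyze_sentiment

label, score = analyze_sentiment("Acme Corp beats Q2 revenue estimates and raises guidance")
print(label, score)  # e.g. "positive" 0.94; the exact score depends on the model
```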
ai_analysis/fin_signal_tagging.py ADDED
@@ -0,0 +1,15 @@
+ import re
+
+ # Keywords that signal funding, M&A, or financial-performance events.
+ FINANCIAL_KEYWORDS = [
+     "IPO", "Series A", "Series B", "funding", "acquisition", "merger",
+     "partnership", "earnings", "revenue", "valuation", "investment",
+     "raise", "round", "debt", "exit", "seed", "growth", "MoM", "ARR", "burn rate"
+ ]
+
+ def extract_signals(text):
+     # Match whole words/phrases, case-insensitively; lowercase the text once.
+     lowered = text.lower()
+     found = []
+     for kw in FINANCIAL_KEYWORDS:
+         pattern = r"\b" + re.escape(kw.lower()) + r"\b"
+         if re.search(pattern, lowered):
+             found.append(kw)
+     return list(set(found))
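An illustrative call (hypothetical sentence) showing that matching is whole-word and case-insensitive, so "fund" does not trigger the "funding" keyword:

```python
from ai_analysis.fin_signal_tagging import extract_signals

signals = extract_signals("The startup closed a seed round to fund its IPO ambitions.")
print(sorted(signals))  # ['IPO', 'round', 'seed']
```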
data/ai_2025-06-03.md ADDED
@@ -0,0 +1,109 @@
+
+ > **Metrics**
+ > Topic: `AI`
+ > Articles Collected: `371`
+ > Generated: `2025-06-03 11:57`
+ >
+ # AI Value Investing Memo – Week Ending 2 June 2025 (Weekly Focus)
+
+ ## **Intro & Market Context**
+
+ This week in AI, the market accelerated along its high-anticipation trajectory, with a cluster of activity in startup fundraising, M&A, and fresh enterprise adoption. While no single "breakthrough" event dominated headlines, several key themes emerged: (1) venture capital continues to quietly roll up smaller firms into AI-centric portfolios, (2) corporate M&A is ramping up in the AI space, (3) established tech giants are focusing on massive compute expansion as agentic AI demand surges, (4) novel applications (psychiatry, fintech, legaltech, and biology) are moving into commercial and even IPO-ready scale, and (5) regulatory and privacy debate continues to follow AI's march into sensitive sectors.
+
+ General market sentiment remains optimistic but increasingly bifurcated: public equities in megacap AI (NVDA, MSFT, GOOG) are expensive, while an undercurrent of deep value persists among small caps and M&A targets. Smart money is increasingly shifting attention to niche, high-moat AI firms not yet in Wall Street's spotlight, particularly those with strong cash flows or unique IP.
+
+ ---
+
+ ## **1. Key Value Signals**
+
+ - **Startup Fundraising Surge:** Early- and mid-stage AI startups (Rillet, Snabbit, Inven, Valla, Symbl.ai) raised significant capital despite macro volatility ([TechCrunch](https://techcrunch.com/2025/05/30/startups-weekly-amd-acquisition-and-other-moves-to-scale-ai-startups/), [Tech Funding News](https://techfundingnews.com/next-gen-ai-pitchbook-rival-finnish-inven-grabs-12-75-for-its-first-ai-native-deal-sourcing-platform/), [TechCrunch](https://techcrunch.com/2025/06/02/valla-raises-2-7m-to-make-legal-recourse-more-accessible-to-employees/)).
+ - **Venture Roll-Ups:** Khosla Ventures and Elad Gil are investing in AI-powered rollups of mature, cash-flow-positive companies — a signal that expertise and customer lists are the next moat ([TechCrunch](https://techcrunch.com/2025/05/23/khosla-ventures-among-vcs-experimenting-with-ai-infused-roll-ups-of-mature-companies/), [TechCrunch](https://techcrunch.com/2025/06/01/early-ai-investor-elad-gil-finds-his-next-big-bet-ai-powered-rollups/)).
+ - **Compute Demand Surge:** Fintech, health, and banking are adopting agentic AI, creating enormous compute needs (100x growth potential) and favoring scale datacenter and semiconductor players ([FinTech Futures](https://www.fintechfutures.com/ai-in-fintech/unlock-fintech-innovation-with-agentic-ai-ai-factories-and-ai-powered-fraud-detection-workflows)).
+ - **M&A: Strategic AI Acquisitions:** Leidos (LDOS) acquires AI/cyber firm Kudu Dynamics. Invoca acquires Symbl.ai — precedent for AI-focused M&A across sectors ([Axios](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium)).
+ - **AI in Regulated Sectors:** Major inroads in banking (fraud, loan origination), health (psychiatry, biology), and legaltech (Valla, legal recourse for employees) ([Nature](https://www.nature.com/articles/s41380-025-03072-3), [Rude Baguette](https://www.rudebaguette.com/en/2025/06/ai-finally-did-it-breakthrough-in-biology-solves-a-mystery-scientists-have-been-chasing-for-over-30-years/)).
+ - **Data Privacy & Regulation:** Growing calls for comprehensive regulation — creates compliance and consulting tailwinds for niche AI/data security players ([Dark Reading](https://www.darkreading.com/cyber-risk/rethinking-data-privacy-age-generative-ai)).
+
+ ---
+
+ ## **2. Stocks or Startups to Watch**
+
+ ### **Public Companies**
+ - **Leidos Holdings (NYSE: LDOS)**
+   - Trigger: Acquired AI-focused cyber firm Kudu Dynamics for $300M cash ([Axios](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium)).
+   - Stats: P/E ~16, ROE ~16%, Market Cap ~$17.6B (as of May 2025); stable defense/cyber/AI mix, decent value for its sector.
+   - Watch for: Expanded AI defense/cyber offering, M&A synergy upside.
+
+ - **Invoca** (private, potential IPO/M&A target)
+   - Trigger: Acquired Symbl.ai (AI-powered customer experience, $23M funding) — raises its profile as a revenue automation leader.
+
+ ### **Notable Startups & VC-Backed Companies**
+ - **Rillet**
+   - Trigger: Raised $25M Series A (Sequoia, <1 yr post-seed). Focus: AI for finance/accounting automation ([TechCrunch](https://techcrunch.com/2025/05/30/startups-weekly-amd-acquisition-and-other-moves-to-scale-ai-startups/)).
+   - Value Note: Early institutional traction and rapid fundraising in a nascent AI-for-services vertical.
+
+ - **Valla**
+   - Trigger: $2.7M seed to democratize legal recourse using GenAI; focus on employee rights ([TechCrunch](https://techcrunch.com/2025/06/02/valla-raises-2-7m-to-make-legal-recourse-more-accessible-to-employees/)).
+   - Value Note: High regulatory moat, early traction, strong founder narrative.
+
+ - **Inven**
+   - Trigger: $12.75M for AI-native deal sourcing (potential to disrupt PitchBook and legacy PE data vendors) ([Tech Funding News](https://techfundingnews.com/next-gen-ai-pitchbook-rival-finnish-inven-grabs-12-75-for-its-first-ai-native-deal-sourcing-platform/)).
+   - Value Note: Unique vertical for AI, early validation.
+
+ - **Symbl.ai** (acquired by Invoca)
+   - Trigger: AI-powered conversation intelligence; validates the VC-funded exit path for vertical AI.
+
+ - **Agentic AI, Data Privacy, and Fraud Detection Startups**
+   - Trigger: Fintech demand for agentic AI, "AI factories", and fraud detection = greenfield for private AI infra startups ([FinTech Futures](https://www.fintechfutures.com/ai-in-fintech/unlock-fintech-innovation-with-agentic-ai-ai-factories-and-ai-powered-fraud-detection-workflows)).
+
+ ---
+
+ ## **3. What Smart Money Might Be Acting On**
+
+ - **AI Rollup Trend:** Major VCs (Khosla, Elad Gil) are moving beyond backing pure-play startups to quietly acquiring and aggregating legacy companies, layering AI products on top ([TechCrunch](https://techcrunch.com/2025/05/23/khosla-ventures-among-vcs-experimenting-with-ai-infused-roll-ups-of-mature-companies/), [TechCrunch](https://techcrunch.com/2025/06/01/early-ai-investor-elad-gil-finds-his-next-big-bet-ai-powered-rollups/)).
+   - **Why:** Lower risk than bleeding-edge AI bets, immediate cash flow, and quick access to hard-to-get enterprise customers.
+
+ - **Enterprise AI B2B:** Bet on startups with regulatory/vertical moats (finance, healthcare, legal) rather than direct consumer GenAI, where hype and competition are fierce.
+
+ - **AI-Driven M&A:** Incumbents in security, defense, and SaaS (like Leidos, Invoca) are primed to bolt on AI capabilities quickly — making small-cap public firms with unique IP potential targets.
+
+ - **Compute Infrastructure:** Buy or build into companies with data center or AI-chip exposure, or proprietary algorithms serving banks, fintechs, or life sciences.
+
+ - **Compliance/Privacy:** Funds may flow to specialist consultancies and SaaS with a privacy/compliance focus as the regulatory overhang tightens.
+
+ ---
+
+ ## **4. References**
+
+ - [AI in Psychiatry](https://www.nature.com/articles/s41380-025-03072-3) | [AI for Biology](https://www.rudebaguette.com/en/2025/06/ai-finally-did-it-breakthrough-in-biology-solves-a-mystery-scientists-have-been-chasing-for-over-30-years/)
+ - [Fintech/Agentic AI Demand](https://www.fintechfutures.com/ai-in-fintech/unlock-fintech-innovation-with-agentic-ai-ai-factories-and-ai-powered-fraud-detection-workflows)
+ - [Leidos, Invoca M&A](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium)
+ - [Khosla/Elad Gil Rollups](https://techcrunch.com/2025/05/23/khosla-ventures-among-vcs-experimenting-with-ai-infused-roll-ups-of-mature-companies/), [TechCrunch on Rollups](https://techcrunch.com/2025/06/01/early-ai-investor-elad-gil-finds-his-next-big-bet-ai-powered-rollups/)
+ - [Rillet/Startup Rounds](https://techcrunch.com/2025/05/30/startups-weekly-amd-acquisition-and-other-moves-to-scale-ai-startups/), [Inven Raise](https://techfundingnews.com/next-gen-ai-pitchbook-rival-finnish-inven-grabs-12-75-for-its-first-ai-native-deal-sourcing-platform/), [Valla Seed](https://techcrunch.com/2025/06/02/valla-raises-2-7m-to-make-legal-recourse-more-accessible-to-employees/)
+ - [Data Privacy/Regulation](https://www.darkreading.com/cyber-risk/rethinking-data-privacy-age-generative-ai)
+ - [AI Macro Debate](https://www.forbes.com/sites/bernardmarr/2025/05/23/ai-could-reshape-humanity-and-we-have-no-plan-for-it/)
+
+ ---
+
+ ## **5. Investment Hypothesis**
+
+ - **The Cream Rises:** Amid the AI hype, value is accruing fastest to (a) established firms acquiring AI/niche tech, (b) small/midcap vertical SaaS/AI companies with regulatory moats, and (c) strategic AI-powered rollups with serious institutional expertise and cash flow.
+ - **Key Thesis:** Ignore the frothy megacap multiples; focus on under-followed AI stocks and private companies with:
+   - Proven B2B or SaaS revenue,
+   - Unique IP and defensible verticals,
+   - Cash flow or recent M&A/VC validation,
+   - Potential as rollup or acquisition targets.
+ - **Tailwind:** Surging demand in regulated and semi-regulated verticals (health, finance, defense, legal).
+ - **Headwind:** Regulatory/ethical scrutiny could increase the cost of doing business for generalist GenAI players — favoring those with purpose-built compliance tools or vertical knowledge.
+
+ **Bottom Line:**
+ - Watch for M&A in defense/cybersecurity, SaaS, and AI-powered B2B plays (Leidos, Invoca, Rillet).
+ - Track VC-backed AI rollups as stealth vehicles for value creation and future IPO/M&A pops.
+ - Seek out early-stage startups in vertical SaaS or legaltech deploying AI in compliance-intensive settings.
+ - Physical compute/infra players serving agentic AI (AMD, data centers) continue to benefit from secular demand.
+
+ ---
+
+ **Broad Summary of This Week's AI News:**
+ The week was dominated by continued VC confidence, strategic M&A, and institutional moves in vertical AI applications, with significant attention on small-cap and startup valuations. The macroeconomic backdrop remains strong for AI demand, but value opportunities lie beneath the surface in rollups, newly funded vertical SaaS, and compliance-driven niches. Regulatory risk is rising but is also carving out new investable moats.
+
+ ---
data/ai_2025-07-04.md ADDED
@@ -0,0 +1,78 @@
+
+ > Topic: `AI`
+ > Articles Collected: `163`
+ > Generated: `2025-07-04 13:40`
+ >
+ # Value Investor AI Weekly Memo
+ **Week of June 30 – July 6, 2025**
+
+ ---
+
+ ## Market Sentiment & Trends
+
+ This week, the AI market continues to be characterized by robust *growth optimism* and *massive capital deployment*. Sentiment remains largely positive in infrastructure and applied AI, but there is rising skepticism toward sky-high private market valuations at some fast-following startups. Major headlines focus on AI’s influence in the cybersecurity, legal, HR, and consulting verticals, as well as the continuing "picks and shovels" theme in datacenter hardware and services.
+ *No major regulatory shocks noted*, but institutions and investors are expressing caution about the sustainability of AI startup valuations and possible hype cycles.
+
+ ---
+
+ ## 1. Key Value Signals
+
+ - **Infrastructure Focus Remains Dominant:** The highest conviction for value investing is in AI infrastructure—hardware, datacenters, and core networking.
+ - **M&A and Partnership Activity:** Notable signals include Apple considering partnerships/acquisitions for Siri enhancements (Anthropic, OpenAI) and SoftBank moving aggressively on artificial superintelligence with multi-phase global projects.
+ - **Startup Capital Flows Accelerating:** Noteworthy rounds at Harvey ($300M Series E, legal AI), Abridge ($300M Series E, medical AI), Metaview ($35M, hiring), and Lovable ($150M rumored). However, most are at steep valuations (>$2B pre/post-money).
+ - **Insider & Smart Money Activity:** a16z, Kleiner Perkins, Coatue, Google Ventures, and Sequoia are active, with Glasswing Ventures discussing new AI funding strategies.
+ - **Geographic Expansion:** SoftBank’s ASI moves and an Asia-centric “build with context” approach highlight a more sustainable, potentially undervalued new-entrant pipeline.
+
+ ---
+
+ ## 2. Stocks or Startups to Watch
+
+ ### **Public Markets:**
+ - **Arista Networks (ANET)**, **Nvidia (NVDA)**, **AMD (AMD):** “Picks and shovels” for the AI gold rush—datacenter, networking, compute chips. *Arista* trades at a lower valuation multiple than Nvidia, still has strong ROE, and is less crowded.
+ - **SoftBank (SFTBY/SFTBF):** The push for "artificial superintelligence" signals heavy capital spend, but it could be an undervalued play if execution improves and Vision Fund losses subside.
+ - **Apple (AAPL):** Movement on AI partnerships/acquisitions may re-rate Siri’s potential, although Apple trades rich by value standards.
+
+ ### **Private/Startup Watchlist:**
+ - **Harvey (Legal AI):** $5B valuation, but massive adoption potential for legal transformation; recent consecutive mega-rounds—possibly ahead of fundamentals.
+ - **Abridge (Healthcare AI):** $5.3B valuation; automating medical notes is a real use case, but the valuation is steep.
+ - **Metaview (Recruitment AI):** Google Ventures led; automating and bias-reducing hiring—smaller, earlier, potentially higher reward.
+ - **Lovable:** On track to raise $150M at a $2B valuation. Early-stage AI firm with unknown fundamentals, but worth tracking as a potential future public market debut.
+
+ ### **Infrastructure enablers:**
+ - **Scott Data (Private):** Midwest US data center supporting AI startups—potential for M&A or IPO as a picks-and-shovels play on the AI startup wave.
+ - **Industrial/Manufacturing AI:** Watch industrial AI “digital twins” and multimodal analytics for less flashy, but real, B2B moats.
+
+ ---
+
+ ## 3. What Smart Money Might Be Acting On
+
+ - **Private Market Rotation:** Top VCs (Kleiner Perkins, a16z, Coatue, Sequoia, Google Ventures) are doubling down on AI startups, but selectively—pivoting more to infrastructure, HR, and healthcare use cases where actual adoption is measurable.
+ - **Datacenter & Networking Expansion:** Institutional and growth investors are pushing into datacenter, network, and hardware plays over frothy model-chatbot proliferators.
+ - **“Asia Build” Angle:** Long-term capital is weighing Asian AI execution models, where blitzscaling is shunned in favor of capital efficiency. Early institutional allocation might offer a less overpriced entry into the next breakout AI winners.
+
+ ---
+
+ ## 4. References
+
+ - [Forbes: AI Hype Cycle & Infrastructure](https://www.forbes.com/sites/rscottraynovich/2025/07/01/inside-the-ai-hype-cycle-whats-next-for-enterprise-ai/)
+ - [RCR Wireless: SoftBank's Superintelligence Ambitions](https://www.rcrwireless.com/20250630/ai-infrastructure/softbank-artificial)
+ - [TechCrunch: Harvey, Abridge funding](https://techcrunch.com/2025/06/27/startups-weekly-tech-and-the-law/)
+ - [Startup Ecosystem Canada: Lovable AI funding](https://www.startupecosystem.ca/news/lovable-ai-startup-on-track-to-raise-150m-at-2b-valuation/)
+ - [GovTech: Scott Data, Omaha AI infrastructure partnership](https://www.govtech.com/artificial-intelligence/partnership-looks-to-drive-ai-adoption-in-omaha-neb)
+ - [Mining Technology: Industrial/Multimodal AI](https://www.mining-technology.com/sponsored/whats-next-for-industrial-ai-five-key-developments-shaping-the-space/)
+ - [Business Insider: Claude/Anthropic, Microsoft AI as a core workflow](https://www.businessinsider.com/claude-ran-store-anthropic-ai-agent-lessons-learned-middle-managers-2025-6)
+
+ ---
+
+ ## 5. Investment Hypothesis
+
+ **The market is in the mid-to-late innings of the first generative AI value cycle. Near-term value is likely to accrue to AI infrastructure enablers (datacenter, networking, compute), not to richly priced, flashy model startups. The next unlock is in vertical B2B—manufacturing, healthcare, legal, hiring—especially where data or workflows are defensible (moats). Early-stage infrastructure providers outside the Bay Area (e.g., Midwest data centers, lower-multiple Asia AI shops) may offer underappreciated value. SoftBank’s renewed push and Apple’s partnership strategy suggest major future M&A, benefiting core AI tech and infrastructure players.**
+
+ ### **Screen for:**
+ - Public tech with strong fundamentals (low P/E, high ROE, cash flows) in critical infrastructure (Arista, AMD)
+ - Private companies with repeat-use, high-barrier products — notably in B2B SaaS, industrial, or privacy-compliant hiring/medtech AI
+ - Undercovered, smaller infrastructure shops and regional datacenter players (public or potential IPO/M&A targets)
+
+ *(An illustrative version of this screen in code follows this memo.)*
+
+ ---
+
+ **(Caveat: Recent startup valuations may be unsustainably high. Exercise discipline; seek evidence of unit economics and actual cashflow, not just growth metrics.)**
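To make the “Screen for” criteria concrete, here is a minimal illustrative sketch in pandas. The tickers, thresholds, and figures are placeholders for demonstration, not real market data or a recommendation:

```python
import pandas as pd

# Placeholder fundamentals, for illustration only (not real market data).
universe = pd.DataFrame([
    {"ticker": "INFRA1", "pe": 22.0, "roe": 0.28, "fcf_musd": 1500.0},
    {"ticker": "INFRA2", "pe": 45.0, "roe": 0.31, "fcf_musd": 900.0},
    {"ticker": "SAAS1",  "pe": 80.0, "roe": 0.05, "fcf_musd": -120.0},
])

# The memo's screen: low P/E, high ROE, positive free cash flow.
screen = universe[(universe["pe"] < 30) & (universe["roe"] > 0.15) & (universe["fcf_musd"] > 0)]
print(screen["ticker"].tolist())  # ['INFRA1']
```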
data/ai_2025-07-04_1.md ADDED
@@ -0,0 +1,104 @@
+
+ > Topic: `AI`
+ > Articles Collected: `116`
+ > Generated: `2025-07-04 14:12`
+ >
+ # AI Weekly Value Investor Memo
+ ### Week of July 1st, 2025
+
+ ---
+
+ ## 0. Market Context & Sentiment
+
+ **Macro & Sentiment:**
+ This week, investor sentiment in the AI sector remains broadly bullish, underpinned by continued enterprise adoption and “picks and shovels” investment themes. Headlines focus on the scaling of AI toward Artificial General Intelligence (AGI) and hype cycles driven by big checks for vertical SaaS startups and infrastructure vendors. No single transformative event, but sustained high valuations, strong VC focus on “AI-native” startups, and large checks pouring into cybersecurity and legal GTM plays. Amid frothy valuations, institutional players (SoftBank, Big Tech) are doubling down on infrastructure and ecosystem strategies over direct “brain” competition.
+
+ Key macro trends:
+ - Rising data center demand, with local partnerships (Omaha) echoing US reshoring and digitization.
+ - Regulatory posture remains ambiguous—no new moats from regulation, but AI safety and compliance themes are prominent.
+ - Cyclical hype for AGI but little near-term fundamental change.
+ - The venture market bifurcates: mature infra stocks pop while newer AI SaaS valuations are punchy, not value-oriented.
+
+ ---
+
+ ## 1. Key Value Signals (This Week)
+
+ - **Infrastructure Over Apps:** Forbes, TechCrunch, and several VC sources stress that infrastructure (“picks & shovels”) wins: Nvidia, AMD, Arista Networks. Underlying: new data center buildouts are where margins and moats are consolidating.
+ - **Big Funding, High Valuation Startups:** Multiple $300M Series E rounds at $5B valuations (Harvey AI, Abridge); notable, but they reflect more exit-hunting than deep value.
+ - **SoftBank AI Ambitions:** SoftBank signals multi-phase, multi-location investments in ASI (Artificial Super Intelligence) infrastructure, not “hot” consumer apps.
+ - **Apple Chasing Leading Models:** Rumors of Apple exploring partnerships with OpenAI and Anthropic for Siri enhancement.
+ - **Cybersecurity Demand:** AI investment is driving new security solutions; the sector is heating up.
+
+ ---
+
+ ## 2. Stocks or Startups to Watch
+
+ ### Infrastructure
+
+ - **Arista Networks (ANET):** Under the radar, strong FCF, high ROE, reasonable P/E (~30), and riding the AI data center build-out.
+ - **Super Micro Computer (SMCI):** Hardware picks & shovels with high revenue growth; P/E and P/B are both lower than high-flying AI SaaS names, and FCF positive.
+ - **Vertiv Holdings (VRT):** Power/cooling for AI data centers, high operating margins, FCF, tailwinds from the capex cycle.
+
+ ### AI-Driven Cybersecurity
+
+ - **SentinelOne (S):** Recently turned cash flow positive, but the price is high—watch for dips.
+ - **CrowdStrike (CRWD):** Pricey, but the moat is growing and sectoral tailwinds persist.
+ - **Small Caps:** Flashpoint and other startups surfaced in cybersecurity coverage—potential acquisition targets.
+
+ ### Newer, Less-Observed Startups
+
+ - **Lovable:** On track to raise $150M at $2B—early, but could be a medium-term play if growth justifies the valuation and multiples contract.
+ - **Gruve (tech consulting AI):** High-growth vertical AI consulting. Private, but watch for an eventual IPO or acquisition.
+ - **Metaview:** Google Ventures-backed AI for interview/recruitment; the market is early but growing.
+
+ ### Legal & Medical AI
+
+ - **Harvey AI, Abridge:** Both received massive late-stage rounds at high valuation-to-revenue ratios. Watch for longer-term public market readiness or a cooling-off.
+
+ ---
+
+ ## 3. What Smart Money Might Be Acting On
+
+ - **Follow the Infra CapEx:** Hedge funds and institutions are likely overweighting core infrastructure—semis, networking, power, storage—where margin, volume, and defensible moats (via high switching costs or regulatory inertia) exist.
+ - **Avoiding Overhyped SaaS/Vertical Plays for Now:** Recent late-stage venture rounds suggest IPO ambitions but present frothy valuations—a signal to wait for broader tech multiple contraction.
+ - **Vulture Mode on Early AI Cybersecurity:** Acquisitive public companies may shop among underfunded cybersecurity startups as hype cools.
+ - **Monitoring Apple’s Moves:** Any deal with OpenAI/Anthropic for Siri would shift market sentiment and spark M&A or partnership runs in vertical AI.
+ - **Asian Market Infrastructure Approaches:** With less “blitzscaling” and more focus on fundamentals (SoftBank, the Asia model), look for undercovered APAC infra names with value metrics.
+
+ ---
+
+ ## 4. References
+
+ News articles cited for context:
+ - [Inside The AI Hype Cycle: What’s Next For Enterprise AI? - Forbes](https://www.forbes.com/sites/rscottraynovich/2025/07/01/inside-the-ai-hype-cycle-whats-next-for-enterprise-ai/)
+ - [SoftBank aims to lead artificial super intelligence era - RCR Wireless News](https://www.rcrwireless.com/20250630/ai-infrastructure/softbank-artificial)
+ - [Apple Explores Anthropic and OpenAI for Siri AI Enhancement - Startup Ecosystem Canada](https://www.startupecosystem.ca/news/apple-explores-anthropic-and-openai-for-siri-ai-enhancement/)
+ - [Surging Investments in AI Are Transforming Cybersecurity - Forbes](https://www.forbes.com/sites/chuckbrooks/2025/06/27/surging-investments-in-ai-are-transforming-cybersecurity/)
+ - [Venture Gapital - Forbes](https://www.forbes.com/sites/richkarlgaard/2025/07/04/venture-gapital/)
+ - [Startups Weekly: Tech and the law - TechCrunch](https://techcrunch.com/2025/06/27/startups-weekly-tech-and-the-law/)
+ - [Lovable AI Startup on Track to Raise $150M at $2B Valuation - Startup Ecosystem Canada](https://www.startupecosystem.ca/news/lovable-ai-startup-on-track-to-raise-150m-at-2b-valuation/)
+
+ ---
+
+ ## 5. Investment Hypothesis
+
+ ### Core Thesis
+ **Value is centered in infrastructure providers—hardware, networking, power, and core security—catering to the real, growing CapEx demand of the AI buildout. Avoid vertical SaaS and “hot” application startups unless multiples meaningfully contract or unique moats emerge.**
+
+ **Rationale:**
+ - AI infrastructure remains the largest, highest-margin, most defensible segment of the ecosystem, with strong fundamentals (low P/E, high ROE, FCF-rich) and clear secular tailwinds.
+ - The application layer (AI-native SaaS) is flush with VC cash, driving up valuations, and carries high “hype vs. actual value” risk unless real-world defensible moats or sticky enterprise contracts are evident.
+ - Smart money is playing infrastructure, not moonshots in AGI or narrow applications—at least until the next wave of valuation resets.
+
+ ### Strategy
+ - **Overweight:** Data center, networking, and power infrastructure (ANET, SMCI, VRT).
+ - **Underweight/monitor:** Late-stage application-layer startups (Harvey, Abridge, Lovable) for valuation resets.
+ - **Watch list:** Early AI cybersec, APAC infrastructure with value metrics, Apple partnership signals.
+ - **Regulatory angle:** Unlikely to drive new barriers to entry/moats this cycle; invest in moats built on high switching costs or network effects.
+
+ ---
+
+ **Conclusion:**
+ This is a “follow the pipes, not the flowers” moment. Let VCs chase $5B AI SaaS rounds—the best value lies in asset-light infrastructure plays with strong balance sheets, high returns on equity, and multi-year secular tailwinds.
+
+ ---
data/combined_report.md ADDED
@@ -0,0 +1,700 @@
+
+ ---
+
+ [AI memo identical to data/ai_2025-06-03.md above]
114
+
115
+ ---
116
+
117
+
118
+ > Topic: `Nuclear energy`
119
+ > Articles Collected: `150`
120
+ > Generated: `2025-07-04 13:55`
121
+ >
122
+ # Nuclear Energy: Value-Investor Weekly Memo
123
+ **Week of June 30 – July 7, 2025**
124
+
125
+ ---
126
+
127
+ ## Executive Summary: Sentiment & Market Trends
128
+
129
+ This week, nuclear energy remains at the center of global and U.S. energy policy debates, buoyed by both political tailwinds (GOP-led support in legislation, state-level deployment pushes) and rising demand from AI/data center infrastructure. Nuclear is also strategically reemerging as the “clean firm” power of choice as renewables face policy setbacks, intermittency challenges, and grid reliability strains. Major tech companies and select startup activity point to accelerations in both fission (SMRs) and fusion, with corporate and government actors signaling capital and operational shifts toward advanced nuclear solutions.
130
+
131
+ Market sentiment appears mildly positive for established names but remains neutral for the broader sector. Early-stage deal flow and new executive moves hint at undervalued opportunities in uranium miners, SMR developers, and next-gen reactor supply chains, all backstopped by robust macro trends.
132
+
133
+ ---
134
+
135
+ ## 1. Key Value Signals
136
+
137
+ - **Public-Private Partnerships & Policy Tailwinds**
138
+ - New York’s governor directs pursuit of at least 1 GW of new nuclear (possible “fleet-style” deployments), signifying state-level commitment.
139
+ - GOP legislation weakens renewables but retains and even enhances support for nuclear/geothermal—improving medium-term earning prospects for nuclear-exposed businesses.
140
+ - **Tech Giant Commitments**
141
+ - Google commits to buying power from Commonwealth Fusion Systems (fusion) and from Kairos Power (SMRs/fission), underscoring long-term belief in and potential floor demand for advanced nuclear power.
142
+ - **M&A / Executive Movement**
143
+ - Ur-Energy (URG) names Matthew Gili (ex-Cameco, Energy Fuels) as President; strong management pedigree in uranium mining suggests focus on operational ramp-up and credibility for growth.
144
+ - **Private Funding & Industrial Partnerships**
145
+ - Westinghouse-ITER $180M fusion contract advances commercial pathways for fusion.
146
+ - Palantir partners with The Nuclear Company for AI deployment in nuclear construction, potentially de-risking timelines and cost overruns—key bottlenecks for new plants.
147
+ - **Uranium Financing**
148
+ - Energy Fuels (NYSE: UUUU) launches $300M ATM share offering for growth and possibly M&A, indicating possible scale-up action or acquisition-driven value.
149
+
150
+ ---
151
+
152
+ ## 2. Stocks or Startups to Watch
153
+
154
+ ### Undervalued Small Caps / Startups
155
+
156
+ - **Ur-Energy (URG)**
157
+ - **Sector**: Uranium production/mining
158
+ - **Signals**: New CEO with pedigree, North American supply play; potential for insider or institutional accumulation.
159
+ - **Fundamentals**: Historically low P/B and P/E vs. sector; improving cash flow as uranium prices trend higher.
160
+ - **Energy Fuels (UUUU)**
161
+ - **Sector**: Uranium/rare earths
162
+ - **Signals**: ATM share offering—could precede an operational expansion, M&A, or balance sheet fortification.
163
+ - **Moat**: Vertical integration and North American production base; tailwinds from potential U.S. uranium supply mandates.
164
+ - **Kairos Power**
165
+ - **Sector**: Small Modular Reactor (SMR) developer
166
+ - **Signals**: Google is a committed off-taker (500 MW); not public but watch for IPO or private rounds.
167
+ - **Moat**: Proprietary reactor and fuel tech, first-mover commercial projects.
168
+ - **Commonwealth Fusion Systems (private)**
169
+ - **Sector**: Fusion
170
+ - **Signals**: Google investing + off-take for 200MW; implies robust institutional backing, possible pre-IPO unicorn.
171
+ - **Moat**: Leading IP/patent portfolio in commercial fusion.
172
+ - **Floating Nuclear Consortia (Europe/Mediterranean)**
173
+ - **Sector**: Maritime nuclear
174
+ - **Signals**: New industry consortium for floating plants; regulatory tailwinds in Europe; riskier but paradigm-shifting.
175
+
176
+ ### Large-Cap Defensive/Incumbent Names
177
+
178
+ - **Westinghouse (private, but watch via Brookfield Asset Management/partners)**
179
+ - **Signals**: $180M fusion contract + global SMR tenders.
180
+ - **Moat**: Deep IP/patents, established utility relationships.
181
+
182
+ #### Emerging Themes
183
+ - SMEs/startups deploying AI to compress reactor construction timelines (e.g., The Nuclear Company + Palantir).
184
+ - Uranium spot market dislocations, supply security, and U.S./Canadian production uptrend.
185
+
186
+ ---
187
+
188
+ ## 3. What Smart Money Might Be Acting On
189
+
190
+ ### Institutional Moves and VC Flows
191
+
192
+ - **Tech Company Off-Take Agreements**: Google’s long-dated power purchase agreements (PPAs) for nuclear fusion and SMRs indicate that large buyers are locking in future clean firm power, giving runway and de-risking revenue for emerging projects.
193
+ - **Leadership Talent Migration**: Appointment of high-profile operators (e.g., Matthew Gili at URG) often precedes capital flows and operational improvement.
194
+ - **Private/VC Investment**: Ongoing private fundraising in fusion (CFS/publicized; others less visible) and SMR space—potential for pre-IPO access or PIPE deals.
195
+ - **Policy-driven Lifts**: Funds with a value/cyclical tilt may be accumulating uranium miners and established SMR suppliers, expecting U.S. or European state-driven demand and pricing power.
196
+
197
+ ---
198
+
199
+ ## 4. References
200
+
201
+ - [Insider Monkey: Ur-Energy appoints Matthew Gili](https://www.insidermonkey.com/blog/ur-energy-urg-names-matthew-gili-as-president-to-support-growth-strategy-1562642/)
202
+ - [TechCrunch: Google’s data center energy use doubles; commits to SMRs & Fusion](https://techcrunch.com/2025/07/01/googles-data-center-energy-use-doubled-in-four-years/)
203
+ - [Newsweek: Google bets on Nuclear Fusion, Commonwealth Fusion Systems](https://www.newsweek.com/google-bets-nuclear-fusion-next-generation-clean-power-2091877)
204
+ - [POWER Magazine: Westinghouse & ITER fusion contract](https://www.powermag.com/westinghouse-iter-sign-180-million-contract-to-advance-nuclear-fusion/)
205
+ - [Utility Dive: NY Gov. Hochul nuclear push](https://www.utilitydive.com/news/new-york-gov-hochul-hints-at-fleet-style-approach-to-nuclear-deployments/751838/)
206
+ - [Insider Monkey: Energy Fuels ATM offering](https://www.insidermonkey.com/blog/energy-fuels-uuuu-launches-300-million-atm-share-offering-program-1562647/)
207
+ - [Marine Link: Industry consortium assesses floating nuclear](https://www.marinelink.com/news/industry-consortium-asses-floating-527616)
208
+ - [The Verge, Sky News, NPR, CleanTechnica] (multiple for macro/policy context)
209
+
210
+ ---
211
+
212
+ ## 5. Investment Hypothesis
213
+
214
+ Amid rising electricity demand from AI/data centers and the political marginalization of wind/solar, nuclear energy—particularly next-gen reactor developers, operationally leveraged uranium miners, and AI-enabled project managers—is set to benefit from both structural and cyclical forces. Near-term policy support, tech company PPA commitments, and tangible operational milestones (fusion contracts, executive talent upgrades) provide a fundamental backdrop for value investors.
215
+
216
+ **Thesis**: Select undervalued uranium miners (URG, UUUU) and actionable SMR/fusion-related plays with real partnerships or contracts (Kairos, CFS, Palantir’s nuclear construction software partners) are likely mispriced relative to long-term demand, the emergence of tech buyer power, and regulatory tailwinds. Watch for balance sheet improvement, insider activity, and capex deployment as future catalysts.
217
+
218
+ **Actionable Watchlist:**
219
+ - Ur-Energy (NYSE: URG) — ride management upgrade and uranium bull cycle
220
+ - Energy Fuels (NYSE: UUUU) — play on U.S. supply autonomy and balance sheet firepower
221
+ - Private: Kairos Power, Commonwealth Fusion Systems — monitor for IPO/news, pre-IPO funds
222
+ - Established supply chain: Westinghouse (via BAM, or tracking SMR contracts), Palantir’s nuclear ventures
223
+
224
+ ---
225
+
226
+ **Macroeconomic/Regulatory Context:**
227
+ - U.S. and European grid reliability and policy now lean “pro-nuclear” as renewables face political and technical hurdles.
228
+ - Tech-sector demand for bespoke clean, reliable baseload may outpace traditional grid growth, driving long-term PPA/contracting up for nuclear-adjacent firms.
229
+ - Early stage risk remains (especially fusion), but government cash, looser environmental reviews, and talent influx are de-risking the sector.
230
+
231
+ ---
232
+
233
+ **Discipline:** Accumulate on dips with a margin of safety; remain alert to policy reversals, cost overruns, and technology risk. Revisit on IPO news, federal incentive shifts, and real-world contract wins.
234
+
235
+ ---
236
+
237
+
238
+ > **Metrics**
239
+ > Topic: `nuclear energy`
240
+ > Articles Collected: `60`
241
+ > Generated: `2025-06-03 11:52`
242
+ >
243
+ # Nuclear Energy: Value Investing Focus – Week Ending 2/June/2025
244
+
245
+ ---
246
+ ## Intro: Market Context and Week Summary
247
+
248
+ Nuclear energy took center stage this week, driven by major executive moves in U.S. energy policy, heightened demand from AI/data centers, and investor/VC excitement about SMRs (small modular reactors). With Trump’s administration rolling out pro-nuclear executive orders and Europe/Asia accelerating new builds, public and private capital is steadily shifting back into nuclear plays. The macro environment is bullish: regulatory timelines are shortening, capital support is rising, and energy stability/cleanliness place nuclear above wind and solar in AI-focused grid conversations. On the ground: several companies (including Oklo, BWX Technologies, and Centrus) received analyst upgrades, utilities are racing to deploy SMRs, and nuclear-tech startups are pulling in fresh VC funds. Smart money is watching supply chains (uranium), next-gen reactors, and infrastructure/enabling tech for nuclear’s new "golden age."
249
+
250
+ ---
251
+
252
+ ## 1. Key Value Signals
253
+
254
+ - **Major U.S. Policy Shift**: New Trump administration executive orders to accelerate nuclear tech approval, reduce permitting times and support uranium supply chains ([Investor's Business Daily](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/), [Forbes](https://www.forbes.com/sites/llewellynking/2025/05/31/nuclear-golden-age-huge-potential-stubborn-obstacles/)).
255
+ - **Big Tech Partnership Moves**: Google (and earlier, Meta) inking first agreements with small modular reactor developers ([The Guardian](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power)).
256
+ - **Startups & VC Funding Rounds**: Atomic Canyon (AI for nuclear), Kairos Power, and others drawing new funding ([Axios](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium), [TechCrunch](https://techcrunch.com/2025/05/28/atomic-canyon-wants-to-be-chatgpt-for-the-nuclear-industry/)).
257
+ - **Utility Action on SMRs**: TVA becomes first U.S. utility to seek permit for SMR, indicating a path for future orders ([Insurance Journal](https://www.insurancejournal.com/news/southeast/2025/05/27/825158.htm)).
258
+ - **Analyst Upgrades and Insider Buys**: Oklo (OKLO), Centrus Energy (LEU), and BWX Technologies (BWXT) upgraded ([Investor's Business Daily](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)).
259
+ - **Strong Fundamental Tailwinds**:
260
+ - **Low P/E, Strong ROE/FCF**: Several nuclear/uranium plays trading below market P/E, generating high free cash flow, with secular macro demand increases.
261
+ - **Moats Emerging**: Through regulatory complexity, IP, and public-private partnerships.
262
+
263
+ ---
264
+
265
+ ## 2. Stocks or Startups to Watch
266
+
267
+ ### **Listed Stocks**
268
+
269
+ #### **Oklo (OKLO)**
270
+ - **Trigger:** Analyst upgrades post-Trump nuclear EO, SMR play, strong U.S. government support ([Investor's Business Daily](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/))
271
+ - **Fundamentals:** Newly public (<6 months), early FMC/S-1 data. Moat: First SMR in pipeline, government/tech sector contracts.
272
+ - **Metric:** Expected SMR deployment, contract pipeline not yet priced in.
273
+
274
+ #### **Centrus Energy (LEU)**
275
+ - **Trigger:** Upgraded, uranium supply chain play; critical to new U.S. nuclear push ([Investor's Business Daily](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/))
276
+ - **P/E:** ~13 ([Yahoo Finance](https://finance.yahoo.com/quote/LEU/))
277
+ - **ROE:** ~27%
278
+ - **Market Cap:** ~$650M
279
+ - **Comment:** Only U.S. uranium enrichment capability, crucial as U.S. looks to de-risk from Russia ([Mining.com.au](https://mining.com.au/trumps-nuclear-push-ignites-uranium-buzz/)).
280
+
281
+ #### **BWX Technologies (BWXT)**
282
+ - **Trigger:** Major reactor supplier for U.S. Navy and DoE, among first to benefit from process acceleration ([Investor's Business Daily](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)).
283
+ - **P/E:** ~24
284
+ - **ROE:** ~35%
285
+ - **Moat:** Navy sole-source positioning, R&D, U.S. government contracts.
286
+ - **Market Cap:** ~$10B
287
+
288
+ #### **NuScale Power (SMR)**
289
+ - **Trigger:** NRC has approved SMR design, clearing path for deployment ([Utility Dive](https://www.utilitydive.com/news/nrc-approves-nuscale-small-modular-reactor-smr/749538/))
290
+ - **Metric:** High short interest post-IPO, but new regulatory tailwinds. Watch for major contract wins.
291
+
292
+ #### **Paladin Energy (PDN.AX)**
293
+ - **Trigger:** Making moves at Patterson Lake as uranium demand surges with U.S. and global SMR build ([Mining.com.au](https://mining.com.au/paladin-proceeds-at-patterson-lake/)).
294
+ - **Comment:** Undervalued relative to long-term uranium price upcycle.
295
+
296
+ ### **Startups & Undercapitalized Opportunities**
297
+
298
+ - **Atomic Canyon**: AI-powered B2B software for nuclear industry. Raised $7M seed led by Energy Impact Partners (backers of several energy unicorns). Aim: “ChatGPT for nuclear” ([TechCrunch](https://techcrunch.com/2025/05/28/atomic-canyon-wants-to-be-chatgpt-for-the-nuclear-industry/))
299
+
300
+ - **Kairos Power**: Leading small modular reactor startup; Google is the first customer for its future SMR output under a direct-purchase PPA ([The Guardian](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power))
301
+
302
+ - **Type One Energy**: Fusion startup; just completed its formal initial design review ([Power Magazine](https://www.powermag.com/avangrid-investing-41-million-to-rebuild-ny-grid-infrastructure/)).
303
+
304
+ ---
305
+
306
+ ## 3. What Smart Money Might Be Acting On
307
+
308
+ - **Venture/Institutional**: Top-tier VCs (Energy Impact Partners, Plug and Play, Tower Research) making preemptive moves into enabling tech/software (e.g., Atomic Canyon).
309
+ - **Corporate Power Users (Big Tech)**: Google, Meta inking deals with SMR startups—future demand signal for new nuclear ([The Guardian](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power)).
310
+ - **Analyst Coverage/Upgrades**: William Blair’s initiation on OKLO, LEU, and BWXT signals Wall Street is waking up to regulatory + macro catalysts ([Investor's Business Daily](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)).
311
+ - **Utilities/State Action**: TVA and Texas moving to lead SMR deployment and streamline permitting—possible template for state-federal partnerships ([Insurance Journal](https://www.insurancejournal.com/news/southeast/2025/05/27/825158.htm), [GovTech](https://www.govtech.com/products/texas-senate-passes-350m-grant-program-for-nuclear-power)).
312
+ - **Insider-Led Companies**: Centrus Energy (LEU, ex-government insiders, U.S.-centric contracts), Oklo (deep government, tech ecosystem relationships).
313
+
314
+ ---
315
+
316
+ ## 4. References/Sources
317
+
318
+ - [Forbes - U.S. must double down on nuclear](https://www.forbes.com/sites/billfrist/2025/05/29/powering-the-future-why-america-must-double-down-on-nuclear-energy/)
319
+ - [Forbes - Data Center Energy Wars](https://www.forbes.com/sites/ianpalmer/2025/05/27/gas-nuclear-renewables-battle-over-power-for-metas-new-data-center/)
320
+ - [The Guardian - Tech firms buy SMR power](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power)
321
+ - [Investor's Business Daily - Nuclear stocks upgraded](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)
322
+ - [Axios - Atomic Canyon B2B seed](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium)
323
+ - [TechCrunch - Atomic Canyon profile](https://techcrunch.com/2025/05/28/atomic-canyon-wants-to-be-chatgpt-for-the-nuclear-industry/)
324
+ - [Insurance Journal - TVA SMR permit](https://www.insurancejournal.com/news/southeast/2025/05/27/825158.htm)
325
+ - [Utility Dive – NRC approves NuScale SMR design](https://www.utilitydive.com/news/nrc-approves-nuscale-small-modular-reactor-smr/749538/)
326
+ - [Mining.com.au – Centrus/Paladin/uranium momentum](https://mining.com.au/trumps-nuclear-push-ignites-uranium-buzz/)
327
+ - [Yahoo Finance – LEU Key Stats](https://finance.yahoo.com/quote/LEU/)
328
+
329
+ ---
330
+
331
+ ## 5. Investment Hypothesis
332
+
333
+ **Thesis:**
334
+ Recent regulatory and policy catalysts have created a structural tailwind for both incumbent and next-gen nuclear energy firms, particularly those exposed to SMRs, uranium refining, and critical enabling tech/software. The current market underappreciates the scale and pace of coming capital inflows (from utilities, governments, and cloud/data-center majors). Valuations (esp. in uranium and contractors) remain attractive on a P/E and FCF basis compared to wind/solar.
335
+
336
+ - **Buy candidates:** Oklo (OKLO), Centrus (LEU), BWX Technologies (BWXT), Paladin (PDN.AX), NuScale (SMR)
337
+ - **Venture/early-exposure:** Consider gaining VC fund/PE exposure to emerging nuclear tech/software infrastructure (e.g., Atomic Canyon, Kairos Power).
338
+ - **Rationale:** U.S./global policy, increased AI power grid demand, and high barriers to entry combine for exceptional medium/long-term risk/reward—especially after this week’s “regime change” in sentiment and regulation.
339
+
340
+ **Monitor:**
341
+ New contract wins for SMR developers. U.S. uranium production and enrichment capacity (LEU). Expansion or new partnerships with tech/utility majors. Insider ownership trends and further analyst coverage for nuclear sector plays.
342
+
343
+ ---
344
+
345
+ ### Overall: This week’s news offers a clear “green light” for value investors in nuclear, particularly those seeking both deep value (LEU, BWXT) and long-tail growth via platform/SMR innovators (OKLO, Kairos, NuScale). U.S. government and major tech-firm endorsement serves as powerful affirmation for the sector’s re-rating.
346
+
347
+ ---
348
+
349
+ ---
350
+
351
+
352
+ > Topic: `AI`
353
+ > Articles Collected: `163`
354
+ > Generated: `2025-07-04 13:40`
355
+ >
356
+ # Value Investor AI Weekly Memo
357
+ **Week of June 30 – July 6, 2025**
358
+
359
+ ---
360
+
361
+ ## Market Sentiment & Trends
362
+
363
+ This week, the AI market continues to be characterized by robust *growth optimism* and *massive capital deployment*. Sentiment remains largely positive in infrastructure and applied AI, but there is rising skepticism toward sky-high private market valuations in some fast-following startups. Major headlines focus on AI’s influence in cybersecurity, legal, HR, and consulting verticals, as well as the continuing "picks and shovels" theme in datacenter hardware and services.
364
+ *No major regulatory shocks noted*, but institutions and investors are expressing caution about the sustainability of AI startup valuations and possible hype cycles.
365
+
366
+ ---
367
+
368
+ ## 1. Key Value Signals
369
+
370
+ - **Infrastructure Focus Remains Dominant:** The highest conviction for value investing is in AI infrastructure—hardware, datacenters, and core networking.
371
+ - **M&A and Partnership Activity:** Notable signals like Apple considering partnerships/acquisitions for Siri enhancements (Anthropic, OpenAI) and SoftBank moving aggressively on artificial superintelligence with multi-phase global projects.
372
+ - **Startup Capital Flows Accelerating:** Noteworthy rounds at Harvey ($300M Series E, legal AI), Abridge ($300M Series E, medical AI), Metaview ($35M, hiring), and Lovable ($150M rumored). However, most are at steep valuations (>$2B pre/post-money).
373
+ - **Insider & Smart Money Activity:** a16z, Kleiner Perkins, Coatue, Google Ventures, and Sequoia are active, with Glasswing Ventures discussing new AI funding strategies.
374
+ - **Geographic Expansion:** SoftBank’s ASI moves and Asia-centric “build with context” approach highlight a more sustainable, potentially undervalued new-entrant pipeline.
375
+
376
+ ---
377
+
378
+ ## 2. Stocks or Startups to Watch
379
+
380
+ ### **Public Markets:**
381
+ - **Arista Networks (ANET)**, **Nvidia (NVDA)**, **AMD (AMD):** “Picks and shovels” for the AI gold rush—datacenter, networking, compute chips. *Arista* has a lower valuation multiple than Nvidia, still strong ROE, and is less crowded.
382
+ - **SoftBank (SFTBY/SFTBF):** The push for "artificial superintelligence" signals heavy capital spend, but could be an undervalued play if execution improves and Vision Fund losses subside.
383
+ - **Apple (AAPL):** Movement on AI partnerships/acquisitions may re-rate Siri’s potential, although Apple trades rich by value standards.
384
+
385
+ ### **Private/Startup Watchlist:**
386
+ - **Harvey (Legal AI):** $5B valuation, but massive adoption potential for legal transformation; recently had consecutive mega-rounds—possibly ahead of fundamentals.
387
+ - **Abridge (Healthcare AI):** $5.3B valuation; automating medical notes is a real use-case, but valuation steep.
388
+ - **Metaview (Recruitment AI):** Google Ventures led; automating/bias-reducing hiring—smaller, earlier, potentially higher reward.
389
+ - **Lovable:** On track for $150M at $2B. Early-stage AI firm, unknown fundamentals, but worth tracking as a potential future public market debut.
390
+
391
+ ### **Infrastructure enablers:**
392
+ - **Scott Data (Private):** Midwest US data center, supporting AI startups—potential for M&A or IPO as picks-and-shovels to the AI startup wave.
393
+ - **Industrial/Manufacturing AI:** Watch industrial AI “digital twins” and multimodal analytics for less-flashy, but real, B2B moats.
394
+
395
+ ---
396
+
397
+ ## 3. What Smart Money Might Be Acting On
398
+
399
+ - **Private Market Rotation:** Top VCs (Kleiner Perkins, a16z, Coatue, Sequoia, Google Ventures) are doubling down on AI startups, but selectively—pivoting more to infrastructure, HR, and healthcare use-cases where actual adoption is measurable.
400
+ - **Datacenter & Networking Expansion:** Institutional and growth investors pushing into datacenter, network, and hardware plays over frothy model-chatbot proliferators.
401
+ - **“Asia Build” Angle:** Long-term capital weighs Asian AI execution models, where blitzscaling is shunned for capital efficiency. Early institutional allocation might offer less-overpriced entry into the next breakout AI winners.
402
+
403
+ ---
404
+
405
+ ## 4. References
406
+
407
+ - [Forbes: AI Hype Cycle & Infrastructure](https://www.forbes.com/sites/rscottraynovich/2025/07/01/inside-the-ai-hype-cycle-whats-next-for-enterprise-ai/)
408
+ - [RCR Wireless: SoftBank's Superintelligence Ambitions](https://www.rcrwireless.com/20250630/ai-infrastructure/softbank-artificial)
409
+ - [TechCrunch: Harvey, Abridge funding](https://techcrunch.com/2025/06/27/startups-weekly-tech-and-the-law/)
410
+ - [Startup Ecosystem Canada: Lovable AI funding](https://www.startupecosystem.ca/news/lovable-ai-startup-on-track-to-raise-150m-at-2b-valuation/)
411
+ - [GovTech: Scott Data, Omaha AI infrastructure partnership](https://www.govtech.com/artificial-intelligence/partnership-looks-to-drive-ai-adoption-in-omaha-neb)
412
+ - [Mining Technology: Industrial/Multimodal AI](https://www.mining-technology.com/sponsored/whats-next-for-industrial-ai-five-key-developments-shaping-the-space/)
413
+ - [Business Insider: Claude/Anthropic, Microsoft AI as a core workflow](https://www.businessinsider.com/claude-ran-store-anthropic-ai-agent-lessons-learned-middle-managers-2025-6)
414
+
415
+ ---
416
+
417
+ ## 5. Investment Hypothesis
418
+
419
+ **The market is in the mid-to-late innings of the first generative AI value cycle. Near-term value is likely to accrue to AI infrastructure enablers (datacenter, networking, compute), NOT to richly-priced flashy model startups. The next unlock is in B2B-specific verticals—manufacturing, healthcare, legal, hiring—especially those with defensible data or workflows (moats). Early-stage infrastructure providers outside the Bay Area (e.g., Midwest data centers, lower-multiple Asia AI shops) may offer underappreciated value. SoftBank’s renewed push and Apple’s partnership strategy suggest major future M&A, benefiting core AI tech and infrastructure players.**
420
+
421
+ ### **Screen for:**
422
+ - Public tech with strong fundamentals (low P/E, high ROE, cash flows) in critical infrastructure (Arista, AMD); a minimal ranking sketch follows this list
423
+ - Private companies with repeat-use, high-barrier products — notably in B2B SaaS, industrial, or privacy-compliant hiring/medtech AI
424
+ - Undercovered, smaller infrastructure shops and regional datacenter players (public or potential IPO/M&A targets)
425
+
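+ A minimal ranking sketch for this screen (the metric values are illustrative placeholders, not real fundamentals; in practice they would come from filings or a data provider):
+
+ ```python
+ # Rank the public names above by a crude value composite:
+ # earnings yield (1 / P/E) plus ROE. Placeholder numbers only.
+ import pandas as pd
+
+ candidates = pd.DataFrame(
+     [
+         {"ticker": "ANET", "pe": 30.0, "roe": 0.30},  # placeholder metrics
+         {"ticker": "AMD",  "pe": 28.0, "roe": 0.06},
+         {"ticker": "NVDA", "pe": 50.0, "roe": 0.90},
+     ]
+ ).set_index("ticker")
+
+ candidates["earnings_yield"] = 1 / candidates.pe
+ candidates["score"] = candidates.earnings_yield + candidates.roe
+ print(candidates.sort_values("score", ascending=False))
+ ```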
426
+ ---
427
+
428
+ **(Caveat: Recent startup valuations may be unsustainably high. Exercise discipline; seek evidence of unit economics and actual cashflow, not just growth metrics.)**
429
+
430
+ ---
431
+
432
+
433
+ > Topic: `nuclear energy`
434
+ > Articles Collected: `133`
435
+ > Generated: `2025-07-02 20:18`
436
+ >
437
+ # Nuclear Energy Weekly Value Investing Memo
438
+ **Week of July 1, 2025**
439
+
440
+ ---
441
+
442
+ ### **Market Sentiment & Trends**
443
+ This week’s news reconfirms nuclear energy’s rising status as both a grid reliability solution and a strategic utility for tech and industrial growth. Demand drivers include:
444
+ - Growing AI/data center needs (Google, Microsoft, Amazon heavily engaged)
445
+ - Policy tailwinds and new US DOE initiatives
446
+ - New partnerships and investments from leading tech and engineering firms
447
+ - Heightened urgency, both industrially and politically, for next-gen nuclear and advanced enrichment.
448
+
449
+ The overall sentiment is incrementally positive: there’s powerful momentum for nuclear expansion (especially advanced/small modular/fusion), but major regulatory, funding, and execution risks remain.
450
+
451
+ ---
452
+
453
+ ## 1. **Key Value Signals**
454
+
455
+ - **Big Tech Putting Capital to Work**: Google commits to buying electricity from both *fusion* (Commonwealth Fusion Systems) and *fission* (Kairos Power—an SMR startup), signaling a long-term offtake demand for clean nuclear output. These deals, while years out, anchor real business models and future cash flows in an industry where certainty has been rare.
456
+
457
+ - **DOE Fast-Tracks Advanced Nuclear**: The US Department of Energy (DOE) launched a pilot program to authorize *private* test reactors—removing a longstanding barrier for early-stage and test deployments. This regulatory facilitation could accelerate revenue opportunities for startups.
458
+
459
+ - **AI Meets Nuclear Construction**: Palantir—a leader in data analytics—announced its software will drive efficiency in reactor construction (with “The Nuclear Company”), signaling an ecosystem of digital infrastructure forming around new builds.
460
+
461
+ - **Strategic Collaborations**: Oklo (recent SPAC, high-profile leadership) and Bill Gates’ TerraPower signed a partnership around domestic HALEU enrichment—critical for next-generation reactors and a US supply chain play.
462
+
463
+ - **Major Fusion Funding**: Westinghouse and ITER sign a $180M contract to push fusion technology, while global fusion market size forecasts surge.
464
+
465
+ - **IPO and Recent SPAC Activity**: Oklo’s public listing, ongoing chatter around SMR startups seeking either funding or public exits.
466
+
467
+ ---
468
+
469
+ ## 2. **Stocks or Startups to Watch**
470
+
471
+ **A. Public/Recent IPO & Small Cap Opportunities**
472
+ - **Oklo (NYSE: OKLO)**
473
+ - **Profile**: Recent SPAC debut; backed by substantial leadership and Bill Gates’ circle via TerraPower collaboration.
474
+ - **Signals**: Strategic partnerships, domestic enrichment angle, close alignment with DOE pilot regulatory streamlining.
475
+ - **Check**: Valuation (historically rich for early-stage nuclear), business execution, and regulatory milestones.
476
+
477
+ - **Kairos Power (private, but IPO/speculation possible)**
478
+ - **Profile**: Small modular reactor company. Google offtake deal is a significant vote of confidence.
479
+ - **Signals**: Market validation, long-term revenue anchor (if plant comes online).
480
+
481
+ - **Commonwealth Fusion Systems (private)**
482
+ - **Profile**: Leading fusion startup; Google as an offtaker/investor.
483
+ - **Signals**: Earliest in its lifecycle, but with elite backing. Watch for pre-IPO funding rounds and cap table changes.
484
+
485
+ **B. Established, Undervalued Nuclear Plays (Check Valuation/Fundamentals)**
486
+ - **BWX Technologies (NYSE: BWXT)**
487
+ - **Profile**: Established supplier for nuclear reactors and specialized components.
488
+ - **Moat**: Deep US government/defense contracts, emerging advanced reactor supply role.
489
+ - **Valuation**: P/E ratio tends to be market-comparable, but free cash flow strong and recurring revenue profile.
490
+ - **Signal**: Exposure to multiple advanced reactor programs, SMR rollout, and robust political support.
491
+
492
+ - **Centrus Energy (NYSEMKT: LEU)**
493
+ - **Profile**: Only US public company with commercial uranium enrichment capability—potential HALEU winner.
494
+ - **Signals**: Vital for fueling advanced reactors; highly levered to new DOE policies.
495
+ - **Risks**: Small cap, volatile, but high convexity if advanced nuclear takes off in '26+.
496
+
497
+ **C. Infrastructure, EPC, and Software**
498
+ - **Palantir Technologies (NYSE: PLTR)**
499
+ - **Profile**: Now branching into nuclear with specialized construction/efficiency software.
500
+ - **Signal**: Long-term, stickier defense/critical infrastructure business.
501
+
502
+ ---
503
+
504
+ ## 3. **What Smart Money Might Be Acting On**
505
+
506
+ - **Pre-emptive Strategic Investment**: Major techs (Google especially) are locking in low-carbon electricity contracts before physical infrastructure is built. Early investor entry into fusion/SMR supply chains could offer “picks & shovels” asymmetry.
507
+
508
+ - **Pivot to Domestic Supply Chain**: Oklo/TerraPower collaboration for HALEU enrichment directly addresses “made in America” energy/defense policy. This is the tip of a deglobalization and re-onshoring trend—any US enrichment or SMR component supplier could be in play.
509
+
510
+ - **Software/Services Layer**: The nuclear restart will bring new opportunities for “enabling” firms: EPC (AECOM, AtkinsRéalis, Arup), digital twins/AI software (Palantir), and regulatory facilitators.
511
+
512
+ - **Advanced Reactor “First Movers”**: Policy support (DOE program) will favor companies close to deployment/breakthrough—those that can move from pilot to cash generation by 2026-2030. Early capital and regulatory champions could see premium returns.
513
+
514
+ ---
515
+
516
+ ## 4. **References**
517
+
518
+ - [Google’s Data Center Bets — TechCrunch](https://techcrunch.com/2025/07/01/googles-data-center-energy-use-doubled-in-four-years/)
519
+ - [US DOE Pilot Program — POWER Magazine](https://www.powermag.com/doe-pilot-program-targets-three-nuclear-test-reactors-for-2026-criticality-under-department-authorization/)
520
+ - [Palantir and Nuclear — POWER Magazine](https://www.powermag.com/groups-partnering-to-develop-ai-software-to-speed-nuclear-reactor-construction/)
521
+ - [Oklo/TerraPower/HALEU — Oil & Gas 360](https://www.oilandgas360.com/oklo-enters-strategic-collaborations-with-hexium-and-terrapower-to-launch-new-pathway-for-domestic-haleu-enrichment/)
522
+ - [Westinghouse/ITER Contract — POWER Magazine](https://www.powermag.com/westinghouse-iter-sign-180-million-contract-to-advance-nuclear-fusion/)
523
+ - [Fusion Market Outlook — Precedence Research](https://www.precedenceresearch.com/fusion-energy-market)
524
+ - [BWX Technologies (BWXT) — Investor Relations](https://www.bwxt.com/)
525
+
526
+ ---
527
+
528
+ ## 5. **Investment Hypothesis**
529
+
530
+ **Thesis**: The convergence of policy, technology (AI/data center demand), and strategic investment from leading corporates is catalyzing a new nuclear buildout cycle—especially in the US. *First-mover* advanced fission and fusion startups, US-centric enrichment supply, and key enabling technologies (digital/twin/AI/construction) stand to generate outsize returns, particularly ahead of confirmed revenue streams in the early 2030s.
531
+
532
+ - **Core Bets**:
533
+ - **Oklo** — if price corrects — offers a uniquely exposed pure play on the regulatory shift and DOE pilot program.
534
+ - **Centrus Energy** — levered, high-risk/high-reward play on domestic HALEU enrichment.
535
+ - **BWX Technologies** — lower-risk, steady exposure to SMR and advanced builds, and possible defense tailwinds.
536
+
537
+ - **Venture/Aggressive**:
538
+ - Track private rounds (Commonwealth Fusion, Kairos Power); watch for IPO or secondary liquidity events.
539
+ - Monitor “picks and shovels” suppliers (engineering, digital, sensing, permitting).
540
+
541
+ - **Catalysts**:
542
+ - DOE pilot selections and project starts (late 2025/2026).
543
+ - Google/Microsoft/other tech-driven PPAs or partnerships.
544
+ - US and UK regulatory acceleration or major political support.
545
+
546
+ **Risks**: Execution slippage, cost overruns, regulatory reversals, or overhyped/illiquid microcaps. Fusion commercial viability remains >5-7 years out.
547
+
548
+ ---
549
+
550
+ # **Summary Table**
551
+
552
+ | Company | Ticker | Opportunity | Moat/Signal | Notes |
553
+ |------------------------|--------|------------------------|-----------------------------------|--------------------------------------------|
554
+ | Oklo | OKLO | Early pure play SMR | DOE pilot, TerraPower partnership | SPAC, recent, monitor valuation carefully |
555
+ | Centrus Energy | LEU | HALEU enrichment | Only US-capable, DOE contracts | High volatility |
556
+ | BWX Technologies | BWXT | Established supplier | Govt defense, recurring revenue | Steady, strong FCF & fundamentals |
557
+ | Commonwealth Fusion | – | Fusion, Google backing | Tech, strategic capital | Private, pre-IPO/2nd round watching |
558
+ | Kairos Power | – | SMR, Google offtake | Major tech validation | Private, track for IPO |
559
+ | Palantir Technologies | PLTR | Nuclear AI/software | 1st big software entrant | Not a pure play, watch ecosystem effects |
560
+
561
+ ---
562
+
563
+ ## **Bottom Line:**
564
+ *The investable landscape for nuclear is evolving rapidly—value investors should focus on companies bridging policy tailwind into real commercial assets, with an eye for US-centric supply, strategic contracts, and digital enablement of an emerging nuclear buildout cycle. Small/underfunded public names could offer asymmetric re-rating as the cycle unfolds.*
565
+
566
+ ---
567
+
568
+ ![nuclear energy](https://images.unsplash.com/photo-1630142895963-6996ae6b3a5b?crop=entropy&cs=tinysrgb&fit=max&fm=jpg&ixid=M3w1NzIzMjF8MHwxfHNlYXJjaHwxfHxudWNsZWFyJTIwZW5lcmd5fGVufDB8MHx8fDE3NDg5MzM1NDJ8MA&ixlib=rb-4.1.0&q=80&w=1080)
569
+
570
+ *Photo by <a href="https://unsplash.com/@llehotsky" target="_blank">Lukáš Lehotský</a> on <a href="https://unsplash.com" target="_blank">Unsplash</a>*
571
+
572
+
573
+ > **🧠 Metrics**
574
+ > - Topic: `nuclear energy`
575
+ > - Articles Collected: `69`
576
+ > - Generated: `2025-06-03 10:10`
577
+ >
578
+ # Nuclear Energy Value Investing Memo – Week Ending June 2, 2025
579
+
580
+ ## General Situation & Market Summary
581
+
582
+ This week marks a decisive shift for nuclear energy, fueled by sweeping pro-nuclear executive orders from the Trump administration, robust bipartisan support at the state and federal levels, and increased corporate demand from hyperscale data center operators such as Meta and Google [[1](https://www.forbes.com/sites/billfrist/2025/05/29/powering-the-future-why-america-must-double-down-on-nuclear-energy/)][[2](https://www.forbes.com/sites/ianpalmer/2025/05/27/gas-nuclear-renewables-battle-over-power-for-metas-new-data-center/)][[3](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power)]. The "nuclear renaissance" is manifest in regulatory accelerations, increased federal and state funding, and strategic contracts with Big Tech. Notably, the news cycle includes upgrades for nuclear stocks, significant venture funding rounds for AI-driven nuclear ventures, and government-backed SMR builds—plus ripple effects for upstream uranium miners.
583
+
584
+ **Market sentiment** is bullish on nuclear equities and technology providers. There's tangible momentum pouring into both legacy and disruptive names (especially SMR- and AI-aligned startups), although investors should note that capital costs and regulatory delays remain stubborn risks.
585
+
586
+ ---
587
+
588
+ ## 1. Key Value Signals
589
+
590
+ - **Executive Tailwinds:** New Trump EOs support accelerated licensing, funding, and uranium supply chain resiliency; structural regulatory barriers eased for new builds [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)][[9](https://www.forbes.com/sites/llewellynking/2025/05/31/nuclear-golden-age-huge-potential-stubborn-obstacles/)].
591
+ - **State grants and approvals:** Texas passed a $350M nuclear grant program [[6](https://www.govtech.com/products/texas-senate-passes-350m-grant-program-for-nuclear-power)].
592
+ - **Strategic partnerships and PPAs:** Google and Meta sign nuclear PPA deals; Kairos Power (private SMR leader) lands deals with Big Tech [[3](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power/)].
593
+ - **Startups funded:** Atomic Canyon (AI for nuclear ops) closes $7M seed; strong VC and founder backing [[11](https://techcrunch.com/2025/05/28/atomic-canyon-wants-to-be-chatgpt-for-the-nuclear-industry/)].
594
+ - **Stock Upgrades:** Oklo (OKLO), Centrus Energy (LEU), BWX Technologies (BWXT) upgraded by William Blair, explicitly tied to presidential actions [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)].
595
+ - **Uranium supply buzz:** Direct commentary from GTI Energy (ASX:GTR; uranium) spotlights bullish uranium price/volume thesis [[16](https://mining.com.au/trumps-nuclear-push-ignites-uranium-buzz/)].
596
+ - **Tech-enabled nuclear:** Multiple deals for SMR technologies, digital AI ops, and nuclear for maritime/data infrastructure.
597
+
598
+
599
+ ---
600
+
601
+ ## 2. Stocks or Startups to Watch
602
+
603
+ ### Upgraded or in Play
604
+
605
+ #### Oklo (NASDAQ: OKLO) [Startup, Recent IPO]
606
+ - **What:** Microreactor/SMR company — major White House and sector tailwinds, newly public.
607
+ - **Catalyst:** Upgraded post-Trump EO; top beneficiary per analysts.
608
+ - **Valuation:** Pre-revenue, but tech moat and strategic government/energy partners.
609
+ - **Insider/Smart Money:** Backed by Sam Altman, Peter Thiel [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)].
610
+
611
+ #### Centrus Energy (AMEX: LEU)
612
+ - **What:** Uranium fuel supplier with US-centric value.
613
+ - **Metrics:** P/E ~11, P/B ~2, ROE ~22%; Market Cap ~$1.2B.
614
+ - **Catalyst:** Government support for US supply, upgraded by analysts.
615
+ - **Moat:** Key domestic enrichment capability.
616
+ - [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)], [[17](https://markets.ft.com/data/announce/detail?dockey=600-202505291748PR_NEWS_USPRX____PH99387-1)]
617
+
618
+ #### BWX Technologies (NYSE: BWXT)
619
+ - **What:** Reactors for US Navy (defense moat) & utilities.
620
+ - **Metrics:** P/E ~25, P/B ~5.8, ROE ~36%, Market Cap ~$8.6B.
621
+ - **Catalyst:** Upgrade on presidential support, huge federal contracts.
622
+ - [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)]
623
+
624
+ #### GTI Energy (ASX: GTR)
625
+ - **What:** Small-cap uranium developer, "uranium buzz" name.
626
+ - **Catalyst:** Publicly lauded tailwinds by CEO, levered to US uranium push.
627
+ - [[16](https://mining.com.au/trumps-nuclear-push-ignites-uranium-buzz/)]
628
+
629
+ ### High-Impact Startups
630
+
631
+ #### Atomic Canyon (Private)
632
+ - **What:** AI for nuclear compliance, ops, and maintenance (B2B SaaS).
633
+ - **Catalyst:** Landed Diablo Canyon (major US plant) as client, $7M seed from Energy Impact Partners, Commonweal, Plug and Play, Tower Research, Wischoff.
634
+ - **Signal:** Well-connected investors, strategic bridge between AI and nuclear infra.
635
+ - [[11](https://techcrunch.com/2025/05/28/atomic-canyon-wants-to-be-chatgpt-for-the-nuclear-industry/)], [[12](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium)]
636
+
637
+ #### Kairos Power (Private)
638
+ - **What:** US SMR developer, Google’s first SMR PPA.
639
+ - **Catalyst:** Strategic proof-point for SMR commercialization, signaling major institutional validation.
640
+ - [[3](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power/)]
641
+
642
+ ---
643
+
644
+ ## 3. What Smart Money Might Be Acting On
645
+
646
+ - **Venture backers:** Energy Impact Partners, Plug and Play, Tower Research are betting on Atomic Canyon, validating AI’s inevitable role in nuclear digitization [[12](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium)].
647
+ - **Insider investors:** Sam Altman, Peter Thiel, and other Silicon Valley luminaries are aligned to Oklo, a sign of big-ticket belief in next-gen reactors [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)].
648
+ - **Tech majors:** Google (via SMR PPA with Kairos Power) and Meta (exploring nuclear for data centers) are unlikely to backtrack — durable, volume offtake validation [[3](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power/)], [[2](https://www.forbes.com/sites/ianpalmer/2025/05/27/gas-nuclear-renewables-battle-over-power-for-metas-new-data-center/)].
649
+ - **Active upgrades:** William Blair and others raising targets for BWXT, LEU, and OKLO immediately after White House/regulatory actions [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)].
650
+
651
+ ---
652
+
653
+ ## 4. References
654
+
655
+ - [Forbes: “Why America Must Double Down On Nuclear Energy”](https://www.forbes.com/sites/billfrist/2025/05/29/powering-the-future-why-america-must-double-down-on-nuclear-energy/)
656
+ - [Forbes: “Gas, Nuclear, Renewables Battle Over Power For Meta’s New Data Center”](https://www.forbes.com/sites/ianpalmer/2025/05/27/gas-nuclear-renewables-battle-over-power-for-metas-new-data-center/)
657
+ - [The Guardian: “Tide turning in Europe and beyond in favour of nuclear power”](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power)
658
+ - [Investor's Business Daily: “Trump's 'Consequential' Shift In Energy Policy Fuels Upgrades For These Nuclear Stocks”](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)
659
+ - [GovTech: “Texas Senate Passes $350M Grant Program for Nuclear Power”](https://www.govtech.com/products/texas-senate-passes-350m-grant-program-for-nuclear-power)
660
+ - [TechCrunch: “Atomic Canyon wants to be ChatGPT for the nuclear industry”](https://techcrunch.com/2025/05/28/atomic-canyon-wants-to-be-chatgpt-for-the-nuclear-industry/)
661
+ - [Axios: Venture deal coverage](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium)
662
+ - [Mining.com.au: “Trump’s nuclear push ignites uranium buzz”](https://mining.com.au/trumps-nuclear-push-ignites-uranium-buzz/)
663
+ - [Centrus company announcement](https://markets.ft.com/data/announce/detail?dockey=600-202505291748PR_NEWS_USPRX____PH99387-1)
664
+ - [Insurance Journal: TVA/SMR permit news](https://www.insurancejournal.com/news/southeast/2025/05/27/825158.htm)
665
+
666
+ ---
667
+
668
+ ## 5. Investment Hypothesis
669
+
670
+ The current newsflow marks a **structural inflection point for nuclear energy in the US and allied markets**. Catalyst stacking — from bipartisan support, federal and state grants, White House executive orders, to urgent demand from hyperscale data centers and defense — is driving multiple fundamental and trigger events:
671
+
672
+ - **Oklo (OKLO):** Early-stage, speculative but with tech and regulatory moats, institutional and insider backing, and direct ties to US policy. Potential 5–10x if it achieves early commercial milestones.
673
+ - **Centrus Energy (LEU):** Profitable, unique “picks and shovels” play on US fuel sovereignty, undervalued relative to new cash flows and policy tailwinds.
674
+ - **BWX Technologies (BWXT):** Mid-/large cap with recession-resistant defense and civil reactor businesses; ideal for institutional portfolios seeking balance.
675
+ - **Atomic Canyon:** Private, but a “future pick-and-shovel” for digital ops in nuclear—evidence of VC smart money converging on the sector.
676
+
677
+ **Downside risks:** Regulatory overhangs, cost overruns, and safety/lobbying backlash could impede rapid nuclear scaling—tempering parabolic runs.
678
+
679
+ **Conclusion:**
680
+ **This week’s news cements nuclear as a durable, high-growth infrastructure theme for the next decade with both policy and institutional tailwinds.** Well-run, undervalued or newly upgraded public nuclear stocks—especially with alignment to supply (LEU), defense (BWXT), and innovative new build (OKLO)—present strong upside. Meanwhile, closely follow VC and Big Tech’s footprints for future SMR and AI-software-linked deals.
681
+
682
+ ---
683
+
684
+ **Summary Table: Potential Picks**
685
+
686
+ | Company | Ticker | Market Cap | P/E | ROE | Catalyst |
687
+ | --------------- | ------ | ------------ | ----- | ----- | --------------------------- |
688
+ | Oklo | OKLO | ~$560M | — | — | SMR, gov/insider backing |
689
+ | Centrus Energy | LEU | ~$1.2B | ~11 | ~22% | Uranium, analyst upgrades |
690
+ | BWX Technologies| BWXT | ~$8.6B | ~25 | ~36% | Defense, U.S. Navy, gov’t |
691
+ | GTI Energy | GTR | ~$40M (AUD) | — | — | Uranium, U.S. expansion |
692
+ | Atomic Canyon   | —      | Private      | —     | —     | AI SaaS, Diablo Canyon win  |
693
+ | Kairos Power | — | Private | — | — | Google SMR PPA |
694
+
695
+ *Data based on latest available annual/quarterly filings and estimates; a quick consistency check on these multiples follows.*
696
+
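+ As a sanity check, the table's LEU figures hang together (a minimal sketch using the approximate numbers quoted above):
+
+ ```python
+ # Back out implied earnings and book value for Centrus (LEU) from the
+ # table's approximate figures: Market Cap ~$1.2B, P/E ~11, ROE ~22%.
+ market_cap = 1.2e9  # ~$1.2B
+ pe = 11             # ~11x
+ roe = 0.22          # ~22%
+
+ earnings = market_cap / pe            # ~$109M implied net income
+ book_value = earnings / roe           # ~$496M implied equity
+ implied_pb = market_cap / book_value  # ~2.4x, close to the P/B ~2 cited earlier
+
+ print(f"implied earnings ~${earnings / 1e6:.0f}M")
+ print(f"implied book value ~${book_value / 1e6:.0f}M")
+ print(f"implied P/B ~{implied_pb:.1f}x")
+ ```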
697
+ ---
698
+
699
+ ---
700
+
data/mining_2025-07-04.md ADDED
@@ -0,0 +1,89 @@
1
+
2
+ > Topic: `Mining`
3
+ > Articles Collected: `172`
4
+ > Generated: `2025-07-04 14:17`
5
+ >
6
+ # Weekly Value Investing Memo: Mining Sector (June 28 - July 4, 2025)
7
+
8
+ ---
9
+
10
+ ## Overview: Sentiment & Market Trends
11
+
12
+ The mining industry sees a highly dynamic week, marked by continued innovation in automation, M&A activity in minerals (especially gold and copper), and further fundraising across both established players and small caps. Macro themes include:
13
+ - **Global demand for critical minerals (EVs, batteries) fueling competition for assets**.
14
+ - **Rising capital investment driven by gold's record rally** and the green transition’s need for base metals.
15
+ - **Persistently challenging regulatory and policy environments**, particularly in emerging markets (Nigeria, South Africa).
16
+ - **Strong capital flows into mining equipment and automation**, hinting at structurally higher sector profitability for lean, tech-driven operators.
17
+
18
+ ---
19
+
20
+ ## 1. Key Value Signals
21
+
22
+ - **High insider/institutional participation in miners and suppliers**: Examples include Cascadia Minerals’ successful oversubscribed placement and large-scale deals like Zijin’s $1.2B Kazakh gold mine buy.
23
+ - **Sector consolidation**: Small/mid-cap mergers (Brightstar-Aurumin talks) may unlock scale and cost synergies, often a value signal.
24
+ - **Strategic acquisitions and funding rounds in upstream enablers**: Notably, Terra CO2’s $124M for green cement and the University of Queensland’s training program reflect future-oriented bets.
25
+ - **Investor interest in mining equipment, automation, and sustainability**: Industry reports highlight accelerating tech adoption, with larger miners seeking cost controls and ESG advantages.
26
+ - **Government and regulatory headwinds an explicit risk**: Policy ambiguity or hostile regimes (Nigeria, South Africa) remain a clear negative screen.
27
+
28
+ ---
29
+
30
+ ## 2. Stocks or Startups to Watch
31
+
32
+ ### **Cascadia Minerals Ltd. [TSXV: CAM]**
33
+ - **Event**: Raised C$2.27M in oversubscribed private placement for the acquisition of Granite Creek Copper and funding of the Carmacks Project.
34
+ - **Value Angle**: Small-cap, significant insider demand, focus on copper (structural under-supply theme). Acquisition could add scale and resource upside.
35
+ - **Fundamentals**: Pre-revenue, but well-capitalized, strategic Yukon assets, optionality on copper cycle. Monitor P/B and dilution risk post-acquisition.
36
+
37
+ ### **Terra CO2**
38
+ - **Event**: Secured $124M Series B for low-carbon cement product.
39
+ - **Value Angle**: Downstream of mining (aggregates, cement); sits at the ESG/green infra nexus. Major institutional support signals sector-wide bet on carbon reduction in heavy industry.
40
+ - **Fundamentals**: Still private, but possible IPO watch for first-mover “green cement” plays with mining tie-ins.
41
+
42
+ ### **Brightstar Resources [ASX: BTR] / Aurumin [ASX: AUN]**
43
+ - **Event**: Potential merger under negotiation (Central Sandstone, WA gold tenements).
44
+ - **Value Angle**: Sector consolidation at the small-cap level; possible cost reduction, resource optimization. Neither yet a sector leader but could unlock scale economics if deal completes.
45
+ - **Fundamentals**: Consider on basis of NAV discount, debt levels, historic cash burn.
46
+
47
+ ### **Zijin Mining [HKG: 2899]**
48
+ - **Event**: $1.2B acquisition of a Kazakh gold mine; pursuing HK listing of international assets.
49
+ - **Value Angle**: Massive balance sheet, levered to gold, aggressive expansion. Not classic ‘cheap’ value, but a play on size/moat, Chinese state alignment, and precious metals bull run.
50
+
51
+ ---
52
+
53
+ ## 3. What Smart Money Might Be Acting On
54
+
55
+ - **Resource-constrained supply chains**: Institutions chasing assets in the copper, gold, and specialty metals space for long-term price support; Cascadia’s oversubscribed raise hints at smart capital flow into critical minerals.
56
+ - **Green and tech-enabled mining infrastructure**: Funds flowing into equipment/automation as large miners invest to cut OPEX and meet sustainability mandates.
57
+ - **Early-stage innovation bets**: University/industry collabs (Wheaton/UBC, MRIWA scholarships) suggest VC/PE will chase enabling tech, not just resource ownership.
58
+ - **Selective asset consolidation**: Sophisticated holders may see sub-scale gold/copper/junior plays as efficient entry points during cyclical troughs or when M&A premiums are small.
59
+ - **Avoidance of poorly governed or policy-risked geographies**: Smart money is likely avoiding high regulatory risk countries (Nigeria, South Africa) unless assets are truly world-class.
60
+
61
+ ---
62
+
63
+ ## 4. References
64
+
65
+ - [Cascadia Minerals oversubscribed financing (TipRanks)](https://www.tipranks.com/news/company-announcements/cascadia-minerals-secures-c2-27m-in-oversubscribed-financing-for-strategic-acquisition)
66
+ - [Terra CO2 $124M Series B (Startup Ecosystem Canada)](https://www.startupecosystem.ca/news/terra-co2-secures-124m-series-b-for-low-carbon-cement/)
67
+ - [Zijin Mining’s Kazakh gold mine buy (Mining.com)](https://www.mining.com/web/zijin-mining-to-acquire-kazakh-gold-mine-in-1-2b-deal/)
68
+ - [Brightstar Resources/Aurumin merger discussion (Mining.com.au)](https://mining.com.au/brightstar-probes-aurumin-merger-discussions/)
69
+ - [Mining equipment/automation market surge (openPR)](https://www.openpr.com/news/4092566/global-mining-equipment-market-surges-amid-automation-green)
70
+ - [Wheaton $1M Future of Mining Challenge (Mining.com)](https://www.mining.com/blog/wheaton-precious-metals-brings-back-1m-future-of-mining-challenge)
71
+
72
+ ---
73
+
74
+ ## 5. Investment Hypothesis
75
+
76
+ The mining sector is entering another phase of capital allocation discipline, flagged by:
77
+ - Minor vs. major M&A as juniors combine to gain efficiency,
78
+ - Technology/adoption waves as automation and green mandates favor opex-lean operators,
79
+ - Smart money preferring critical minerals, automation, and ESG-enabled suppliers,
80
+ - And value found in overlooked small-caps pursuing strategic, low-cost acquisitions (see Cascadia Minerals), especially those exposed to copper/gold.
81
+
82
+ Mining’s cyclical, capital-intensive nature means margins will accrue to firms with solid moats (resource quality, cost, governance). The best value is likely among well-financed, proven junior miners with clear catalysts (M&A, new discoveries, scale), or private enablers with a roadmap to public markets (like Terra CO2).
83
+
84
+ Regulatory and macro risks (e.g., policy instability in Nigeria/South Africa) make jurisdiction and balance sheet strength paramount for downside protection. Investors should use screeners (P/E, P/B, ROE, FCF) to filter for relative value, but back this with assessment of jurisdictional and operational risk.
85
+
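+ A minimal sketch of such a screen, treating jurisdiction as a hard filter rather than just another ratio (all rows are illustrative placeholders, not real data):
+
+ ```python
+ # Ratio screen plus jurisdiction filter, per the note above.
+ # All rows are illustrative placeholders, not real fundamentals.
+ import pandas as pd
+
+ miners = pd.DataFrame(
+     [
+         {"name": "JuniorCopperCo",   "pe": 9.0,  "pb": 0.8, "roe": 0.18,  "fcf_pos": True,  "risk": "low"},
+         {"name": "GoldStartupInc",   "pe": None, "pb": 2.5, "roe": -0.05, "fcf_pos": False, "risk": "low"},
+         {"name": "FrontierMinerals", "pe": 4.0,  "pb": 0.5, "roe": 0.25,  "fcf_pos": True,  "risk": "high"},
+     ]
+ )
+
+ # Hard filter first: drop high-regulatory-risk geographies regardless of ratios.
+ safe = miners[miners.risk != "high"]
+ # Then the relative-value screen on the survivors.
+ value = safe[safe.pe.notna() & (safe.pe < 12) & (safe.pb < 1.5) & (safe.roe > 0.10) & safe.fcf_pos]
+ print(value)
+ ```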
86
+ ---
87
+
88
+ ### Conclusion:
89
+ **Watch for follow-on financings, consolidation deals, and private-to-public transition among mining innovators and critical mineral players. Prioritize companies with clear capital discipline, high insider/institutional ownership, and strong strategic rationale for growth.**
data/nuclear_energy_2025-06-03.md ADDED
@@ -0,0 +1,130 @@
1
+ ![nuclear energy](https://images.unsplash.com/photo-1630142895963-6996ae6b3a5b?crop=entropy&cs=tinysrgb&fit=max&fm=jpg&ixid=M3w1NzIzMjF8MHwxfHNlYXJjaHwxfHxudWNsZWFyJTIwZW5lcmd5fGVufDB8MHx8fDE3NDg5MzM1NDJ8MA&ixlib=rb-4.1.0&q=80&w=1080)
2
+
3
+ *Photo by <a href="https://unsplash.com/@llehotsky" target="_blank">Lukáš Lehotský</a> on <a href="https://unsplash.com" target="_blank">Unsplash</a>*
4
+
5
+
6
+ > **🧠 Metrics**
7
+ > - Topic: `nuclear energy`
8
+ > - Articles Collected: `69`
9
+ > - Generated: `2025-06-03 10:10`
10
+ >
11
+ # Nuclear Energy Value Investing Memo – Week Ending June 2, 2025
12
+
13
+ ## General Situation & Market Summary
14
+
15
+ This week marks a decisive shift for nuclear energy, fueled by sweeping pro-nuclear executive orders from the Trump administration, robust bipartisan support at the state and federal levels, and increased corporate demand from hyperscale data center operators such as Meta and Google [[1](https://www.forbes.com/sites/billfrist/2025/05/29/powering-the-future-why-america-must-double-down-on-nuclear-energy/)][[2](https://www.forbes.com/sites/ianpalmer/2025/05/27/gas-nuclear-renewables-battle-over-power-for-metas-new-data-center/)][[3](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power)]. The "nuclear renaissance" is manifest in regulatory accelerations, increased federal and state funding, and strategic contracts with Big Tech. Notably, the news cycle includes upgrades for nuclear stocks, significant venture funding rounds for AI-driven nuclear ventures, and government-backed SMR builds—plus ripple effects for upstream uranium miners.
16
+
17
+ **Market sentiment** is bullish on nuclear equities and technology providers. There's tangible momentum pouring into both legacy and disruptive names (especially SMR- and AI-aligned startups), although investors should note that capital costs and regulatory delays remain stubborn risks.
18
+
19
+ ---
20
+
21
+ ## 1. Key Value Signals
22
+
23
+ - **Executive Tailwinds:** New Trump EOs support accelerated licensing, funding, and uranium supply chain resiliency; structural regulatory barriers eased for new builds [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)][[9](https://www.forbes.com/sites/llewellynking/2025/05/31/nuclear-golden-age-huge-potential-stubborn-obstacles/)].
24
+ - **State grants and approvals:** Texas passed a $350M nuclear grant program [[6](https://www.govtech.com/products/texas-senate-passes-350m-grant-program-for-nuclear-power)].
25
+ - **Strategic partnerships and PPAs:** Google and Meta sign nuclear PPA deals; Kairos Power (private SMR leader) lands deals with Big Tech [[3](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power/)].
26
+ - **Startups funded:** Atomic Canyon (AI for nuclear ops) closes $7M seed; strong VC and founder backing [[11](https://techcrunch.com/2025/05/28/atomic-canyon-wants-to-be-chatgpt-for-the-nuclear-industry/)].
27
+ - **Stock Upgrades:** Oklo (OKLO), Centrus Energy (LEU), BWX Technologies (BWXT) upgraded by William Blair, explicitly tied to presidential actions [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)].
28
+ - **Uranium supply buzz:** Direct commentary from GTI Energy (ASX:GTR; uranium) spotlights bullish uranium price/volume thesis [[16](https://mining.com.au/trumps-nuclear-push-ignites-uranium-buzz/)].
29
+ - **Tech-enabled nuclear:** Multiple deals for SMR technologies, digital AI ops, and nuclear for maritime/data infrastructure.
30
+
31
+
32
+ ---
33
+
34
+ ## 2. Stocks or Startups to Watch
35
+
36
+ ### Upgraded or in Play
37
+
38
+ #### Oklo (NASDAQ: OKLO) [Startup, Recent IPO]
39
+ - **What:** Microreactor/SMR company — major White House and sector tailwinds, newly public.
40
+ - **Catalyst:** Upgraded post-Trump EO; top beneficiary per analysts.
41
+ - **Valuation:** Pre-revenue, but tech moat and strategic government/energy partners.
42
+ - **Insider/Smart Money:** Backed by Sam Altman, Peter Thiel [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)].
43
+
44
+ #### Centrus Energy (AMEX: LEU)
45
+ - **What:** Uranium fuel supplier with US-centric value.
46
+ - **Metrics:** P/E ~11, P/B ~2, ROE ~22%; Market Cap ~$1.2B.
47
+ - **Catalyst:** Government support for US supply, upgraded by analysts.
48
+ - **Moat:** Key domestic enrichment capability.
49
+ - [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)], [[17](https://markets.ft.com/data/announce/detail?dockey=600-202505291748PR_NEWS_USPRX____PH99387-1)]
50
+
51
+ #### BWX Technologies (NYSE: BWXT)
52
+ - **What:** Reactors for US Navy (defense moat) & utilities.
53
+ - **Metrics:** P/E ~25, P/B ~5.8, ROE ~36%, Market Cap ~$8.6B.
54
+ - **Catalyst:** Upgrade on presidential support, huge federal contracts.
55
+ - [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)]
56
+
57
+ #### GTI Energy (ASX: GTR)
58
+ - **What:** Small-cap uranium developer, "uranium buzz" name.
59
+ - **Catalyst:** Publicly lauded tailwinds by CEO, levered to US uranium push.
60
+ - [[16](https://mining.com.au/trumps-nuclear-push-ignites-uranium-buzz/)]
61
+
62
+ ### High-Impact Startups
63
+
64
+ #### Atomic Canyon (Private)
65
+ - **What:** AI for nuclear compliance, ops, and maintenance (B2B SaaS).
66
+ - **Catalyst:** Landed Diablo Canyon (major US plant) as client, $7M seed from Energy Impact Partners, Commonweal, Plug and Play, Tower Research, Wischoff.
67
+ - **Signal:** Well-connected investors, strategic bridge between AI and nuclear infra.
68
+ - [[11](https://techcrunch.com/2025/05/28/atomic-canyon-wants-to-be-chatgpt-for-the-nuclear-industry/)], [[12](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium)]
69
+
70
+ #### Kairos Power (Private)
71
+ - **What:** US SMR developer, Google’s first SMR PPA.
72
+ - **Catalyst:** Strategic proof-point for SMR commercialization, signaling major institutional validation.
73
+ - [[3](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power/)]
74
+
75
+ ---
76
+
77
+ ## 3. What Smart Money Might Be Acting On
78
+
79
+ - **Venture backers:** Energy Impact Partners, Plug and Play, Tower Research are betting on Atomic Canyon, validating AI’s inevitable role in nuclear digitization [[12](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium)].
80
+ - **Insider investors:** Sam Altman, Peter Thiel, and other Silicon Valley luminaries are aligned to Oklo, a sign of big-ticket belief in next-gen reactors [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)].
81
+ - **Tech majors:** Google (via SMR PPA with Kairos Power) and Meta (exploring nuclear for data centers) are unlikely to backtrack — durable, volume offtake validation [[3](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power/)], [[2](https://www.forbes.com/sites/ianpalmer/2025/05/27/gas-nuclear-renewables-battle-over-power-for-metas-new-data-center/)].
82
+ - **Active upgrades:** William Blair and others raising targets for BWXT, LEU, and OKLO immediately after White House/regulatory actions [[4](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)].
83
+
84
+ ---
85
+
86
+ ## 4. References
87
+
88
+ - [Forbes: “Why America Must Double Down On Nuclear Energy”](https://www.forbes.com/sites/billfrist/2025/05/29/powering-the-future-why-america-must-double-down-on-nuclear-energy/)
89
+ - [Forbes: “Gas, Nuclear, Renewables Battle Over Power For Meta’s New Data Center”](https://www.forbes.com/sites/ianpalmer/2025/05/27/gas-nuclear-renewables-battle-over-power-for-metas-new-data-center/)
90
+ - [The Guardian: “Tide turning in Europe and beyond in favour of nuclear power”](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power)
91
+ - [Investor's Business Daily: “Trump's 'Consequential' Shift In Energy Policy Fuels Upgrades For These Nuclear Stocks”](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)
92
+ - [GovTech: “Texas Senate Passes $350M Grant Program for Nuclear Power”](https://www.govtech.com/products/texas-senate-passes-350m-grant-program-for-nuclear-power)
93
+ - [TechCrunch: “Atomic Canyon wants to be ChatGPT for the nuclear industry”](https://techcrunch.com/2025/05/28/atomic-canyon-wants-to-be-chatgpt-for-the-nuclear-industry/)
94
+ - [Axios: Venture deal coverage](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium)
95
+ - [Mining.com.au: “Trump’s nuclear push ignites uranium buzz”](https://mining.com.au/trumps-nuclear-push-ignites-uranium-buzz/)
96
+ - [Centrus company announcement](https://markets.ft.com/data/announce/detail?dockey=600-202505291748PR_NEWS_USPRX____PH99387-1)
97
+ - [Insurance Journal: TVA/SMR permit news](https://www.insurancejournal.com/news/southeast/2025/05/27/825158.htm)
98
+
99
+ ---
100
+
101
+ ## 5. Investment Hypothesis
102
+
103
+ The current newsflow marks a **structural inflection point for nuclear energy in the US and allied markets**. Catalyst stacking — from bipartisan support, federal and state grants, White House executive orders, to urgent demand from hyperscale data centers and defense — is driving multiple fundamental and trigger events:
104
+
105
+ - **Oklo (OKLO):** Early-stage, speculative but with tech and regulatory moats, institutional and insider backing, and direct ties to US policy. Potential 5–10x if it achieves early commercial milestones.
106
+ - **Centrus Energy (LEU):** Profitable, unique “picks and shovels” play on US fuel sovereignty, undervalued relative to new cash flows and policy tailwinds.
107
+ - **BWX Technologies (BWXT):** Mid-/large cap with recession-resistant defense and civil reactor businesses; ideal for institutional portfolios seeking balance.
108
+ - **Atomic Canyon:** Private, but a “future pick-and-shovel” for digital ops in nuclear—evidence of VC smart money converging on the sector.
109
+
110
+ **Downside risks:** Regulatory overhangs, cost overruns, and safety/lobbying backlash could impede rapid nuclear scaling—tempering parabolic runs.
111
+
112
+ **Conclusion:**
113
+ **This week’s news cements nuclear as a durable, high-growth infrastructure theme for the next decade with both policy and institutional tailwinds.** Well-run, undervalued or newly upgraded public nuclear stocks—especially with alignment to supply (LEU), defense (BWXT), and innovative new build (OKLO)—present strong upside. Meanwhile, closely follow VC and Big Tech’s footprints for future SMR and AI-software-linked deals.
114
+
115
+ ---
116
+
117
+ **Summary Table: Potential Picks**
118
+
119
+ | Company | Ticker | Market Cap | P/E | ROE | Catalyst |
120
+ | --------------- | ------ | ------------ | ----- | ----- | --------------------------- |
121
+ | Oklo | OKLO | ~$560M | — | — | SMR, gov/insider backing |
122
+ | Centrus Energy | LEU | ~$1.2B | ~11 | ~22% | Uranium, analyst upgrades |
123
+ | BWX Technologies| BWXT | ~$8.6B | ~25 | ~36% | Defense, U.S. Navy, gov’t |
124
+ | GTI Energy | GTR | ~$40M (AUD) | — | — | Uranium, U.S. expansion |
125
+ | Atomic Canyon   | —      | Private      | —     | —     | AI SaaS, Diablo Canyon win  |
126
+ | Kairos Power | — | Private | — | — | Google SMR PPA |
127
+
128
+ *Data based on latest available annual/quarterly filings and estimates.*
129
+
130
+ ---
data/nuclear_energy_2025-06-03_1.md ADDED
@@ -0,0 +1,111 @@
1
+
2
+ > **Metrics**
3
+ > Topic: `nuclear energy`
4
+ > Articles Collected: `60`
5
+ > Generated: `2025-06-03 11:52`
6
+ >
7
+ # Nuclear Energy: Value Investing Focus – Week Ending June 2, 2025
8
+
9
+ ---
10
+ ## Intro: Market Context and Week Summary
11
+
12
+ Nuclear energy took center stage this week, driven by major executive moves in U.S. energy policy, heightened demand from AI/data centers, and investor/VC excitement about SMRs (small modular reactors). With Trump’s administration rolling out pro-nuclear executive orders and Europe/Asia accelerating new builds, public and private capital is steadily shifting back into nuclear plays. The macro environment is bullish: regulatory timelines are shortening, capital support is rising, and energy stability/cleanliness place nuclear above wind and solar in AI-focused grid conversations. On the ground: several companies (including Oklo, BWX Technologies, and Centrus) received analyst upgrades, utilities are racing to deploy SMRs, and nuclear-tech startups are pulling in fresh VC funds. Smart money is watching supply chains (uranium), next-gen reactors, and infrastructure/enabling tech for nuclear’s new "golden age."
13
+
14
+ ---
15
+
16
+ ## 1. Key Value Signals
17
+
18
+ - **Major U.S. Policy Shift**: New Trump administration executive orders to accelerate nuclear tech approval, reduce permitting times and support uranium supply chains ([Investor's Business Daily](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/), [Forbes](https://www.forbes.com/sites/llewellynking/2025/05/31/nuclear-golden-age-huge-potential-stubborn-obstacles/)).
19
+ - **Big Tech Partnership Moves**: Google (and earlier, Meta) inking first agreements with small modular reactor developers ([The Guardian](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power)).
20
+ - **Startups & VC Funding Rounds**: Atomic Canyon (AI for nuclear), Kairos Power, and others drawing new funding ([Axios](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium), [TechCrunch](https://techcrunch.com/2025/05/28/atomic-canyon-wants-to-be-chatgpt-for-the-nuclear-industry/)).
21
+ - **Utility Action on SMRs**: TVA becomes first U.S. utility to seek permit for SMR, indicating a path for future orders ([Insurance Journal](https://www.insurancejournal.com/news/southeast/2025/05/27/825158.htm)).
22
+ - **Analyst Upgrades and Insider Buys**: Oklo (OKLO), Centrus Energy (LEU), and BWX Technologies (BWXT) upgraded ([Investor's Business Daily](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)).
23
+ - **Strong Fundamental Tailwinds**:
24
+ - **Low P/E, Strong ROE/FCF**: Several nuclear/uranium plays trading below market P/E, generating high free cash flow, with secular macro demand increases.
25
+ - **Moats Emerging**: Through regulatory complexity, IP, and public-private partnerships.
26
+
27
+ ---
28
+
29
+ ## 2. Stocks or Startups to Watch
30
+
31
+ ### **Listed Stocks**
32
+
33
+ #### **Oklo (OKLO)**
34
+ - **Trigger:** Analyst upgrades post-Trump nuclear EO, SMR play, strong U.S. government support ([Investor's Business Daily](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/))
35
+ - **Fundamentals:** Newly public (<6 months), early FMC/S-1 data. Moat: First SMR in pipeline, government/tech sector contracts.
36
+ - **Metric:** Expected SMR deployment, contract pipeline not yet priced in.
37
+
38
+ #### **Centrus Energy (LEU)**
39
+ - **Trigger:** Upgraded, uranium supply chain play; critical to new U.S. nuclear push ([Investor's Business Daily](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/))
40
+ - **P/E:** ~13 ([Yahoo Finance](https://finance.yahoo.com/quote/LEU/))
41
+ - **ROE:** ~27%
42
+ - **Market Cap:** ~$650M
43
+ - **Comment:** Only U.S. uranium enrichment capability, crucial as U.S. looks to de-risk from Russia ([Mining.com.au](https://mining.com.au/trumps-nuclear-push-ignites-uranium-buzz/)).
44
+
45
+ #### **BWX Technologies (BWXT)**
46
+ - **Trigger:** Major reactor supplier for U.S. Navy and DoE, among first to benefit from process acceleration ([Investor's Business Daily](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)).
47
+ - **P/E:** ~24
48
+ - **ROE:** ~35%
49
+ - **Moat:** Navy sole-source positioning, R&D, U.S. government contracts.
50
+ - **Market Cap:** ~$10B
51
+
52
+ #### **NuScale Power (SMR)**
53
+ - **Trigger:** NRC has approved SMR design, clearing path for deployment ([Utility Dive](https://www.utilitydive.com/news/nrc-approves-nuscale-small-modular-reactor-smr/749538/))
54
+ - **Metric:** High short interest post-IPO, but new regulatory tailwinds. Watch for major contract wins.
55
+
56
+ #### **Paladin Energy (PDN.AX)**
57
+ - **Trigger:** Making moves at Patterson Lake as uranium demand surges with U.S. and global SMR build ([Mining.com.au](https://mining.com.au/paladin-proceeds-at-patterson-lake/)).
58
+ - **Comment:** Undervalued relative to long-term uranium price upcycle.
59
+
60
+ ### **Startups & Undercapitalized Opportunities**
61
+
62
+ - **Atomic Canyon**: AI-powered B2B software for nuclear industry. Raised $7M seed led by Energy Impact Partners (backers of several energy unicorns). Aim: “ChatGPT for nuclear” ([TechCrunch](https://techcrunch.com/2025/05/28/atomic-canyon-wants-to-be-chatgpt-for-the-nuclear-industry/))
63
+
64
+ - **Kairos Power**: Leading small modular reactor startup; Google is the first customer for its future SMR energy (direct-purchase PPA) ([The Guardian](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power))
65
+
66
+ - **Type One Energy**: Fusion startup, just completed formal initial design review ([Power Magazine](https://www.powermag.com/avangrid-investing-41-million-to-rebuild-ny-grid-infrastructure/)).
67
+
68
+ ---
69
+
70
+ ## 3. What Smart Money Might Be Acting On
71
+
72
+ - **Venture/Institutional**: Top-tier VCs (Energy Impact Partners, Plug and Play, Tower Research) making preemptive moves into enabling tech/software (e.g., Atomic Canyon).
73
+ - **Corporate Power Users (Big Tech)**: Google, Meta inking deals with SMR startups—future demand signal for new nuclear ([The Guardian](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power)).
74
+ - **Analyst Coverage/Upgrades**: William Blair’s initiation on OKLO, LEU, and BWXT signals Wall Street is waking up to regulatory + macro catalysts ([Investor's Business Daily](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)).
75
+ - **Utilities/State Action**: TVA and Texas moving to lead SMR deployment and streamline permitting—possible template for state-federal partnerships ([Insurance Journal](https://www.insurancejournal.com/news/southeast/2025/05/27/825158.htm), [GovTech](https://www.govtech.com/products/texas-senate-passes-350m-grant-program-for-nuclear-power)).
76
+ - **Insider-Led Companies**: Centrus Energy (LEU, ex-government insiders, U.S.-centric contracts), Oklo (deep government, tech ecosystem relationships).
77
+
78
+ ---
79
+
80
+ ## 4. References/Sources
81
+
82
+ - [Forbes - U.S. must double down on nuclear](https://www.forbes.com/sites/billfrist/2025/05/29/powering-the-future-why-america-must-double-down-on-nuclear-energy/)
83
+ - [Forbes - Data Center Energy Wars](https://www.forbes.com/sites/ianpalmer/2025/05/27/gas-nuclear-renewables-battle-over-power-for-metas-new-data-center/)
84
+ - [The Guardian - Tech firms buy SMR power](https://www.theguardian.com/environment/2025/jun/01/tide-turning-europe-beyond-favour-nuclear-power)
85
+ - [Investor's Business Daily - Nuclear stocks upgraded](https://www.investors.com/news/trump-executive-orders-fuel-nuclear-stocks-upgrade-stock-market/)
86
+ - [Axios - Atomic Canyon B2B seed](https://www.axios.com/pro/all-deals/2025/05/28/first-look-pro-rata-premium)
87
+ - [TechCrunch - Atomic Canyon profile](https://techcrunch.com/2025/05/28/atomic-canyon-wants-to-be-chatgpt-for-the-nuclear-industry/)
88
+ - [Insurance Journal - TVA SMR permit](https://www.insurancejournal.com/news/southeast/2025/05/27/825158.htm)
89
+ - [Utility Dive – NRC approves NuScale SMR design](https://www.utilitydive.com/news/nrc-approves-nuscale-small-modular-reactor-smr/749538/)
90
+ - [Mining.com.au – Centrus/Paladin/uranium momentum](https://mining.com.au/trumps-nuclear-push-ignites-uranium-buzz/)
91
+ - [Yahoo Finance – LEU Key Stats](https://finance.yahoo.com/quote/LEU/)
92
+
93
+ ---
94
+
95
+ ## 5. Investment Hypothesis
96
+
97
+ **Thesis:**
98
+ Recent regulatory and policy catalysts have created a structural tailwind for both incumbent and next-gen nuclear energy firms, particularly those exposed to SMRs, uranium refining, and critical enabling tech/software. The current market underappreciates the scale and allocation speed of coming capital inflows (from utilities, governments, and cloud and data-center majors). Valuations (esp. in uranium and contractors) remain attractive on a P/E and FCF basis compared to wind/solar.
99
+
100
+ - **Buy candidates:** Oklo (OKLO), Centrus (LEU), BWX Technologies (BWXT), Paladin (PDN.AX), NuScale (SMR)
101
+ - **Venture/early-exposure:** Consider gaining VC fund/PE exposure to emerging nuclear tech/software infrastructure (e.g., Atomic Canyon, Kairos Power).
102
+ - **Rationale:** U.S./global policy, increased AI power grid demand, and high barriers to entry combine for exceptional medium/long-term risk/reward—especially after this week’s “regime change” in sentiment and regulation.
103
+
104
+ **Monitor:**
105
+ New contract wins for SMR developers. U.S. uranium production and enrichment capacity (LEU). Expansion or new partnerships with tech/utility majors. Insider ownership trends and further analyst coverage for nuclear sector plays.
106
+
107
+ ---
108
+
109
+ ### Overall: This week’s news offers a clear “green light” for value investors in nuclear, particularly those seeking both deep value (LEU, BWXT) and long-tail growth via platform/SMR innovators (OKLO, Kairos, NuScale). U.S. government and major tech-firm endorsement serves as powerful affirmation for the sector’s re-rating.
110
+
111
+ ---
data/nuclear_energy_2025-07-02.md ADDED
@@ -0,0 +1,133 @@
1
+
2
+ > Topic: `nuclear energy`
3
+ > Articles Collected: `133`
4
+ > Generated: `2025-07-02 20:18`
5
+ >
6
+ # Nuclear Energy Weekly Value Investing Memo
7
+ **Week of July 1, 2025**
8
+
9
+ ---
10
+
11
+ ### **Market Sentiment & Trends**
12
+ This week’s news reconfirms nuclear energy’s rising status as both a grid-reliability solution and a strategic power source for tech and industrial growth. Demand drivers include:
13
+ - Growing AI/data center needs (Google, Microsoft, Amazon heavily engaged)
14
+ - Policy tailwinds and new US DOE initiatives
15
+ - New partnerships and investments from leading tech and engineering firms
16
+ - Heightened urgency, both industrially and politically, for next-gen nuclear and advanced enrichment.
17
+
18
+ The overall sentiment is incrementally positive: there’s powerful momentum for nuclear expansion (especially advanced/small modular/fusion), but major regulatory, funding, and execution risks remain.
19
+
20
+ ---
21
+
22
+ ## 1. **Key Value Signals**
23
+
24
+ - **Big Tech Putting Capital to Work**: Google commits to buying electricity from both *fusion* (Commonwealth Fusion Systems) and *fission* (Kairos Power—an SMR startup), signaling a long-term offtake demand for clean nuclear output. These deals, while years out, anchor real business models and future cash flows in an industry where certainty has been rare.
25
+
26
+ - **DOE Fast-Tracks Advanced Nuclear**: The US Department of Energy (DOE) launched a pilot program to authorize *private* test reactors—removing a longstanding barrier for early-stage and test deployments. This regulatory facilitation could accelerate revenue opportunities for startups.
27
+
28
+ - **AI Meets Nuclear Construction**: Palantir—a leader in data analytics—announced its software will drive efficiency in reactor construction (with “The Nuclear Company”), signaling an ecosystem of digital infrastructure forming around new builds.
29
+
30
+ - **Strategic Collaborations**: Oklo (recent SPAC, high-profile leadership) and Bill Gates’ TerraPower signed a partnership around domestic HALEU enrichment—critical for next-generation reactors and a US supply chain play.
31
+
32
+ - **Major Fusion Funding**: Westinghouse and ITER sign a $180M contract to push fusion technology, while global fusion market size forecasts surge.
33
+
34
+ - **IPO and Recent SPAC Activity**: Oklo’s public listing, ongoing chatter around SMR startups seeking either funding or public exits.
35
+
36
+ ---
37
+
38
+ ## 2. **Stocks or Startups to Watch**
39
+
40
+ **A. Public/Recent IPO & Small Cap Opportunities**
41
+ - **Oklo (NYSE: OKLO)**
42
+ - **Profile**: Recent SPAC debut; backed by substantial leadership and Bill Gates’ circle via TerraPower collaboration.
43
+ - **Signals**: Strategic partnerships, domestic enrichment angle, close alignment with DOE pilot regulatory streamlining.
44
+ - **Check**: Valuation (historically rich for early-stage nuclear), business execution, and regulatory milestones.
45
+
46
+ - **Kairos Power (private, but IPO/speculation possible)**
47
+ - **Profile**: Small modular reactor company. Google offtake deal is a significant vote of confidence.
48
+ - **Signals**: Market validation, long-term revenue anchor (if plant comes online).
49
+
50
+ - **Commonwealth Fusion Systems (private)**
51
+ - **Profile**: Leading fusion startup; Google as an offtaker/investor.
52
+ - **Signals**: Earliest in its lifecycle, but with elite backing. Watch for pre-IPO funding rounds and cap table changes.
53
+
54
+ **B. Established, Undervalued Nuclear Plays (Check Valuation/Fundamentals)**
55
+ - **BWX Technologies (NYSE: BWXT)**
56
+ - **Profile**: Established supplier for nuclear reactors and specialized components.
57
+ - **Moat**: Deep US government/defense contracts, emerging advanced reactor supply role.
58
+ - **Valuation**: P/E ratio tends to be market-comparable, but free cash flow strong and recurring revenue profile.
59
+ - **Signal**: Exposure to multiple advanced reactor programs, SMR rollout, and robust political support.
60
+
61
+ - **Centrus Energy (NYSEMKT: LEU)**
62
+ - **Profile**: Only US public company with commercial uranium enrichment capability—potential HALEU winner.
63
+ - **Signals**: Vital for fueling advanced reactors; highly levered to new DOE policies.
64
+ - **Risks**: Small cap, volatile, but high convexity if advanced nuclear takes off in '26+.
65
+
66
+ **C. Infrastructure, EPC, and Software**
67
+ - **Palantir Technologies (NYSE: PLTR)**
68
+ - **Profile**: Now branching into nuclear with specialized construction/efficiency software.
69
+ - **Signal**: Long-term, stickier defense/critical infrastructure business.
70
+
71
+ ---
72
+
73
+ ## 3. **What Smart Money Might Be Acting On**
74
+
75
+ - **Pre-emptive Strategic Investment**: Major techs (Google especially) are locking in low-carbon electricity contracts before physical infrastructure is built. Early investor entry into fusion/SMR supply chains could offer “picks & shovels” asymmetry.
76
+
77
+ - **Pivot to Domestic Supply Chain**: Oklo/TerraPower collaboration for HALEU enrichment directly addresses “made in America” energy/defense policy. This is the tip of a deglobalization and re-onshoring trend—any US enrichment or SMR component supplier could be in play.
78
+
79
+ - **Software/Services Layer**: The nuclear restart will bring new opportunities for “enabling” firms: EPC (AECOM, AtkinsRéalis, Arup), digital twins/AI (Palantir), and regulatory facilitators.
80
+
81
+ - **Advanced Reactor “First Movers”**: Policy support (DOE program) will favor companies close to deployment/breakthrough—those that can move from pilot to cash generation by 2026-2030. Early capital and regulatory champions could see premium returns.
82
+
83
+ ---
84
+
85
+ ## 4. **References**
86
+
87
+ - [Google’s Data Center Bets — TechCrunch](https://techcrunch.com/2025/07/01/googles-data-center-energy-use-doubled-in-four-years/)
88
+ - [US DOE Pilot Program — POWER Magazine](https://www.powermag.com/doe-pilot-program-targets-three-nuclear-test-reactors-for-2026-criticality-under-department-authorization/)
89
+ - [Palantir and Nuclear — POWER Magazine](https://www.powermag.com/groups-partnering-to-develop-ai-software-to-speed-nuclear-reactor-construction/)
90
+ - [Oklo/TerraPower/HALEU — Oil & Gas 360](https://www.oilandgas360.com/oklo-enters-strategic-collaborations-with-hexium-and-terrapower-to-launch-new-pathway-for-domestic-haleu-enrichment/)
91
+ - [Westinghouse/ITER Contract — POWER Magazine](https://www.powermag.com/westinghouse-iter-sign-180-million-contract-to-advance-nuclear-fusion/)
92
+ - [Fusion Market Outlook — Precedence Research](https://www.precedenceresearch.com/fusion-energy-market)
93
+ - [BWX Technologies (BWXT) — Investor Relations](https://www.bwxt.com/)
94
+
95
+ ---
96
+
97
+ ## 5. **Investment Hypothesis**
98
+
99
+ **Thesis**: The convergence of policy, technology (AI/data center demand), and strategic investment from leading corporates is catalyzing a new nuclear buildout cycle—especially in the US. *First-mover* advanced fission and fusion startups, US-centric enrichment supply, and key enabling technologies (digital/twin/AI/construction) stand to generate outsize returns, particularly ahead of confirmed revenue streams in the early 2030s.
100
+
101
+ - **Core Bets**:
102
+ - **Oklo** — if the price corrects, it offers a uniquely exposed pure play on the regulatory shift and DOE pilot program.
103
+ - **Centrus Energy** — levered, high-risk/high-reward play on domestic HALEU enrichment.
104
+ - **BWX Technologies** — lower-risk, steady exposure to SMR and advanced builds, and possible defense tailwinds.
105
+
106
+ - **Venture/Aggressive**:
107
+ - Track private rounds (Commonwealth Fusion, Kairos Power); watch for IPO or secondary liquidity events.
108
+ - Monitor “picks and shovels” suppliers (engineering, digital, sensing, permitting).
109
+
110
+ - **Catalysts**:
111
+ - DOE pilot selections and project starts (late 2025/2026).
112
+ - Google/Microsoft/other tech-driven PPAs or partnerships.
113
+ - US and UK regulatory acceleration or major political support.
114
+
115
+ **Risks**: Execution slippage, cost overruns, regulatory reversals, or overhyped/illiquid microcaps. Fusion commercial viability remains >5-7 years out.
116
+
117
+ ---
118
+
119
+ # **Summary Table**
120
+
121
+ | Company | Ticker | Opportunity | Moat/Signal | Notes |
122
+ |------------------------|--------|------------------------|-----------------------------------|--------------------------------------------|
123
+ | Oklo | OKLO | Early pure play SMR | DOE pilot, TerraPower partnership | SPAC, recent, monitor valuation carefully |
124
+ | Centrus Energy | LEU | HALEU enrichment | Only US-capable, DOE contracts | High volatility |
125
+ | BWX Technologies | BWXT | Established supplier | Govt defense, recurring revenue | Steady, strong FCF & fundamentals |
126
+ | Commonwealth Fusion | – | Fusion, Google backing | Tech, strategic capital | Private, pre-IPO/2nd round watching |
127
+ | Kairos Power | – | SMR, Google offtake | Major tech validation | Private, track for IPO |
128
+ | Palantir Technologies | PLTR | Nuclear AI/software | 1st big software entrant | Not a pure play, watch ecosystem effects |
129
+
130
+ ---
131
+
132
+ ## **Bottom Line:**
133
+ *The investable landscape for nuclear is evolving rapidly—value investors should focus on companies bridging policy tailwind into real commercial assets, with an eye for US-centric supply, strategic contracts, and digital enablement of an emerging nuclear buildout cycle. Small/underfunded public names could offer asymmetric re-rating as the cycle unfolds.*
data/nuclear_energy_2025-07-04.md ADDED
@@ -0,0 +1,117 @@
1
+
2
+ > Topic: `Nuclear energy`
3
+ > Articles Collected: `150`
4
+ > Generated: `2025-07-04 13:55`
5
+ >
6
+ # Nuclear Energy: Value-Investor Weekly Memo
7
+ **Week of June 30 – July 7, 2025**
8
+
9
+ ---
10
+
11
+ ## Executive Summary: Sentiment & Market Trends
12
+
13
+ This week, nuclear energy remains at the center of global and U.S. energy policy debates, buoyed by both political tailwinds (GOP-led support in legislation, state-level deployment pushes) and rising demand from AI/data center infrastructure. Nuclear is also strategically reemerging as the “clean firm” power of choice as renewables face policy setbacks, intermittency challenges, and grid reliability strains. Major tech companies and select startup activity point to accelerations in both fission (SMRs) and fusion, with corporate and government actors signaling capital and operational shifts toward advanced nuclear solutions.
14
+
15
+ Market sentiment appears mildly positive for established names but remains neutral for the broader sector. Early-stage deal flow and new executive moves hint at undervalued opportunities in uranium miners, SMR developers, and next-gen reactor supply chains, all backstopped by robust macro trends.
16
+
17
+ ---
18
+
19
+ ## 1. Key Value Signals
20
+
21
+ - **Public-Private Partnerships & Policy Tailwinds**
22
+ - New York’s governor directs pursuit of at least 1 GW of new nuclear (possible “fleet-style” deployments), signifying state-level commitment.
23
+ - GOP legislation weakens renewables but retains and even enhances support for nuclear/geothermal—improving medium-term earning prospects for nuclear-exposed businesses.
24
+ - **Tech Giant Commitments**
25
+ - Google commits to buying power from Commonwealth Fusion Systems (fusion) and from Kairos Power (SMRs/fission), underscoring long-term belief in and potential floor demand for advanced nuclear power.
26
+ - **M&A / Executive Movement**
27
+ - Ur-Energy (URG) names Matthew Gili (ex-Cameco, Energy Fuels) as President; strong management pedigree in uranium mining suggests focus on operational ramp-up and credibility for growth.
28
+ - **Private Funding & Industrial Partnerships**
29
+ - Westinghouse-ITER $180M fusion contract advances commercial pathways for fusion.
30
+ - Palantir partners with The Nuclear Company for AI deployment in nuclear construction, potentially de-risking timelines and cost overruns—key bottlenecks for new plants.
31
+ - **Uranium Financing**
32
+ - Energy Fuels (NYSE: UUUU) launches $300M ATM share offering for growth and possibly M&A, indicating possible scale-up action or acquisition-driven value.
33
+
34
+ ---
35
+
36
+ ## 2. Stocks or Startups to Watch
37
+
38
+ ### Undervalued Small Caps / Startups
39
+
40
+ - **Ur-Energy (URG)**
41
+ - **Sector**: Uranium production/mining
42
+ - **Signals**: New CEO with pedigree, North American supply play; potential for insider or institutional accumulation.
43
+ - **Fundamentals**: Historically low P/B and P/E vs. sector; improving cash flow as uranium prices trend higher.
44
+ - **Energy Fuels (UUUU)**
45
+ - **Sector**: Uranium/rare earths
46
+ - **Signals**: ATM share offering—could precede an operational expansion, M&A, or balance sheet fortification.
47
+ - **Moat**: Vertical integration and North American production base; tailwinds from potential U.S. uranium supply mandates.
48
+ - **Kairos Power**
49
+ - **Sector**: Small Modular Reactor (SMR) developer
50
+ - **Signals**: Google is a committed off-taker (500 MW); not public but watch for IPO or private rounds.
51
+ - **Moat**: Proprietary reactor and fuel tech, first-mover commercial projects.
52
+ - **Commonwealth Fusion Systems (private)**
53
+ - **Sector**: Fusion
54
+ - **Signals**: Google investing + off-take for 200 MW; implies robust institutional backing, possible pre-IPO unicorn.
55
+ - **Moat**: Leading IP/patent portfolio in commercial fusion.
56
+ - **Floating Nuclear Consortia (Europe/Mediterranean)**
57
+ - **Sector**: Maritime nuclear
58
+ - **Signals**: New industry consortium for floating plants; regulatory tailwinds in Europe; riskier but paradigm-shifting.
59
+
60
+ ### Large-Cap Defensive/Incumbent Names
61
+
62
+ - **Westinghouse (private, but watch via Brookfield Asset Management/partners)**
63
+ - **Signals**: $180M fusion contract + global SMR tenders.
64
+ - **Moat**: Deep IP/patents, established utility relationships.
65
+
66
+ #### Emerging Themes
67
+ - SMEs/startups deploying AI to compress reactor construction timelines (e.g., The Nuclear Company + Palantir).
68
+ - Uranium spot market dislocations, supply security, and U.S./Canadian production uptrend.
69
+
70
+ ---
71
+
72
+ ## 3. What Smart Money Might Be Acting On
73
+
74
+ ### Institutional Moves and VC Flows
75
+
76
+ - **Tech Company Off-Take Agreements**: Google’s long-dated power purchase agreements (PPAs) for nuclear fusion and SMRs indicate that large buyers are locking in future clean firm power, giving runway and de-risking revenue for emerging projects.
77
+ - **Leadership Talent Migration**: Appointment of high-profile operators (e.g., Matthew Gili at URG) often precedes capital flows and operational improvement.
78
+ - **Private/VC Investment**: Ongoing private fundraising in fusion (CFS/publicized; others less visible) and SMR space—potential for pre-IPO access or PIPE deals.
79
+ - **Policy-driven Lifts**: Funds with a value/cyclical tilt may be accumulating uranium miners and established SMR suppliers, expecting U.S. or European state-driven demand and pricing power.
80
+
81
+ ---
82
+
83
+ ## 4. References
84
+
85
+ - [Insider Monkey: Ur-Energy appoints Matthew Gili](https://www.insidermonkey.com/blog/ur-energy-urg-names-matthew-gili-as-president-to-support-growth-strategy-1562642/)
86
+ - [TechCrunch: Google’s data center energy use doubles; commits to SMRs & Fusion](https://techcrunch.com/2025/07/01/googles-data-center-energy-use-doubled-in-four-years/)
87
+ - [Newsweek: Google bets on Nuclear Fusion, Commonwealth Fusion Systems](https://www.newsweek.com/google-bets-nuclear-fusion-next-generation-clean-power-2091877)
88
+ - [POWER Magazine: Westinghouse & ITER fusion contract](https://www.powermag.com/westinghouse-iter-sign-180-million-contract-to-advance-nuclear-fusion/)
89
+ - [Utility Dive: NY Gov. Hochul nuclear push](https://www.utilitydive.com/news/new-york-gov-hochul-hints-at-fleet-style-approach-to-nuclear-deployments/751838/)
90
+ - [Insider Monkey: Energy Fuels ATM offering](https://www.insidermonkey.com/blog/energy-fuels-uuuu-launches-300-million-atm-share-offering-program-1562647/)
91
+ - [Marine Link: Industry consortium assesses floating nuclear](https://www.marinelink.com/news/industry-consortium-asses-floating-527616)
92
+ - [The Verge, Sky News, NPR, CleanTechnica] (multiple for macro/policy context)
93
+
94
+ ---
95
+
96
+ ## 5. Investment Hypothesis
97
+
98
+ Amid rising electricity demand from AI/data centers and the political marginalization of wind/solar, nuclear energy—particularly next-gen reactor developers, operationally leveraged uranium miners, and AI-enabled project managers—is set to benefit from both structural and cyclical forces. Near-term policy support, tech company PPA commitments, and tangible operational milestones (fusion contracts, executive talent upgrades) provide a fundamental backdrop for value investors.
99
+
100
+ **Thesis**: Select undervalued uranium miners (URG, UUUU) and actionable SMR/fusion-related plays with real partnerships or contracts (Kairos, CFS, Palantir’s nuclear construction software partners) are likely mispriced relative to long-term demand, the emergence of tech buyer power, and regulatory tailwinds. Watch for balance sheet improvement, insider activity, and capex deployment as future catalysts.
101
+
102
+ **Actionable Watchlist:**
103
+ - Ur-Energy (NYSE: URG) — ride management upgrade and uranium bull cycle
104
+ - Energy Fuels (NYSE: UUUU) — play on U.S. supply autonomy and balance sheet firepower
105
+ - Private: Kairos Power, Commonwealth Fusion Systems — monitor for IPO/news, pre-IPO funds
106
+ - Established supply chain: Westinghouse (via BAM, or tracking SMR contracts), Palantir’s nuclear ventures
107
+
108
+ ---
109
+
110
+ **Macroeconomic/Regulatory Context:**
111
+ - U.S. and European grid reliability and policy now lean “pro-nuclear” as renewables face political and technical hurdles.
112
+ - Tech-sector demand for bespoke clean, reliable baseload may outpace traditional grid growth, driving long-term PPA/contracting up for nuclear-adjacent firms.
113
+ - Early stage risk remains (especially fusion), but government cash, looser environmental reviews, and talent influx are de-risking the sector.
114
+
115
+ ---
116
+
117
+ **Discipline:** Accumulate on dips with a margin of safety; remain alert to policy reversals, cost overruns, and technology risk. Revisit on IPO news, federal incentive shifts, and real-world contract wins.
external/.DS_Store ADDED
Binary file (6.15 kB). View file
 
external/FinGPT/.github/FUNDING.yml ADDED
@@ -0,0 +1,12 @@
1
+ # These are supported funding model platforms
2
+
3
+ github: [BruceYanghy]
4
+ open_collective: # Replace with a single Open Collective username
5
+ ko_fi: # Replace with a single Ko-fi username
6
+ tidelift: # Replace with a single Tidelift platform-name/package-name e.g., npm/babel
7
+ community_bridge: # Replace with a single Community Bridge project-name e.g., cloud-foundry
8
+ liberapay: # Replace with a single Liberapay username
9
+ issuehunt: # Replace with a single IssueHunt username
10
+ otechie: # Replace with a single Otechie username
11
+ lfx_crowdfunding: # Replace with a single LFX Crowdfunding project-name e.g., cloud-foundry
12
+ custom: ['paypal.me/Hongyang']
external/FinGPT/.github/ISSUE_TEMPLATE/feature_request.md ADDED
@@ -0,0 +1,20 @@
1
+ ---
2
+ name: Feature request
3
+ about: Suggest an idea for this project
4
+ title: ''
5
+ labels: ''
6
+ assignees: ''
7
+
8
+ ---
9
+
10
+ **Is your feature request related to a problem? Please describe.**
11
+ A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]
12
+
13
+ **Describe the solution you'd like**
14
+ A clear and concise description of what you want to happen.
15
+
16
+ **Describe alternatives you've considered**
17
+ A clear and concise description of any alternative solutions or features you've considered.
18
+
19
+ **Additional context**
20
+ Add any other context or screenshots about the feature request here.
external/FinGPT/.gitignore ADDED
@@ -0,0 +1,141 @@
1
+ # Byte-compiled / optimized / DLL files
2
+ __pycache__/
3
+ *.py[cod]
4
+ *$py.class
5
+
6
+ # C extensions
7
+ *.so
8
+
9
+ # Distribution / packaging
10
+ .Python
11
+ build/
12
+ develop-eggs/
13
+ dist/
14
+ downloads/
15
+ eggs/
16
+ .eggs/
17
+ lib/
18
+ lib64/
19
+ parts/
20
+ sdist/
21
+ var/
22
+ wheels/
23
+ pip-wheel-metadata/
24
+ share/python-wheels/
25
+ *.egg-info/
26
+ .installed.cfg
27
+ *.egg
28
+ MANIFEST
29
+
30
+ # PyInstaller
31
+ # Usually these files are written by a python script from a template
32
+ # before PyInstaller builds the exe, so as to inject date/other infos into it.
33
+ *.manifest
34
+ *.spec
35
+
36
+ # Installer logs
37
+ pip-log.txt
38
+ pip-delete-this-directory.txt
39
+
40
+ # Unit test / coverage reports
41
+ htmlcov/
42
+ .tox/
43
+ .nox/
44
+ .coverage
45
+ .coverage.*
46
+ .cache
47
+ nosetests.xml
48
+ coverage.xml
49
+ *.cover
50
+ *.py,cover
51
+ .hypothesis/
52
+ .pytest_cache/
53
+
54
+ # Translations
55
+ *.mo
56
+ *.pot
57
+
58
+ # Django stuff:
59
+ *.log
60
+ local_settings.py
61
+ db.sqlite3
62
+ db.sqlite3-journal
63
+
64
+ # Flask stuff:
65
+ instance/
66
+ .webassets-cache
67
+
68
+ # Scrapy stuff:
69
+ .scrapy
70
+
71
+ # Sphinx documentation
72
+ docs/_build/
73
+
74
+ # PyBuilder
75
+ target/
76
+
77
+ # Jupyter Notebook
78
+ .ipynb_checkpoints
79
+
80
+ # IPython
81
+ profile_default/
82
+ ipython_config.py
83
+
84
+ # pyenv
85
+ .python-version
86
+
87
+ # pipenv
88
+ # According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
89
+ # However, in case of collaboration, if having platform-specific dependencies or dependencies
90
+ # having no cross-platform support, pipenv may install dependencies that don't work, or not
91
+ # install all needed dependencies.
92
+ #Pipfile.lock
93
+
94
+ # PEP 582; used by e.g. github.com/David-OConnor/pyflow
95
+ __pypackages__/
96
+
97
+ # Celery stuff
98
+ celerybeat-schedule
99
+ celerybeat.pid
100
+
101
+ # SageMath parsed files
102
+ *.sage.py
103
+
104
+ # Environments
105
+ .env
106
+ .venv
107
+ env/
108
+ venv/
109
+ ENV/
110
+ env.bak/
111
+ venv.bak/
112
+
113
+ # Spyder project settings
114
+ .spyderproject
115
+ .spyproject
116
+
117
+ # Rope project settings
118
+ .ropeproject
119
+
120
+ # mkdocs documentation
121
+ /site
122
+
123
+ # mypy
124
+ .mypy_cache/
125
+ .dmypy.json
126
+ dmypy.json
127
+
128
+ # Pyre type checker
129
+ .pyre/
130
+ .DS_Store
131
+ .idea/FinGPT.iml
132
+ *.xml
133
+
134
+ # Job scripts
135
+ fingpt/FinGPT_sentiment/instruct-FinGPT/run.sh
136
+ fingpt/FinGPT_sentiment/instruct-FinGPT/checkpoints
137
+ fingpt/FinGPT_sentiment/instruct-FinGPT/ds_results_all_10_v2_1.*
138
+ FinGPT_Training_LoRA_with_Chatglm2_6b_for_beginners.ipynb
139
+
140
+ # Benchmark data
141
+ fingpt/FinGPT_Benchmark/data/*/**
external/FinGPT/.gitpod.yml ADDED
@@ -0,0 +1,10 @@
1
+ # This configuration file was automatically generated by Gitpod.
2
+ # Please adjust to your needs (see https://www.gitpod.io/docs/introduction/learn-gitpod/gitpod-yaml)
3
+ # and commit this file to your remote git repository to share the goodness with others.
4
+
5
+ # Learn more from ready-to-use templates: https://www.gitpod.io/docs/introduction/getting-started/quickstart
6
+
7
+ tasks:
8
+ - init: pip install -r requirements.txt
9
+
10
+
external/FinGPT/.idea/.gitignore ADDED
@@ -0,0 +1,3 @@
1
+ # Default ignored files
2
+ /shelf/
3
+ /workspace.xml
external/FinGPT/CODE_OF_CONDUCT.md ADDED
@@ -0,0 +1,65 @@
1
+ # Code of Conduct
2
+
3
+ ## Our Pledge
4
+
5
+ In the interest of fostering an open and welcoming environment, we as contributors and maintainers pledge to make participation in our community a harassment-free experience for everyone, regardless of age, body size, disability, ethnicity, gender identity and expression, level of experience, education, socio-economic status, nationality, personal appearance, race, religion, or sexual identity and orientation.
6
+
7
+ ## Our Standards
8
+
9
+ Examples of behavior that contributes to creating a positive environment include:
10
+
11
+ - Using welcoming and inclusive language
12
+ - Being respectful of differing viewpoints and experiences
13
+ - Gracefully accepting constructive criticism
14
+ - Focusing on what is best for the community
15
+ - Showing empathy towards other community members
16
+
17
+ Examples of unacceptable behavior by participants include:
18
+
19
+ - The use of sexualized language or imagery and unwelcome sexual attention or advances
20
+ - Trolling, insulting/derogatory comments, and personal or political attacks
21
+ - Public or private harassment
22
+ - Publishing others' private information, such as a physical or electronic address, without explicit permission
23
+ - Other conduct that could reasonably be considered inappropriate in a professional setting
24
+
25
+ ## Our Responsibilities
26
+
27
+ We as project maintainers are responsible for clarifying the standards of acceptable behavior and are expected to take appropriate and fair corrective action in response to any instances of unacceptable behavior.
28
+
29
+ We have the right and responsibility to remove, edit, or reject comments, commits, code, wiki edits, issues, and other contributions that are not aligned with this Code of Conduct, or to ban temporarily or permanently any contributor for other behaviors that they deem inappropriate, threatening, offensive, or harmful.
30
+
31
+ ## Scope
32
+
33
+ This Code of Conduct applies both within project spaces and in public spaces when an individual is representing the project or its community. Examples of representing a project or community include using an official project email address, posting via an official social media account, or acting as an appointed representative at an online or offline event.
34
+
35
+ ## Enforcement
36
+
37
+ Instances of abusive, harassing, or otherwise unacceptable behavior may be reported by contacting the project team. All complaints will be reviewed and investigated and will result in a response that is deemed necessary and appropriate to the circumstances. The project team is obligated to maintain confidentiality with regard to the reporter of an incident.
38
+
39
+ ## Enforcement Guidelines
40
+
41
+ Community managers will follow these Community Impact Guidelines in determining the consequences for any action they deem in violation of this Code of Conduct:
42
+
43
+ ### 1. Correction
44
+
45
+ **Community Impact**: Use of inappropriate language or other behavior deemed unprofessional or unwelcome in the community.
46
+
47
+ **Consequence**: A private, written warning from community leaders, providing clarity around the nature of the violation and an explanation of why the behavior was inappropriate. A public apology may be requested.
48
+
49
+ ### 2. Warning
50
+
51
+ **Community Impact**: A violation through a single incident or series of actions.
52
+
53
+ **Consequence**: A warning with consequences for continued behavior. No interaction with the people involved, including unsolicited interaction with those enforcing the Code of Conduct, for a specified period of time. This includes avoiding interactions in community spaces as well as external channels like social media. Violating these terms may lead to a temporary or permanent ban.
54
+
55
+ ### 3. Temporary Ban
56
+
57
+ **Community Impact**: A serious violation of community standards, including sustained inappropriate behavior.
58
+
59
+ **Consequence**: A temporary ban from any sort of interaction or public communication with the community for a specified period of time. No public or private interaction with the people involved, including unsolicited interaction with those enforcing the Code of Conduct, is allowed during this period. Violating these terms may lead to a permanent ban.
60
+
61
+ ### 4. Permanent Ban
62
+
63
+ **Community Impact**: Demonstrating a pattern of violation of community standards, including sustained inappropriate behavior, harassment of an individual, or aggression toward or disparagement of classes of individuals.
64
+
65
+ **Consequence**: A permanent ban from any sort of public interaction within the community.
external/FinGPT/CONTRIBUTING.md ADDED
@@ -0,0 +1,68 @@
1
+ # FinGPT Contribution Guidelines 🚀
2
+
3
+ Welcome to the FinGPT project! We are thrilled to have you here 🌟. Your contributions are instrumental in shaping the intersection of finance and AI, making it even more amazing. 📈✨ Let's embark on this journey together.
4
+
5
+ ## Code of Conduct 🤝
6
+
7
+ Before diving in, please take a moment to review our Code of Conduct. It sets the tone for our community and emphasizes the importance of respect and inclusivity. [Read the Code of Conduct](CODE_OF_CONDUCT.md).
8
+
9
+ ## Contribution Types 🦠🚀📚
10
+
11
+ ### Bug Reports 🐞
12
+
13
+ If you encounter any bugs during your journey, don't fret! We have the Bug Busters ready to help. To report a bug, follow these steps:
14
+
15
+ 1. Check if the bug has already been reported in [GitHub Issues](https://github.com/AI4Finance-Foundation/FinGPT/issues).
16
+ 2. If it's a new bug, open a new issue with a concise description and provide detailed, step-by-step instructions to reproduce it.
17
+
18
+ ### Feature Requests 💡
19
+
20
+ Do you have visionary ideas that could elevate FinGPT? Share them with us! When submitting a feature request, be sure to include:
21
+
22
+ 1. A clear and vivid description of the feature you envision.
23
+ 2. Discuss the impact and potential benefits.
24
+
25
+ ### Documentation 📖
26
+
27
+ For those with a penchant for words and an eye for detail, consider contributing to our documentation. You can make the documentation more enlightening for everyone. 🧙📜
28
+
29
+ ### Code Contributions 💻
30
+
31
+ Calling all AI heroes and wizards! You are the secret sauce behind the FinGPT project. To contribute code and save the financial world:
32
+
33
+ 1. **Fork the Repository**: Click the "Fork" button on the top right of the repository's page. This creates your own copy of the project.
34
+
35
+ 2. **Clone your Fork**: In your terminal, use the following command to clone your fork to your local machine:
36
+
37
+ ```bash
38
+ git clone https://github.com/YourUsername/FinGPT.git
39
+ ```
40
+
41
+ 3. **Create a New Branch**: Make a new branch for your adventures. This helps keep the main codebase clean:
42
+
43
+ ```bash
44
+ git checkout -b your-feature-branch
45
+ ```
46
+
47
+ 4. **Work Your Magic**: Implement your code or changes.
48
+
49
+ 5. **Commit and Push**: Use these commands to commit your changes and push them to your fork:
50
+
51
+ ```bash
52
+ git commit -m "Your commit message"
53
+ git push origin your-feature-branch
54
+ ```
55
+
56
+ 6. **Create a Pull Request**: Go to the original FinGPT repository and click "New Pull Request." Select your branch, write a description, and submit.
57
+
58
+ ## Seeking Assistance ❓🙋‍♀️
59
+
60
+ If you find yourself stuck or have questions, remember that our support team is your sidekick. Don't hesitate to reach out. We are here to guide you through the process and provide any necessary assistance.
61
+
62
+ ## Getting Started 🚀🚀
63
+
64
+ Are you ready to make a mark on the FinGPT project? Grab your cape and join us in our mission to make finance and AI even more incredible. Your contributions are the magic that fuels our journey.
65
+
66
+ 🔗 [FinGPT GitHub Repository](https://github.com/AI4Finance-Foundation/FinGPT)
67
+
68
+ ### May your contributions be as amazing as you are! 🌌🚀
external/FinGPT/FinGPT_ Training with LoRA and Meta-Llama-3-8B.ipynb ADDED
The diff for this file is too large to render. See raw diff
 
external/FinGPT/FinGPT_Inference_Llama2_13B_falcon_7B_for_Beginners.ipynb ADDED
The diff for this file is too large to render. See raw diff
 
external/FinGPT/FinGPT_Training_LoRA_with_ChatGLM2_6B_for_Beginners_v2-2.ipynb ADDED
The diff for this file is too large to render. See raw diff
 
external/FinGPT/LICENSE ADDED
@@ -0,0 +1,21 @@
1
+ MIT License
2
+
3
+ Copyright (c) 2024 AI4Finance Foundation Inc.
4
+
5
+ Permission is hereby granted, free of charge, to any person obtaining a copy
6
+ of this software and associated documentation files (the "Software"), to deal
7
+ in the Software without restriction, including without limitation the rights
8
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9
+ copies of the Software, and to permit persons to whom the Software is
10
+ furnished to do so, subject to the following conditions:
11
+
12
+ The above copyright notice and this permission notice shall be included in all
13
+ copies or substantial portions of the Software.
14
+
15
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21
+ SOFTWARE.
external/FinGPT/MANIFEST.in ADDED
@@ -0,0 +1 @@
1
+ include fingpt/FinGPT_Benchmark/benchmarks/sentiment_templates.txt
external/FinGPT/README.md ADDED
@@ -0,0 +1,384 @@
1
+ <div align="center">
2
+ <img align="center" width="30%" alt="image" src="https://github.com/AI4Finance-Foundation/FinGPT/assets/31713746/e0371951-1ce1-488e-aa25-0992dafcc139">
3
+ </div>
4
+
5
+ # FinGPT: Open-Source Financial Large Language Models
6
+ [![Downloads](https://static.pepy.tech/badge/fingpt)](https://pepy.tech/project/fingpt)
7
+ [![Downloads](https://static.pepy.tech/badge/fingpt/week)](https://pepy.tech/project/fingpt)
8
+ [![Python 3.6](https://img.shields.io/badge/python-3.6-blue.svg)](https://www.python.org/downloads/release/python-360/)
9
+ [![PyPI](https://img.shields.io/pypi/v/fingpt.svg)](https://pypi.org/project/fingpt/)
10
+ ![License](https://img.shields.io/github/license/AI4Finance-Foundation/fingpt.svg?color=brightgreen)
11
+ ![](https://img.shields.io/github/issues-raw/AI4Finance-Foundation/fingpt?label=Issues)
12
+ ![](https://img.shields.io/github/issues-closed-raw/AI4Finance-Foundation/fingpt?label=Closed+Issues)
13
+ ![](https://img.shields.io/github/issues-pr-raw/AI4Finance-Foundation/fingpt?label=Open+PRs)
14
+ ![](https://img.shields.io/github/issues-pr-closed-raw/AI4Finance-Foundation/fingpt?label=Closed+PRs)
15
+
16
+ <div align="center">
17
+ <img align="center" src=figs/logo_transparent_background.png width="40%"/>
18
+ </div>
19
+
20
+ Let us not expect Wall Street to open-source LLMs or open up APIs, given FinTech institutions' internal regulations and policies.
21
+
22
+ [Blueprint of FinGPT](https://arxiv.org/abs/2306.06031)
23
+
24
+ <https://huggingface.co/FinGPT>
25
+
26
+ [![](https://dcbadge.vercel.app/api/server/trsr8SXpW5)](https://discord.gg/trsr8SXpW5)
27
+
28
+ ![Visitors](https://api.visitorbadge.io/api/VisitorHit?user=AI4Finance-Foundation&repo=FinGPT&countColor=%23B17A)
29
+
30
+
31
+ ## What's New:
32
+ - [Model Release] Nov, 2023: We release [FinGPT-Forecaster](https://github.com/AI4Finance-Foundation/FinGPT/tree/master/fingpt/FinGPT_Forecaster)! 🔥[Demo](https://huggingface.co/spaces/FinGPT/FinGPT-Forecaster), [Medium Blog](https://medium.datadriveninvestor.com/introducing-fingpt-forecaster-the-future-of-robo-advisory-services-50add34e3d3c) & [Model](https://huggingface.co/FinGPT/fingpt-forecaster_dow30_llama2-7b_lora) are available on Huggingface🤗!
33
+ - [Paper Acceptance] Oct, 2023: ["FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets"](https://arxiv.org/abs/2310.04793) is accepted🎉 by [Instruction Workshop](https://an-instructive-workshop.github.io/) @ NeurIPS 2023
34
+ - [Paper Acceptance] Oct, 2023: ["FinGPT: Democratizing Internet-scale Data for Financial Large Language Models"](https://arxiv.org/abs/2307.10485) is accepted🎉 by [Instruction Workshop](https://an-instructive-workshop.github.io/) @ NeurIPS 2023
35
+ - [Model Release] Oct, 2023: We release the [financial multi-task LLMs](https://huggingface.co/FinGPT) 🔥 produced when evaluating base-LLMs on [FinGPT-Benchmark](https://github.com/AI4Finance-Foundation/FinGPT/tree/master/fingpt/FinGPT_Benchmark)
36
+ - [Paper Acceptance] Sep, 2023: ["Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language Models"](https://arxiv.org/abs/2310.04027) is accepted🎉 by [ACM International Conference on AI in Finance (ICAIF-23)](https://ai-finance.org/icaif-23-accepted-papers/)
37
+ - [Model Release] Aug, 2023: We release the [financial sentiment analysis model](https://huggingface.co/FinGPT/fingpt-sentiment_llama2-13b_lora) 🔥
38
+ - [Paper Acceptance] Jul, 2023: ["Instruct-FinGPT: Financial Sentiment Analysis by Instruction Tuning of General-Purpose Large Language Models"](https://arxiv.org/abs/2306.12659) is accepted🎉 by [FinLLM 2023](https://finllm.github.io/workshop/#/fcb)@IJCAI 2023
39
+ - [Paper Acceptance] Jul, 2023: ["FinGPT: Open-Source Financial Large Language Models"](https://arxiv.org/abs/2306.06031) is accepted🎉 by [FinLLM 2023](https://finllm.github.io/workshop/#/fcb)@IJCAI 2023
40
+ - [Medium Blog] Jun 2023: [FinGPT: Powering the Future of Finance with 20 Cutting-Edge Applications](https://medium.datadriveninvestor.com/fingpt-powering-the-future-of-finance-with-20-cutting-edge-applications-7c4d082ad3d8)
41
+
42
+ ## Why FinGPT?
43
+
44
+ 1). Finance is highly dynamic. [BloombergGPT](https://arxiv.org/abs/2303.17564) trained an LLM on a mixture of finance data and general-purpose data, which took about 53 days at a cost of around **$3M**. It is costly to retrain an LLM like BloombergGPT every month or every week, so lightweight adaptation is highly favorable. FinGPT can be fine-tuned swiftly to incorporate new data (the cost falls significantly, to less than **$300 per fine-tuning**).
45
+
46
+ 2). Democratizing Internet-scale financial data is critical, e.g., allowing timely updates of the model (monthly or weekly updates) using an automatic data curation pipeline. BloombergGPT has privileged data access and APIs, while FinGPT presents a more accessible alternative. It prioritizes lightweight adaptation, leveraging the best available open-source LLMs.
47
+
48
+ 3). The key technology is "RLHF (Reinforcement learning from human feedback)", which is missing in BloombergGPT. RLHF enables an LLM to learn individual preferences (risk-aversion level, investing habits, personalized robo-advisor, etc.), which is the "secret" ingredient of ChatGPT and GPT-4.
49
+
50
+
51
+ ### Milestone of AI Robo-Advisor: FinGPT-Forecaster
52
+
53
+ Try the latest released FinGPT-Forecaster demo at our [HuggingFace Space](https://huggingface.co/spaces/FinGPT/FinGPT-Forecaster)
54
+
55
+ The dataset for FinGPT-Forecaster: https://huggingface.co/datasets/FinGPT/fingpt-forecaster-dow30-202305-202405
56
+
57
+ ![demo_interface](fingpt/FinGPT_Forecaster/figs/interface.png)
58
+
59
+ Enter the following inputs:
60
+
61
+ 1) ticker symbol (e.g. AAPL, MSFT, NVDA)
62
+ 2) the day from which you want the prediction to happen (yyyy-mm-dd)
63
+ 3) the number of past weeks for which market news is retrieved
64
+ 4) whether to add the latest basic financials as additional information
65
+
66
+ Click Submit, and you'll receive a well-rounded analysis of the company and a prediction of next week's stock price movement!
67
+
68
+ For detailed and more customized implementation, please refer to [FinGPT-Forecaster](https://github.com/AI4Finance-Foundation/FinGPT/tree/master/fingpt/FinGPT_Forecaster)
69
+
70
+
71
+ ## FinGPT Demos:
72
+
73
+ ### Current State-of-the-arts for Financial Sentiment Analysis
74
+
75
+ * [FinGPT V3 (Updated on 10/12/2023)](./fingpt)
76
+
77
+ * What's new: **The best FinGPT for sentiment analysis that can be trained and run for inference on a single RTX 3090, performing even better than GPT-4 and fine-tuned ChatGPT.**
78
+
79
+ * The [FinGPT v3](https://huggingface.co/FinGPT/fingpt-sentiment_llama2-13b_lora) series are LLMs fine-tuned with the LoRA method on news and tweets sentiment analysis datasets, achieving the best scores on most financial sentiment analysis benchmarks at low cost.
80
+
81
+ * FinGPT v3.3 uses llama2-13b as its base model; FinGPT v3.2 uses llama2-7b; FinGPT v3.1 uses chatglm2-6b.
82
+
83
+ * Benchmark Results:
84
+
85
+ * | Weighted F1 | FPB | FiQA-SA | TFNS | NWGI | Devices | Time | Cost |
86
+ | ------------------------------------------------------------ | :-------: | :-------: | :-------: | :-------: | :----------------: | :---------: | :------------: |
87
+ | [FinGPT v3.3](https://huggingface.co/FinGPT/fingpt-sentiment_llama2-13b_lora)| **0.882** | 0.874 | **0.903** | **0.643** | 1 × RTX 3090 | 17.25 hours | $17.25 |
88
+ | FinGPT v3.2| 0.850 | 0.860 | 0.894 | 0.636 | 1 × A100 | 5.5 hours | $ 22.55 |
89
+ | FinGPT v3.1| 0.855 | 0.850 | 0.875 | 0.642 | 1 × A100 | 5.5 hours | $ 22.55 |
90
+ | FinGPT (8bit) | 0.855 | 0.847 | 0.879 | 0.632 | 1 × RTX 3090 | 6.47 hours | $ 6.47 |
91
+ | FinGPT (QLoRA) | 0.777 | 0.752 | 0.828 | 0.583 | 1 × RTX 3090 | 4.15 hours | $ 4.15 |
92
+ | OpenAI Fine-tune | 0.878 | **0.887** | 0.883 | - | - | - | - |
93
+ | GPT-4 | 0.833 | 0.630 | 0.808 | - | - | - | - |
94
+ | FinBERT | 0.880 | 0.596 | 0.733 | 0.538 | 4 × NVIDIA K80 GPU | - | - |
95
+ | Llama2-7B | 0.390 | 0.800 | 0.296 | 0.503 | 2048 × A100 | 21 days | $ 4.23 million |
96
+ | BloombergGPT | 0.511 | 0.751 | - | - | 512 × A100 | 53 days | $ 2.67 million |
97
+
98
+ **Cost per GPU hour.** For **A100 GPUs**, the AWS p4d.24xlarge instance, equipped with 8 A100 GPUs, is used as a benchmark to estimate costs; note that BloombergGPT also used p4d.24xlarge. As of July 11, 2023, the hourly rate for this instance stands at $32.773, so the estimated cost per GPU hour comes to $32.773 divided by 8, approximately **$4.10**, which we use as the reference unit price (1 GPU hour). **BloombergGPT estimated cost = 512 GPUs × 53 days × 24 hours = 651,264 GPU hours; at $4.10 per GPU hour, about $2,670,182.40**. For the **RTX 3090**, we assume a cost per hour of approximately **$1.00**, which is actually higher than rates available from platforms like vast.ai.
99
+
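+ The arithmetic behind these estimates can be reproduced in a few lines; this is a sketch using the figures quoted above, not live cloud prices:
+
+ ```python
+ # Reproduce the cost estimates above (quoted figures, not live prices).
+ a100_rate = 32.773 / 8            # p4d.24xlarge hourly rate split across 8 GPUs
+ gpu_hours = 512 * 53 * 24         # BloombergGPT: 512 A100s for 53 days
+ print(f"Cost per A100 GPU hour: ${a100_rate:.2f}")          # ~= $4.10
+ print(f"BloombergGPT estimate:  ${gpu_hours * 4.10:,.2f}")  # $2,670,182.40
+
+ rtx3090_rate = 1.00               # assumed $1.00/hour for an RTX 3090
+ print(f"FinGPT v3.3 estimate:   ${17.25 * rtx3090_rate:,.2f}")  # $17.25
+ ```
+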
100
+ * Reproduce the results by running the [benchmarks](./fingpt/FinGPT_Sentiment_Analysis_v3/benchmark/benchmarks.ipynb); a detailed tutorial is on the way.
101
+ * Finetune your own FinGPT v3 model with the LoRA method on a single RTX 3090 with this [notebook](./fingpt/FinGPT_Sentiment_Analysis_v3/training_8bit/train_Llama2_13B.ipynb) in 8bit or this [notebook](./fingpt/FinGPT_Sentiment_Analysis_v3/training_int4/train.ipynb) in int4 (QLoRA); a generic LoRA setup is sketched below.
102
+
103
+ * [FinGPT V1](./fingpt)
104
+ + **FinGPT by finetuning ChatGLM2 / Llama2 with LoRA with the market-labeled data for the Chinese Market**
105
+
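+ For orientation, here is a generic sketch of the kind of 8-bit LoRA setup the v3 notebooks use; the base model, rank, and target modules below are illustrative assumptions, not the notebooks' exact hyperparameters:
+
+ ```python
+ # Generic 8-bit LoRA fine-tuning setup (illustrative hyperparameters).
+ from transformers import AutoModelForCausalLM, BitsAndBytesConfig
+ from peft import LoraConfig, get_peft_model
+
+ model = AutoModelForCausalLM.from_pretrained(
+     "meta-llama/Llama-2-13b-hf",                      # assumed base checkpoint
+     quantization_config=BitsAndBytesConfig(load_in_8bit=True),
+     device_map="auto",
+ )
+ lora_config = LoraConfig(
+     r=8, lora_alpha=32, lora_dropout=0.1,             # illustrative values
+     target_modules=["q_proj", "v_proj"],              # typical Llama attention targets
+     task_type="CAUSAL_LM",
+ )
+ model = get_peft_model(model, lora_config)
+ model.print_trainable_parameters()  # only the small adapters are trainable
+ ```
+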
106
+ ## Instruction Tuning Datasets and Models
107
+ The datasets we used, and the **multi-task financial LLM** models are available at <https://huggingface.co/FinGPT>
108
+
109
+ [Our Code](https://github.com/AI4Finance-Foundation/FinGPT/tree/master/fingpt/FinGPT_Benchmark)
110
+
111
+ | Datasets | Train Rows | Test Rows |Description |
112
+ | --------- | ----------------- | ------------ | --------------------- |
113
+ | [fingpt-sentiment-train](https://huggingface.co/datasets/FinGPT/fingpt-sentiment-train) | 76.8K | N/A|Sentiment Analysis Training Instructions |
114
+ | [fingpt-finred](https://huggingface.co/datasets/FinGPT/fingpt-finred)| 27.6k | 5.11k | Financial Relation Extraction Instructions |
115
+ | [fingpt-headline](https://huggingface.co/datasets/FinGPT/fingpt-headline) | 82.2k | 20.5k | Financial Headline Analysis Instructions|
116
+ | [fingpt-ner](https://huggingface.co/datasets/FinGPT/fingpt-ner) | 511 | 98 | Financial Named-Entity Recognition Instructions|
117
+ | [fingpt-fiqa_qa](https://huggingface.co/datasets/FinGPT/fingpt-fiqa_qa) | 17.1k | N/A | Financial Q&A Instructions|
118
+ | [fingpt-fineval](https://huggingface.co/datasets/FinGPT/fingpt-fineval) | 1.06k | 265 | Chinese Multiple-Choice Questions Instructions|
119
+
120
+ Multi-task financial LLM models:
121
+ ```python
122
+ demo_tasks = [
123
+ 'Financial Sentiment Analysis',
124
+ 'Financial Relation Extraction',
125
+ 'Financial Headline Classification',
126
+ 'Financial Named Entity Recognition',]
127
+ demo_inputs = [
128
+ "Glaxo's ViiV Healthcare Signs China Manufacturing Deal With Desano",
129
+ "Apple Inc. Chief Executive Steve Jobs sought to soothe investor concerns about his health on Monday, saying his weight loss was caused by a hormone imbalance that is relatively simple to treat.",
130
+ 'gold trades in red in early trade; eyes near-term range at rs 28,300-28,600',
131
+ 'This LOAN AND SECURITY AGREEMENT dated January 27 , 1999 , between SILICON VALLEY BANK (" Bank "), a California - chartered bank with its principal place of business at 3003 Tasman Drive , Santa Clara , California 95054 with a loan production office located at 40 William St ., Ste .',]
132
+ demo_instructions = [
133
+ 'What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}.',
134
+ 'Given phrases that describe the relationship between two words/phrases as options, extract the word/phrase pair and the corresponding lexical relationship between them from the input text. The output format should be "relation1: word1, word2; relation2: word3, word4". Options: product/material produced, manufacturer, distributed by, industry, position held, original broadcaster, owned by, founded by, distribution format, headquarters location, stock exchange, currency, parent organization, chief executive officer, director/manager, owner of, operator, member of, employer, chairperson, platform, subsidiary, legal form, publisher, developer, brand, business division, location of formation, creator.',
135
+ 'Does the news headline talk about price going up? Please choose an answer from {Yes/No}.',
136
+ 'Please extract entities and their types from the input sentence, entity types should be chosen from {person/organization/location}.',]
137
+ ```
138
+
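+ As a rough illustration (not the repository's official inference script), these demo lists could be fed to one of the multi-task LoRA models in the table below; the base checkpoint, prompt template, and generation settings here are assumptions:
+
+ ```python
+ # Minimal inference sketch: base model + multi-task LoRA adapter.
+ import torch
+ from transformers import AutoTokenizer, AutoModelForCausalLM
+ from peft import PeftModel
+
+ base = "meta-llama/Llama-2-7b-hf"            # assumed base checkpoint
+ adapter = "FinGPT/fingpt-mt_llama2-7b_lora"  # multi-task LoRA weights
+
+ tokenizer = AutoTokenizer.from_pretrained(base)
+ model = AutoModelForCausalLM.from_pretrained(
+     base, torch_dtype=torch.float16, device_map="auto")
+ model = PeftModel.from_pretrained(model, adapter)
+ model.eval()
+
+ # Run the first demo task: financial sentiment analysis.
+ prompt = (f"Instruction: {demo_instructions[0]}\n"
+           f"Input: {demo_inputs[0]}\n"
+           f"Answer: ")
+ inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
+ with torch.no_grad():
+     out = model.generate(**inputs, max_new_tokens=16)
+ print(tokenizer.decode(out[0], skip_special_tokens=True))
+ ```
+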
139
+ | Models | Description | Function |
140
+ | --------- | --------------------- |---------------- |
141
+ | [fingpt-mt_llama2-7b_lora](https://huggingface.co/FinGPT/fingpt-mt_llama2-7b_lora)| Fine-tuned Llama2-7b model with LoRA | Multi-Task |
142
+ | [fingpt-mt_falcon-7b_lora](https://huggingface.co/FinGPT/fingpt-mt_falcon-7b_lora)| Fine-tuned falcon-7b model with LoRA | Multi-Task |
143
+ | [fingpt-mt_bloom-7b1_lora](https://huggingface.co/FinGPT/fingpt-mt_bloom-7b1_lora) | Fine-tuned bloom-7b1 model with LoRA | Multi-Task |
144
+ | [fingpt-mt_mpt-7b_lora](https://huggingface.co/FinGPT/fingpt-mt_mpt-7b_lora) | Fine-tuned mpt-7b model with LoRA | Multi-Task |
145
+ | [fingpt-mt_chatglm2-6b_lora](https://huggingface.co/FinGPT/fingpt-mt_chatglm2-6b_lora) | Fine-tuned chatglm-6b model with LoRA | Multi-Task |
146
+ | [fingpt-mt_qwen-7b_lora](https://huggingface.co/FinGPT/fingpt-mt_qwen-7b_lora) | Fine-tuned qwen-7b model with LoRA | Multi-Task |
147
+ | [fingpt-sentiment_llama2-13b_lora](https://huggingface.co/FinGPT/fingpt-sentiment_llama2-13b_lora) | Fine-tuned llama2-13b model with LoRA | Single-Task |
148
+ | [fingpt-forecaster_dow30_llama2-7b_lora](https://huggingface.co/FinGPT/fingpt-forecaster_dow30_llama2-7b_lora) | Fine-tuned llama2-7b model with LoRA | Single-Task |
149
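+
+ As a quick orientation, here is a minimal inference sketch (not an official snippet from this repo) showing how one of these adapters can be applied on top of its base model with PEFT, reusing the `demo_tasks`/`demo_inputs`/`demo_instructions` lists above; it assumes access to the gated `meta-llama/Llama-2-7b-hf` weights and a GPU.
+
+ ```python
+ import torch
+ from transformers import AutoTokenizer, AutoModelForCausalLM
+ from peft import PeftModel
+
+ base = "meta-llama/Llama-2-7b-hf"  # gated; assumes you have been granted access
+ tokenizer = AutoTokenizer.from_pretrained(base)
+ model = AutoModelForCausalLM.from_pretrained(base, torch_dtype=torch.float16, device_map="auto")
+ model = PeftModel.from_pretrained(model, "FinGPT/fingpt-mt_llama2-7b_lora").eval()
+
+ # Same Instruction/Input/Answer prompt format used throughout the benchmark code.
+ prompt = f"Instruction: {demo_instructions[0]}\nInput: {demo_inputs[0]}\nAnswer: "
+ inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
+ with torch.no_grad():
+     out = model.generate(**inputs, max_new_tokens=16)
+ print(tokenizer.decode(out[0], skip_special_tokens=True).split("Answer: ")[-1])
+ ```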
+
+
+ ## Tutorials
+ [[Training] Beginner’s Guide to FinGPT: Training with LoRA and ChatGLM2–6B One Notebook, $10 GPU](https://byfintech.medium.com/beginners-guide-to-fingpt-training-with-lora-chatglm2-6b-9eb5ace7fe99)
+
+ ## Understanding FinGPT: An Educational Blog Series
+ + [FinGPT: Powering the Future of Finance with 20 Cutting-Edge Applications](https://medium.datadriveninvestor.com/fingpt-powering-the-future-of-finance-with-20-cutting-edge-applications-7c4d082ad3d8)
+ + [FinGPT I: Why We Built the First Open-Source Large Language Model for Finance](https://medium.datadriveninvestor.com/fingpt-i-why-we-built-the-first-open-source-large-language-model-for-finance-c01b5517ca)
+ + [FinGPT II: Cracking the Financial Sentiment Analysis Task Using Instruction Tuning of General-Purpose Large Language Models](https://medium.datadriveninvestor.com/fingpt-ii-cracking-the-financial-sentiment-analysis-task-using-instruction-tuning-of-3333bce428c4)
+
+
+ ## FinGPT Ecosystem
+ ### FinGPT embraces a full-stack framework for FinLLMs with five layers:
+ 1. **Data source layer**: This layer assures comprehensive market coverage, addressing the temporal sensitivity of financial data through real-time information capture.
+ 2. **Data engineering layer**: Primed for real-time NLP data processing, this layer tackles the inherent challenges of high temporal sensitivity and low signal-to-noise ratio in financial data.
+ 3. **LLMs layer**: Focusing on a range of fine-tuning methodologies such as LoRA, this layer mitigates the highly dynamic nature of financial data, ensuring the model’s relevance and accuracy.
+ 4. **Task layer**: This layer is responsible for executing fundamental tasks. These tasks serve as the benchmarks for performance evaluations and cross-comparisons in the realm of FinLLMs.
+ 5. **Application layer**: Showcasing practical applications and demos, this layer highlights the potential capabilities of FinGPT in the financial sector.
+
+ * FinGPT Framework: Open-Source Financial Large Language Models
+
+ <div align="center">
+ <img align="center" src="figs/FinGPT_framework_20240301.png">
+ </div>
+
+ * [FinGPT-RAG](https://github.com/AI4Finance-Foundation/FinGPT/tree/master/fingpt/FinGPT_RAG): We present a retrieval-augmented large language model framework designed specifically for financial sentiment analysis; it optimizes information depth and context through external knowledge retrieval, thereby ensuring nuanced predictions. A toy retrieve-then-predict sketch follows below.
+
+ <div align="center">
+ <img align="center" src="figs/FinGPT_RAG_framework.png">
+ </div>
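+
+ A deliberately tiny retrieve-then-predict sketch (not the FinGPT-RAG implementation): it ranks a small in-memory corpus by naive keyword overlap and prepends the best match as context for a sentiment prompt. The corpus and the scoring are illustrative placeholders only.
+
+ ```python
+ corpus = [  # stand-in for an external financial knowledge source
+     "Desano is a Chinese pharmaceutical manufacturer.",
+     "ViiV Healthcare is majority-owned by GSK.",
+ ]
+
+ def retrieve(query, docs, k=1):
+     # Rank documents by keyword overlap with the query (toy retriever).
+     q = set(query.lower().split())
+     return sorted(docs, key=lambda d: len(q & set(d.lower().split())), reverse=True)[:k]
+
+ def build_prompt(headline):
+     context = " ".join(retrieve(headline, corpus))
+     return (f"Context: {context}\n"
+             "Instruction: What is the sentiment of this news? "
+             "Please choose an answer from {negative/neutral/positive}.\n"
+             f"Input: {headline}\nAnswer: ")
+
+ print(build_prompt("Glaxo's ViiV Healthcare Signs China Manufacturing Deal With Desano"))
+ ```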
+
+ * [FinGPT-FinNLP](https://github.com/AI4Finance-Foundation/FinNLP): FinNLP provides a playground for everyone interested in LLMs and NLP in finance. It offers full pipelines for LLM training and fine-tuning in the financial domain; the full architecture is shown in the picture below. Detailed code and an introduction can be found [here](https://github.com/AI4Finance-Foundation/FinNLP), or you may refer to the [wiki](https://ai4finance-foundation.github.io/FinNLP/).
+
+ <div align="center">
+ <img align="center" src="figs/FinGPT_FinNLP_data_source.png">
+ </div>
+
+ * [FinGPT-Benchmark](https://github.com/AI4Finance-Foundation/FinGPT/tree/master/fingpt/FinGPT_Benchmark): We introduce a novel instruction-tuning paradigm optimized for open-source Large Language Models (LLMs) in finance, enhancing their adaptability to diverse financial datasets while also facilitating cost-effective, systematic benchmarking across task-specific, multi-task, and zero-shot instruction-tuning tasks.
+
+ <div align="center">
+ <img align="center" src="figs/FinGPT_Benchmark_20231110.png">
+ </div>
+
+
+ ## Open-Source Base Models used in the LLMs layer of FinGPT
+ * Feel free to contribute more open-source base models tailored for various language-specific financial markets.
+
+ | Base Model | Pretraining Tokens | Context Length | Model Advantages | Model Size | Experiment Results | Applications |
+ | ---- | ---- | ---- | ---- | ---- | ---- | ---- |
+ | [Llama-2](https://github.com/facebookresearch/llama) | 2 Trillion | 4096 | Llama-2 excels on English-based market data | [llama-2-7b](https://huggingface.co/meta-llama/Llama-2-7b-hf) and [Llama-2-13b](https://huggingface.co/meta-llama/Llama-2-13b-hf) | Llama-2 consistently shows superior fine-tuning results | Financial Sentiment Analysis, Robo-Advisor |
+ | [Falcon](https://huggingface.co/tiiuae/falcon-7b) | 1,500B | 2048 | Maintains high-quality results while being more resource-efficient | [falcon-7b](https://huggingface.co/tiiuae/falcon-7b) | Good for English market data | Financial Sentiment Analysis |
+ | [MPT](https://github.com/mosaicml/llm-foundry) | 1T | 2048 | MPT models can be trained with high throughput efficiency and stable convergence | [mpt-7b](https://huggingface.co/mosaicml/mpt-7b) | Good for English market data | Financial Sentiment Analysis |
+ | [Bloom](https://github.com/bigscience-workshop/bigscience/tree/master/train/tr11-176B-ml#readme) | 366B | 2048 | World’s largest open multilingual language model | [bloom-7b1](https://huggingface.co/bigscience/bloom-7b1) | Good for English market data | Financial Sentiment Analysis |
+ | [ChatGLM2](https://github.com/THUDM/ChatGLM2-6B) | 1.4T | 32K | Exceptional capability for Chinese language expression | [chatglm2-6b](https://huggingface.co/THUDM/chatglm2-6b) | Shows prowess for Chinese market data | Financial Sentiment Analysis, Financial Report Summary |
+ | [Qwen](https://github.com/QwenLM/Qwen-7B) | 2.2T | 8K | Fast response and high accuracy | [qwen-7b](https://huggingface.co/tangger/Qwen-7B-Chat) | Effective for Chinese market data | Financial Sentiment Analysis |
+ | [InternLM](https://github.com/InternLM/InternLM) | 1.8T | 8K | Can flexibly and independently construct workflows | [internlm-7b](https://huggingface.co/internlm/internlm-7b) | Effective for Chinese market data | Financial Sentiment Analysis |
+
+ * Benchmark results for the above open-source base models on the financial sentiment analysis task, using the same instruction template for SFT (LoRA):
+
+ | Weighted F1/Acc | Llama2 | Falcon | MPT | Bloom | ChatGLM2 | Qwen | InternLM |
+ | --------- | ----------------- | ------------ | --------------------- | ---------------- | --------------- | ----------------- | ----------------- |
+ | [FPB](https://huggingface.co/datasets/financial_phrasebank) | 0.863/0.863 | 0.846/0.849 | **0.872**/**0.872** | 0.810/0.810 | 0.850/0.849 | 0.854/0.854 | 0.709/0.714 |
+ | [FiQA-SA](https://huggingface.co/datasets/pauri32/fiqa-2018) | **0.871**/0.855 | 0.840/0.811 | 0.863/0.844 | 0.771/0.753 | 0.864/**0.862** | 0.867/0.851 | 0.679/0.687 |
+ | [TFNS](https://huggingface.co/datasets/zeroshot/twitter-financial-news-sentiment) | 0.896/0.895 | 0.893/0.893 | **0.907**/**0.907** | 0.840/0.840 | 0.859/0.858 | 0.883/0.882 | 0.729/0.731 |
+ | [NWGI](https://huggingface.co/datasets/oliverwang15/news_with_gpt_instructions) | **0.649/0.651** | 0.636/0.638 | 0.640/0.641 | 0.573/0.574 | 0.619/0.629 | 0.638/0.643 | 0.498/0.503 |
+
+ ### All Thanks To Our Contributors:
+ <a href="https://github.com/AI4Finance-Foundation/FinGPT/graphs/contributors">
+ <img src="https://contrib.rocks/image?repo=AI4Finance-Foundation/FinGPT" />
+ </a>
+
+ ## News
+
+ + [Columbia Perspectives on ChatGPT](https://datascience.columbia.edu/news/2023/columbia-perspectives-on-chatgpt/?utm_source=sendinblue&utm_campaign=DSI%20Newsletter%20April%202023&utm_medium=email)
+ + [MIT Technology Review] [ChatGPT is about to revolutionize the economy. We need to decide what that looks like](https://www.technologyreview.com/2023/03/25/1070275/chatgpt-revolutionize-economy-decide-what-looks-like/)
+ + [BloombergGPT] [BloombergGPT: A Large Language Model for Finance](https://arxiv.org/abs/2303.17564)
+ + [Finextra] [ChatGPT and Bing AI to sit as panellists at fintech conference](https://www.finextra.com/newsarticle/41973/chatgpt-and-bing-ai-to-sit-as-panellists-at-fintech-conference)
+
+ ## ChatGPT at AI4Finance
+
+ + [YouTube video] [I Built a Trading Bot with ChatGPT](https://www.youtube.com/watch?v=fhBw3j_O9LE), combining ChatGPT and FinRL.
+ + [Hey, ChatGPT! Explain FinRL code to me!](https://medium.com/@ai4finance/hey-chatgpt-explain-finrl-code-to-me-6a91d612296f)
+
+ ## Introductory
+
+ + [Sparks of artificial general intelligence: Early experiments with GPT-4](https://arxiv.org/abs/2303.12712)
+ + [GPT-4] [GPT-4 Technical Report](https://arxiv.org/abs/2303.08774)
+ + [InstructGPT] [Training language models to follow instructions with human feedback](https://openreview.net/forum?id=TG8KACxEON) NeurIPS 2022.
+
+ [The Journey of Open AI GPT models](https://medium.com/walmartglobaltech/the-journey-of-open-ai-gpt-models-32d95b7b7fb2). GPT models explained: OpenAI's GPT-1, GPT-2, and GPT-3.
+
+ + [GPT-3] [Language models are few-shot learners](https://proceedings.neurips.cc/paper/2020/hash/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html) NeurIPS 2020.
+ + [GPT-2] [Language Models are Unsupervised Multitask Learners](https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf)
+ + [GPT-1] [Improving Language Understanding by Generative Pre-Training](https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf)
+ + [Transformer] [Attention is All you Need](https://proceedings.neurips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html) NeurIPS 2017.
+
+ ## (Financial) Big Data
+
+ + [BloombergGPT] [BloombergGPT: A Large Language Model for Finance](https://arxiv.org/abs/2303.17564)
+
+ + [WHAT’S IN MY AI?](https://lifearchitect.ai/whats-in-my-ai/) A comprehensive analysis of the datasets used to train GPT-1, GPT-2, GPT-3, GPT-NeoX-20B, Megatron-11B, MT-NLG, and Gopher.
+
+ + [FinRL-Meta Repo](https://github.com/AI4Finance-Foundation/FinRL-Meta) and paper [FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement Learning](https://proceedings.neurips.cc/paper_files/paper/2022/hash/0bf54b80686d2c4dc0808c2e98d430f7-Abstract-Datasets_and_Benchmarks.html). Advances in Neural Information Processing Systems, 2022.
+
+ + [AI4Finance] [FinNLP](https://github.com/AI4Finance-Foundation/FinNLP): Democratizing Internet-scale financial data.
+
+ ## Interesting Demos
+
+ + [GPT-3 Creative Fiction](https://gwern.net/gpt-3#prompts-as-programming) Creative writing by OpenAI’s GPT-3 model, demonstrating poetry, dialogue, puns, literary parodies, and storytelling, plus advice on effective GPT-3 prompt programming and avoiding common errors.
+
+ ## ChatGPT for FinTech
+
+ **ChatGPT Trading Bot**
+ + [YouTube video] [ChatGPT Trading strategy 20097% returns](https://www.youtube.com/watch?v=unsa_gXPAJ4)
+ + [YouTube video] [ChatGPT Coding - Make A Profitable Trading Strategy In Five Minutes!](https://www.youtube.com/watch?v=4SG2884RcDY)
+ + [YouTube video] [Easy Automated Live Trading using ChatGPT (+9660.3% hands free)](https://www.youtube.com/watch?v=dIEZVPVOZPQ)
+ + [YouTube video] [ChatGPT Trading Strategy 893% Returns](https://www.youtube.com/watch?v=YxjvjK5AD2M)
+ + [YouTube video] [ChatGPT 10 Million Trading Strategy](https://www.youtube.com/watch?v=9VPfd08uU4Q)
+ + [YouTube video] [ChatGPT: Your Crypto Assistant](https://www.youtube.com/watch?v=LpzeshX6s2w)
+ + [YouTube video] [Generate Insane Trading Returns with ChatGPT and TradingView](https://www.youtube.com/watch?v=ekz6ugJE1h0&t=3s)
+
+ <!---
+ **(Fast and accurate) Sentiment Analysis**
+
+ GPT-3 can help analyze customer surveys and social media tweets from customers/users.
+
+ Tweets
+ + [Tweet Classifier](https://platform.openai.com/playground/p/default-tweet-classifier?model=text-davinci-003)
+ + [Advanced Tweet Classifier](https://platform.openai.com/playground/p/default-adv-tweet-classifier?model=text-davinci-003)
+
+ Financial News
+ + [Algorithmic Trading using Sentiment Analysis on News Articles](https://towardsdatascience.com/https-towardsdatascience-com-algorithmic-trading-using-sentiment-analysis-on-news-articles-83db77966704)
+ + [Accessing Historical Financial News Headlines with Python](https://python.plainenglish.io/access-historical-financial-news-headlines-with-python-be1b8faaea9f)
+
+ **PromptNet** By analogy to ImageNet and WordNet, it is critical to build a PromptNet.
+
+ + [Awesome_Prompting_Papers_in_Computer_Vision](https://github.com/ttengwang/Awesome_Prompting_Papers_in_Computer_Vision)
+ + [OpenPrompt](https://github.com/thunlp/OpenPrompt)
+ + [promptsource](https://github.com/bigscience-workshop/promptsource)
+
+ **Robo-advisor**
+
+ **Coding-tutor**
+
+ + [Hey, ChatGPT! Explain FinRL code to me!](https://medium.com/@ai4finance/hey-chatgpt-explain-finrl-code-to-me-6a91d612296f)
+
+ **Blogs about ChatGPT for FinTech**
+
+ ## ChatGPT APIs
+
+ Prompting as a new programming paradigm!
+ + [Towards Data Science] [GPT-3: Creative Potential of NLP](https://towardsdatascience.com/gpt-3-creative-potential-of-nlp-d5ccae16c1ab)
+ + [YouTube video] [OpenAI GPT-3 - Prompt Engineering For Financial NLP](https://www.youtube.com/watch?v=Nl2Cdbao5Ws)
+
+ + [OpenAI API for GPT-3](https://platform.openai.com/docs/models/gpt-3)
+ + [ChatGPT-wrapper: python and shell](https://github.com/mmabrouk/chatgpt-wrapper)
+ + [OpenAI Examples Library](https://platform.openai.com/examples)
+ + [GPT-3 Sandbox (Github)](https://github.com/shreyashankar/gpt3-sandbox) Enables users to create cool web demos using the OpenAI GPT-3 API.
+ + [Exploring the Capabilities of the ChatGPT API: A Beginner’s Guide](https://levelup.gitconnected.com/exploring-the-capabilities-of-the-chatgpt-api-a-beginners-guide-e9089d49961f)
+ + [Reverse engineered ChatGPT API](https://github.com/acheong08/ChatGPT)
+
+ **Prompting programming**
+
+ ## ChatGPT relatives
+
+ [A Release Timeline](https://github.com/osanseviero/ml_timeline) of many LLMs.
+
+ [PaLM](https://arxiv.org/abs/2204.02311)
+
+ [Chinchilla](https://arxiv.org/abs/2203.15556)
+
+ Interesting evaluations:
+ + [RLHF for pretraining](https://arxiv.org/abs/2302.08582)
+
+ + [Compare ChatGPT with GPT-3.5](https://arxiv.org/pdf/2302.06476.pdf)
+
+ + [Is ChatGPT A Good Translator? A Preliminary Study](https://arxiv.org/pdf/2301.08745.pdf)
+
+ + [A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity](https://arxiv.org/pdf/2302.04023.pdf)
+
+ [YouTube video] [Physics Solution: ChatGPT vs. Google](https://www.youtube.com/watch?v=x4dIx9VYQoM)
+ --->
+
+ ## Citing FinGPT
+ ```
+ @article{yang2023fingpt,
+   title={FinGPT: Open-Source Financial Large Language Models},
+   author={Yang, Hongyang and Liu, Xiao-Yang and Wang, Christina Dan},
+   journal={FinLLM Symposium at IJCAI 2023},
+   year={2023}
+ }
+ @article{zhang2023instructfingpt,
+   title={Instruct-FinGPT: Financial Sentiment Analysis by Instruction Tuning of General-Purpose Large Language Models},
+   author={Zhang, Boyu and Yang, Hongyang and Liu, Xiao-Yang},
+   journal={FinLLM Symposium at IJCAI 2023},
+   year={2023}
+ }
+ @article{zhang2023fingptrag,
+   title={Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language Models},
+   author={Zhang, Boyu and Yang, Hongyang and Zhou, Tianyu and Babar, Ali and Liu, Xiao-Yang},
+   journal={ACM International Conference on AI in Finance (ICAIF)},
+   year={2023}
+ }
+ @article{wang2023fingptbenchmark,
+   title={FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets},
+   author={Wang, Neng and Yang, Hongyang and Wang, Christina Dan},
+   journal={NeurIPS Workshop on Instruction Tuning and Instruction Following},
+   year={2023}
+ }
+ @article{2023finnlp,
+   title={Data-centric FinGPT: Democratizing Internet-scale Data for Financial Large Language Models},
+   author={Liu, Xiao-Yang and Wang, Guoxuan and Yang, Hongyang and Zha, Daochen},
+   journal={NeurIPS Workshop on Instruction Tuning and Instruction Following},
+   year={2023}
+ }
+ ```
+
+ <div align="center">
+ <a href="https://finllm.github.io/workshop/#/fcb" target="_blank">
+ <img align="center" src="figs/fingpt_best_presentation.png" width="65%">
+ </a>
+ </div>
+
+
+ ## LICENSE
+
+ MIT License
+
+ **Disclaimer: We are sharing this code for academic purposes under the MIT License. Nothing herein is financial advice, and it is NOT a recommendation to trade real money. Please use common sense and always consult a professional before trading or investing.**
+
external/FinGPT/fingpt/FinGPT_Benchmark/__init__.py ADDED
@@ -0,0 +1,2 @@
+ from .data.download import download as download_datasets
+ from . import benchmarks
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/__init__.py ADDED
@@ -0,0 +1,3 @@
+ from . import fpb, fiqa, finred, fineval, convfinqa, headline, ner, nwgi, tfns
+
+ __all__ = ['fpb', 'fiqa', 'finred', 'fineval', 'convfinqa', 'headline', 'ner', 'nwgi', 'tfns']
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/benchmarks.py ADDED
@@ -0,0 +1,114 @@
+ from transformers import AutoTokenizer, AutoModelForCausalLM
+ from peft import PeftModel, get_peft_model, LoraConfig, TaskType  # peft==0.4.0
+ import torch
+ import argparse
+
+
+ from fpb import test_fpb, test_fpb_mlt
+ from fiqa import test_fiqa, test_fiqa_mlt
+ from tfns import test_tfns
+ from nwgi import test_nwgi
+ from headline import test_headline
+ from ner import test_ner
+ from convfinqa import test_convfinqa
+ from fineval import test_fineval
+ from finred import test_re
+
+
+ import sys
+ sys.path.append('../')
+ from utils import *
+
+
+ def main(args):
+     if args.from_remote:
+         model_name = parse_model_name(args.base_model, args.from_remote)
+     else:
+         model_name = '../' + parse_model_name(args.base_model)
+
+     model = AutoModelForCausalLM.from_pretrained(
+         model_name, trust_remote_code=True,
+         # load_in_8bit=True,
+         device_map="auto",
+         # fp16=True,
+     )
+     model.model_parallel = True
+
+     tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
+
+     # tokenizer.pad_token_id = tokenizer.eos_token_id
+
+     # Decoder-only models need left padding for batched generation.
+     tokenizer.padding_side = "left"
+     if args.base_model == 'qwen':
+         tokenizer.eos_token_id = tokenizer.convert_tokens_to_ids('<|endoftext|>')
+         tokenizer.pad_token_id = tokenizer.convert_tokens_to_ids('<|extra_0|>')
+     if not tokenizer.pad_token or tokenizer.pad_token_id == tokenizer.eos_token_id:
+         tokenizer.add_special_tokens({'pad_token': '[PAD]'})
+         model.resize_token_embeddings(len(tokenizer))
+
+     print(f'pad: {tokenizer.pad_token_id}, eos: {tokenizer.eos_token_id}')
+
+     # peft_config = LoraConfig(
+     #     task_type=TaskType.CAUSAL_LM,
+     #     inference_mode=False,
+     #     r=8,
+     #     lora_alpha=32,
+     #     lora_dropout=0.1,
+     #     target_modules=lora_module_dict[args.base_model],
+     #     bias='none',
+     # )
+     # model = get_peft_model(model, peft_config)
+     # model.load_state_dict(torch.load(args.peft_model + '/pytorch_model.bin'))
+
+     model = PeftModel.from_pretrained(model, args.peft_model)
+     model = model.eval()
+
+     with torch.no_grad():
+         for data in args.dataset.split(','):
+             if data == 'fpb':
+                 test_fpb(args, model, tokenizer)
+             elif data == 'fpb_mlt':
+                 test_fpb_mlt(args, model, tokenizer)
+             elif data == 'fiqa':
+                 test_fiqa(args, model, tokenizer)
+             elif data == 'fiqa_mlt':
+                 test_fiqa_mlt(args, model, tokenizer)
+             elif data == 'tfns':
+                 test_tfns(args, model, tokenizer)
+             elif data == 'nwgi':
+                 test_nwgi(args, model, tokenizer)
+             elif data == 'headline':
+                 test_headline(args, model, tokenizer)
+             elif data == 'ner':
+                 test_ner(args, model, tokenizer)
+             elif data == 'convfinqa':
+                 test_convfinqa(args, model, tokenizer)
+             elif data == 'fineval':
+                 test_fineval(args, model, tokenizer)
+             elif data == 're':
+                 test_re(args, model, tokenizer)
+             else:
+                 raise ValueError('undefined dataset.')
+
+     print('Evaluation Ends.')
+
+
+ if __name__ == "__main__":
+
+     parser = argparse.ArgumentParser()
+     parser.add_argument("--dataset", required=True, type=str)
+     parser.add_argument("--base_model", required=True, type=str, choices=['chatglm2', 'llama2', 'llama2-13b', 'llama2-13b-nr', 'baichuan', 'falcon', 'internlm', 'qwen', 'mpt', 'bloom'])
+     parser.add_argument("--peft_model", required=True, type=str)
+     parser.add_argument("--max_length", default=512, type=int)
+     parser.add_argument("--batch_size", default=4, type=int, help="The eval batch size per device")
+     parser.add_argument("--instruct_template", default='default')
+     # argparse's type=bool treats any non-empty string as True; parse the value explicitly.
+     parser.add_argument("--from_remote", default=False, type=lambda x: str(x).lower() in ('true', '1'))
+
+     args = parser.parse_args()
+
+     print(args.base_model)
+     print(args.peft_model)
+
+     main(args)
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/convfinqa.py ADDED
@@ -0,0 +1,75 @@
+ from seqeval.metrics import accuracy_score
+ from datasets import load_dataset, load_from_disk
+ from tqdm import tqdm
+ import datasets
+ import torch
+ from torch.utils.data import DataLoader
+ from functools import partial
+ import re
+ import sys
+ import numpy as np
+ from fingpt.FinGPT_Benchmark.utils import *
+ from pathlib import Path
+ sys.path.append('../')
+
+
+ def cvt_text_to_pred(text):
+     if not text:
+         return 'nan'
+     # Extract the first signed integer or decimal from the generated text.
+     pred_match = re.search(r'-?\d+(\.\d+)?', text)
+     if pred_match is not None:
+         pred = pred_match.group()
+     else:
+         print(text)
+         pred = '0.0'
+     return pred
+
+
+ def map_output(feature):
+     label = cvt_text_to_pred(feature['output'])
+     pred = cvt_text_to_pred(feature['out_text'])
+     return {'label': label, 'pred': pred}
+
+
+ def test_convfinqa(args, model, tokenizer):
+     # Evaluation entry point imported by benchmarks.py.
+     dataset = load_from_disk(Path(__file__).parent.parent / 'data/fingpt-convfinqa')['test']
+     dataset = dataset.map(partial(test_mapping, args), load_from_cache_file=False)
+
+     def collate_fn(batch):
+         inputs = tokenizer(
+             [f["prompt"] for f in batch], return_tensors='pt',
+             padding=True, max_length=args.max_length,
+             return_token_type_ids=False
+         )
+         return inputs
+
+     dataloader = DataLoader(dataset, batch_size=args.batch_size, collate_fn=collate_fn, shuffle=False)
+
+     out_text_list = []
+     log_interval = max(len(dataloader) // 5, 1)  # avoid modulo-by-zero on tiny datasets
+
+     for idx, inputs in enumerate(tqdm(dataloader)):
+         inputs = {key: value.to(model.device) for key, value in inputs.items()}
+         res = model.generate(**inputs, max_length=args.max_length, eos_token_id=tokenizer.eos_token_id)
+         res_sentences = [tokenizer.decode(i, skip_special_tokens=True) for i in res]
+         if (idx + 1) % log_interval == 0:
+             tqdm.write(f'{idx}: {res_sentences[0]}')
+         out_text = [o.split("Answer: ")[1] if "Answer: " in o else "" for o in res_sentences]
+         out_text_list += out_text
+         torch.cuda.empty_cache()
+
+     dataset = dataset.add_column("out_text", out_text_list)
+     dataset = dataset.map(map_output, load_from_cache_file=False)
+     dataset = dataset.filter(lambda x: x['pred'] != 'nan')
+     dataset = dataset.to_pandas()
+
+     print(dataset)
+     dataset.to_csv('tmp.csv')
+
+     label = [float(d) for d in dataset['label']]
+     pred = [float(d) for d in dataset['pred']]
+
+     print('Accuracy: ', accuracy_score(label, pred))
+
+     return dataset
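
As a quick sanity check of the numeric-extraction convention used above in `cvt_text_to_pred`, this toy snippet (assumed strings, not real benchmark output) applies the same pattern outside the evaluation harness:

```python
import re

def extract_number(text):
    # First signed integer or decimal in the text; '0.0' if none is found.
    m = re.search(r'-?\d+(\.\d+)?', text or '')
    return m.group() if m else '0.0'

outputs = ["21.5", "roughly 3 percent", "no number here"]  # assumed model outputs
print([extract_number(o) for o in outputs])                # ['21.5', '3', '0.0']
```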
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/evaluate.sh ADDED
@@ -0,0 +1,395 @@
+ # export TRANSFORMERS_NO_ADVISORY_WARNINGS=1
+ # export TOKENIZERS_PARALLELISM=0
+
+
+
+
+ #---- Relation Extraction ----
+
+ python benchmarks.py \
+ --dataset re \
+ --base_model llama2 \
+ --peft_model ../finetuned_models/finred-llama2-linear_202310012254 \
+ --batch_size 8 \
+ --max_length 512
+
+ # python benchmarks.py \
+ # --dataset re \
+ # --base_model chatglm2 \
+ # --peft_model ../finetuned_models/finred-chatglm2-linear_202310010213 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset re \
+ # --base_model qwen \
+ # --peft_model ../finetuned_models/finred-qwen-linear_202310010502 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset re \
+ # --base_model mpt \
+ # --peft_model ../finetuned_models/finred-mpt-linear_202310010641 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset re \
+ # --base_model bloom \
+ # --peft_model ../finetuned_models/finred-bloom-linear_202310010741 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset re \
+ # --base_model falcon \
+ # --peft_model ../finetuned_models/finred-falcon-linear_202310010333 \
+ # --batch_size 1 \
+ # --max_length 512
+
+
+ #---- Generalization ----
+
+
+ # python benchmarks.py \
+ # --dataset fiqa_mlt \
+ # --base_model falcon \
+ # --peft_model ../finetuned_models/GRCLS-sentiment-falcon-linear-small_202309291801/checkpoint-300 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset fpb_mlt \
+ # --base_model llama2 \
+ # --peft_model ../finetuned_models/GRCLS-sentiment-llama2-linear-small_202309290356/checkpoint-800 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset fiqa_mlt \
+ # --base_model qwen \
+ # --peft_model ../finetuned_models/GRCLS-sentiment-qwen-linear-small_202309292115/checkpoint-700 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset fpb_mlt \
+ # --base_model mpt \
+ # --peft_model ../finetuned_models/GRCLS-sentiment-mpt-linear-small_202309300359/checkpoint-400 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset fiqa_mlt \
+ # --base_model chatglm2 \
+ # --peft_model ../finetuned_models/GRCLS-sentiment-chatglm2-linear-1e-4lr_202309280440/checkpoint-212 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset fiqa_mlt \
+ # --base_model bloom \
+ # --peft_model ../finetuned_models/GRCLS-sentiment-bloom-linear-small_202309300044/checkpoint-500 \
+ # --batch_size 8 \
+ # --max_length 512
+
+
+
+
+ #---- Multi-Task ----
+
+ # python benchmarks.py \
+ # --dataset re \
+ # --base_model chatglm2 \
+ # --peft_model ../finetuned_models/MT-chatglm2-linear_202309201120 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset re \
+ # --base_model falcon \
+ # --peft_model ../finetuned_models/MT-falcon-linear_202309210126 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset re \
+ # --base_model bloom \
+ # --peft_model ../finetuned_models/MT-bloom-linear_202309211510 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset re \
+ # --base_model qwen \
+ # --peft_model ../finetuned_models/MT-qwen-linear_202309221011 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset re \
+ # --base_model mpt \
+ # --peft_model ../finetuned_models/MT-mpt-linear_202309230221 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset re \
+ # --base_model llama2 \
+ # --peft_model ../finetuned_models/MT-llama2-linear_202309241345 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi,headline,ner,re \
+ # --base_model chatglm2 \
+ # --peft_model ../finetuned_models/MT-chatglm2-linear_202309201120 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi,headline,ner,re \
+ # --base_model falcon \
+ # --peft_model ../finetuned_models/MT-falcon-linear_202309210126 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi,headline,ner,re \
+ # --base_model bloom \
+ # --peft_model ../finetuned_models/MT-bloom-linear_202309211510 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi,headline,ner,re \
+ # --base_model qwen \
+ # --peft_model ../finetuned_models/MT-qwen-linear_202309221011 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi,headline,ner,re \
+ # --base_model mpt \
+ # --peft_model ../finetuned_models/MT-mpt-linear_202309230221 \
+ # --batch_size 8 \
+ # --max_length 512
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi,headline,ner,re \
+ # --base_model llama2 \
+ # --peft_model ../finetuned_models/MT-llama2-linear_202309241345 \
+ # --batch_size 8 \
+ # --max_length 512
+
+
+ #---- ConvFinQA ----
+
+ # python benchmarks.py \
+ # --dataset convfinqa \
+ # --base_model falcon \
+ # --peft_model ../finetuned_models/convfinqa-falcon-linear_202309170614 \
+ # --batch_size 1 \
+ # --max_length 2048
+
+ # python benchmarks.py \
+ # --dataset convfinqa \
+ # --base_model chatglm2 \
+ # --peft_model ../finetuned_models/convfinqa-chatglm2-linear_202309170247 \
+ # --batch_size 1 \
+ # --max_length 2048
+
+ # python benchmarks.py \
+ # --dataset convfinqa \
+ # --base_model qwen \
+ # --peft_model ../finetuned_models/convfinqa-qwen-linear_202309171029 \
+ # --batch_size 1 \
+ # --max_length 2048
+
+ # python benchmarks.py \
+ # --dataset convfinqa \
+ # --base_model bloom \
+ # --peft_model ../finetuned_models/convfinqa-bloom-linear_202309171502 \
+ # --batch_size 1 \
+ # --max_length 2048
+
+ # python benchmarks.py \
+ # --dataset convfinqa \
+ # --base_model llama2 \
+ # --peft_model ../finetuned_models/convfinqa-llama2-linear_202309162205 \
+ # --batch_size 1 \
+ # --max_length 2048
+
+
+ #---- FinEval ----
+
+ # python benchmarks.py \
+ # --dataset fineval \
+ # --base_model falcon \
+ # --peft_model ../finetuned_models/fineval-falcon-linear_202309220409 \
+ # --batch_size 1
+
+ # python benchmarks.py \
+ # --dataset fineval \
+ # --base_model chatglm2 \
+ # --peft_model ../finetuned_models/fineval-chatglm2-linear_202309220332 \
+ # --batch_size 1
+
+ # python benchmarks.py \
+ # --dataset fineval \
+ # --base_model qwen \
+ # --peft_model ../finetuned_models/fineval-qwen-linear_202309220508 \
+ # --batch_size 1
+
+ # python benchmarks.py \
+ # --dataset fineval \
+ # --base_model bloom \
+ # --peft_model ../finetuned_models/fineval-bloom-linear_202309220639 \
+ # --batch_size 1
+
+ # python benchmarks.py \
+ # --dataset fineval \
+ # --base_model mpt \
+ # --peft_model ../finetuned_models/fineval-mpt-linear_202309220555 \
+ # --batch_size 1
+
+ # python benchmarks.py \
+ # --dataset fineval \
+ # --base_model llama2 \
+ # --peft_model ../finetuned_models/fineval-llama2-linear_202309192232 \
+ # --batch_size 1
+
+ # python benchmarks.py \
+ # --dataset fineval \
+ # --base_model internlm \
+ # --peft_model ../finetuned_models/fineval-internlm-linear_202309211248 \
+ # --batch_size 1
+
+
+ #---- NER ----
+
+ # python benchmarks.py \
+ # --dataset ner \
+ # --base_model falcon \
+ # --peft_model ../finetuned_models/ner-falcon-linear_202309160320 \
+ # --batch_size 1
+
+ # python benchmarks.py \
+ # --dataset ner \
+ # --base_model chatglm2 \
+ # --peft_model ../finetuned_models/ner-chatglm2-linear_202309160238 \
+ # --batch_size 1
+
+ # python benchmarks.py \
+ # --dataset ner \
+ # --base_model qwen \
+ # --peft_model ../finetuned_models/ner-qwen-linear_202309160409 \
+ # --batch_size 1
+
+ # python benchmarks.py \
+ # --dataset ner \
+ # --base_model bloom \
+ # --peft_model ../finetuned_models/ner-bloom-linear_202309160530 \
+ # --batch_size 1
+
+ # python benchmarks.py \
+ # --dataset ner \
+ # --base_model mpt \
+ # --peft_model ../finetuned_models/ner-mpt-linear_202309160459 \
+ # --batch_size 1
+
+ # python benchmarks.py \
+ # --dataset ner \
+ # --base_model llama2 \
+ # --peft_model ../finetuned_models/ner-llama2-linear_202309161924 \
+ # --batch_size 1
+
+ #---- sentiment analysis ----
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi \
+ # --base_model llama2 \
+ # --peft_model ../finetuned_models/sentiment-llama2-linear_202309130723 \
+ # --batch_size 8
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi \
+ # --base_model falcon \
+ # --peft_model ../finetuned_models/sentiment-falcon-default_20230911055454 \
+ # --batch_size 8
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi \
+ # --base_model chatglm2 \
+ # --peft_model ../finetuned_models/sentiment-chatglm2-default_20230910031650 \
+ # --batch_size 8
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi \
+ # --base_model qwen \
+ # --peft_model ../finetuned_models/sentiment-qwen-linear_202309132016 \
+ # --batch_size 8
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi \
+ # --base_model internlm \
+ # --peft_model ../finetuned_models/sentiment-internlm-linear_202309130230 \
+ # --batch_size 8
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi \
+ # --base_model bloom \
+ # --peft_model ../finetuned_models/sentiment-bloom-linear_202309151934 \
+ # --batch_size 8
+
+ # python benchmarks.py \
+ # --dataset fpb,fiqa,tfns,nwgi \
+ # --base_model mpt \
+ # --peft_model ../finetuned_models/sentiment-mpt-linear_202309151405 \
+ # --batch_size 8
+
+
+ #---- headline ----
+
+ # python benchmarks.py \
+ # --dataset headline \
+ # --base_model llama2 \
+ # --peft_model ../finetuned_models/headline-llama2-linear_202309140611 \
+ # --batch_size 8
+
+ # python benchmarks.py \
+ # --dataset headline \
+ # --base_model chatglm2 \
+ # --peft_model ../finetuned_models/headline-chatglm2-linear_202309140941 \
+ # --batch_size 8
+
+ # python benchmarks.py \
+ # --dataset headline \
+ # --base_model internlm \
+ # --peft_model ../finetuned_models/headline-internlm-linear_202309140308 \
+ # --batch_size 8
+
+ # python benchmarks.py \
+ # --dataset headline \
+ # --base_model falcon \
+ # --peft_model ../finetuned_models/headline-falcon-linear_202309141852 \
+ # --batch_size 8
+
+ # python benchmarks.py \
+ # --dataset headline \
+ # --base_model qwen \
+ # --peft_model ../finetuned_models/headline-qwen-linear_202309142156 \
+ # --batch_size 8
+
+ # python benchmarks.py \
+ # --dataset headline \
+ # --base_model mpt \
+ # --peft_model ../finetuned_models/headline-mpt-linear_202309150151 \
+ # --batch_size 8
+
+ # python benchmarks.py \
+ # --dataset headline \
+ # --base_model bloom \
+ # --peft_model ../finetuned_models/headline-bloom-linear_202309151641 \
+ # --batch_size 8
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/fineval.py ADDED
@@ -0,0 +1,72 @@
+ from seqeval.metrics import accuracy_score
+ from datasets import load_dataset, load_from_disk
+ from tqdm import tqdm
+ import datasets
+ import torch
+ from torch.utils.data import DataLoader
+ from functools import partial
+ import re
+ import sys
+ import numpy as np
+ from fingpt.FinGPT_Benchmark.utils import *
+ from pathlib import Path
+ sys.path.append('../')
+
+
+ def cvt_text_to_pred(text):
+     # Map a generated answer like "B" to its option index (A->0 ... D->3); -1 if absent.
+     pred_match = re.search(r'[ABCD]', text)
+     if pred_match is not None:
+         pred = pred_match.group()
+         pred = ["A", "B", "C", "D"].index(pred)
+     else:
+         pred = -1
+     return pred
+
+
+ def map_output(feature):
+     label = cvt_text_to_pred(feature['output'])
+     pred = cvt_text_to_pred(feature['out_text'])
+     return {'label': label, 'pred': pred}
+
+
+ def test_fineval(args, model, tokenizer):
+
+     dataset = load_from_disk(Path(__file__).parent.parent / 'data/fingpt-fineval')['test']
+     dataset = dataset.map(partial(test_mapping, args), load_from_cache_file=False)
+
+     def collate_fn(batch):
+         inputs = tokenizer(
+             [f["prompt"] for f in batch], return_tensors='pt',
+             padding=True, max_length=args.max_length,
+             return_token_type_ids=False
+         )
+         return inputs
+
+     dataloader = DataLoader(dataset, batch_size=args.batch_size, collate_fn=collate_fn, shuffle=False)
+
+     out_text_list = []
+     log_interval = max(len(dataloader) // 5, 1)  # avoid modulo-by-zero on tiny datasets
+
+     for idx, inputs in enumerate(tqdm(dataloader)):
+         inputs = {key: value.to(model.device) for key, value in inputs.items()}
+         res = model.generate(**inputs, max_length=args.max_length, eos_token_id=tokenizer.eos_token_id)
+         res_sentences = [tokenizer.decode(i, skip_special_tokens=True) for i in res]
+         if (idx + 1) % log_interval == 0:
+             tqdm.write(f'{idx}: {res_sentences[0]}')
+         out_text = [o.split("Answer: ")[1] for o in res_sentences]
+         out_text_list += out_text
+         torch.cuda.empty_cache()
+
+     dataset = dataset.add_column("out_text", out_text_list)
+     dataset = dataset.map(map_output, load_from_cache_file=False)
+     dataset = dataset.to_pandas()
+
+     print(dataset)
+     dataset.to_csv('tmp.csv')
+
+     print('Accuracy:', accuracy_score(dataset['label'], dataset['pred']))
+
+     return dataset
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/finred.py ADDED
@@ -0,0 +1,150 @@
+ from seqeval.metrics import classification_report
+ from datasets import load_dataset, load_from_disk
+ from tqdm import tqdm
+ import datasets
+ import torch
+ from torch.utils.data import DataLoader
+ from functools import partial
+ import re
+ import sys
+ import numpy as np
+ from fingpt.FinGPT_Benchmark.utils import *
+ from pathlib import Path
+ sys.path.append('../')
+
+
+ relations = [
+     'product_or_material_produced',
+     'manufacturer',
+     'distributed_by',
+     'industry',
+     'position_held',
+     'original_broadcaster',
+     'owned_by',
+     'founded_by',
+     'distribution_format',
+     'headquarters_location',
+     'stock_exchange',
+     'currency',
+     'parent_organization',
+     'chief_executive_officer',
+     'director_/_manager',
+     'owner_of',
+     'operator',
+     'member_of',
+     'employer',
+     'chairperson',
+     'platform',
+     'subsidiary',
+     'legal_form',
+     'publisher',
+     'developer',
+     'brand',
+     'business_division',
+     'location_of_formation',
+     'creator',
+ ]
+
+
+ def cvt_text_to_pred(ref, text):
+     # Parse "relation: word1, word2; ..." outputs into (relation, word1, word2) triples,
+     # keeping only known relations whose arguments appear in the reference text.
+     preds = []
+     for pred_txt in text.strip('.').split(';'):
+         pred_match = re.match(r'^(.*):(.*),(.*)$', pred_txt)
+         if pred_match is not None:
+             relation, word1, word2 = pred_match.group(1).strip(), pred_match.group(2).strip(), pred_match.group(3).strip()
+             if relation in relations and word1 in ref and word2 in ref:
+                 preds.append((relation, word1, word2))
+             else:
+                 print("Not found Error: ", relation, word1, word2, ref)
+         else:
+             print("Parse Error: ", pred_txt)
+
+     return preds
+
+
+ def map_output(feature):
+     ref = feature['input']
+     label = cvt_text_to_pred(ref, feature['output'])
+     pred = cvt_text_to_pred(ref, feature['out_text'])
+     return {'label': label, 'pred': pred}
+
+
+ def calc_metric(gt_list, pred_list):
+     # Initialize counters for true positives, false positives, and false negatives
+     true_positives = 0
+     false_positives = 0
+     false_negatives = 0
+
+     for (ground_truth, predicted_relations) in zip(gt_list, pred_list):
+         # Count true positives and false positives
+         for relation in predicted_relations:
+             if relation in ground_truth:
+                 true_positives += 1
+             else:
+                 false_positives += 1
+
+         # Count false negatives
+         for relation in ground_truth:
+             if relation not in predicted_relations:
+                 false_negatives += 1
+
+     # Calculate precision, recall, and F1-score, guarding against empty denominators
+     precision = true_positives / (true_positives + false_positives) if (true_positives + false_positives) else 0.0
+     recall = true_positives / (true_positives + false_negatives) if (true_positives + false_negatives) else 0.0
+     f1_score = 2 * (precision * recall) / (precision + recall) if (precision + recall) else 0.0
+
+     # Print the results
+     print("Precision:", precision)
+     print("Recall:", recall)
+     print("F1-Score:", f1_score)
+
+
+ def test_re(args, model, tokenizer):
+
+     dataset = load_from_disk(Path(__file__).parent.parent / 'data/fingpt-finred-re')['test']
+     dataset = dataset.train_test_split(0.2, seed=42)['test']
+     dataset = dataset.map(partial(test_mapping, args), load_from_cache_file=False)
+
+     def collate_fn(batch):
+         inputs = tokenizer(
+             [f["prompt"] for f in batch], return_tensors='pt',
+             padding=True, max_length=args.max_length,
+             return_token_type_ids=False
+         )
+         return inputs
+
+     dataloader = DataLoader(dataset, batch_size=args.batch_size, collate_fn=collate_fn, shuffle=False)
+
+     out_text_list = []
+     log_interval = max(len(dataloader) // 5, 1)  # avoid modulo-by-zero on tiny datasets
+
+     for idx, inputs in enumerate(tqdm(dataloader)):
+         inputs = {key: value.to(model.device) for key, value in inputs.items()}
+         # max_new_tokens takes precedence over max_length when both are given
+         res = model.generate(**inputs, max_length=args.max_length, eos_token_id=tokenizer.eos_token_id, max_new_tokens=128)
+         res_sentences = [tokenizer.decode(i, skip_special_tokens=True) for i in res]
+         if (idx + 1) % log_interval == 0:
+             tqdm.write(f'{idx}: {res_sentences[0]}')
+         out_text = [o.split("Answer: ")[1] for o in res_sentences]
+         out_text_list += out_text
+         torch.cuda.empty_cache()
+
+     dataset = dataset.add_column("out_text", out_text_list)
+     dataset = dataset.map(map_output, load_from_cache_file=False)
+     dataset = dataset.to_pandas()
+
+     print(dataset)
+     dataset.to_csv('tmp.csv')
+
+     label = [[tuple(t) for t in d.tolist()] for d in dataset['label']]
+     pred = [[tuple(t) for t in d.tolist()] for d in dataset['pred']]
+
+     label_re = [[t[0] for t in d.tolist()] for d in dataset['label']]
+     pred_re = [[t[0] for t in d.tolist()] for d in dataset['pred']]
+
+     calc_metric(label, pred)
+
+     calc_metric(label_re, pred_re)
+
+     return dataset
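
To make `calc_metric` concrete with assumed toy data: one ground-truth triple {(owned_by, ViiV Healthcare, GSK)} against two predictions gives TP=1, FP=1, FN=0, i.e. precision 0.5, recall 1.0, F1 ≈ 0.667. A minimal re-computation sketch:

```python
gt   = [[('owned_by', 'ViiV Healthcare', 'GSK')]]                         # assumed labels
pred = [[('owned_by', 'ViiV Healthcare', 'GSK'),
         ('manufacturer', 'Desano', 'ViiV Healthcare')]]                  # assumed predictions

tp = sum(r in g for g, p in zip(gt, pred) for r in p)                     # 1
fp = sum(r not in g for g, p in zip(gt, pred) for r in p)                 # 1
fn = sum(r not in p for g, p in zip(gt, pred) for r in g)                 # 0
precision, recall = tp / (tp + fp), tp / (tp + fn)
print(precision, recall, 2 * precision * recall / (precision + recall))   # 0.5 1.0 0.666...
```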
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/fiqa.py ADDED
@@ -0,0 +1,176 @@
+ import warnings
+ warnings.filterwarnings("ignore")
+
+ from sklearn.metrics import accuracy_score, f1_score
+ from datasets import load_dataset, load_from_disk, Dataset
+ from tqdm import tqdm
+ import datasets
+ import torch
+
+ from torch.utils.data import DataLoader
+ from functools import partial
+ from pathlib import Path
+
+
+ with open(Path(__file__).parent / 'sentiment_templates.txt') as f:
+     templates = [l.strip() for l in f.readlines()]
+
+
+ def format_example(example: dict) -> dict:
+     context = f"Instruction: {example['instruction']}\n"
+     if example.get("input"):
+         context += f"Input: {example['input']}\n"
+     context += "Answer: "
+     target = example["output"]
+     return {"context": context, "target": target}
+
+ def add_instructions(x):
+     if x.format == "post":
+         return "What is the sentiment of this tweet? Please choose an answer from {negative/neutral/positive}."
+     else:
+         return "What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}."
+
+ def make_label(x):
+     if x < -0.1: return "negative"
+     elif -0.1 <= x < 0.1: return "neutral"
+     elif x >= 0.1: return "positive"
+
+ def change_target(x):
+     if 'positive' in x or 'Positive' in x:
+         return 'positive'
+     elif 'negative' in x or 'Negative' in x:
+         return 'negative'
+     else:
+         return 'neutral'
+
+ def vote_output(x):
+     # Majority vote over the per-template outputs; ties fall back to neutral.
+     output_dict = {'positive': 0, 'negative': 0, 'neutral': 0}
+     for i in range(len(templates)):
+         pred = change_target(x[f'out_text_{i}'].lower())
+         output_dict[pred] += 1
+     if output_dict['positive'] > output_dict['negative']:
+         return 'positive'
+     elif output_dict['negative'] > output_dict['positive']:
+         return 'negative'
+     else:
+         return 'neutral'
+
+
+ def test_fiqa(args, model, tokenizer, prompt_fun=add_instructions):
+     batch_size = args.batch_size
+     # dataset = load_dataset('pauri32/fiqa-2018')
+     dataset = load_from_disk(Path(__file__).parent.parent / 'data/fiqa-2018/')
+     dataset = datasets.concatenate_datasets([dataset["train"], dataset["validation"], dataset["test"]])
+     dataset = dataset.train_test_split(0.226, seed=42)['test']
+     dataset = dataset.to_pandas()
+     dataset["output"] = dataset.sentiment_score.apply(make_label)
+     if prompt_fun is None:
+         dataset["instruction"] = "What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}."
+     else:
+         dataset["instruction"] = dataset.apply(prompt_fun, axis=1)
+
+     dataset = dataset[['sentence', 'output', "instruction"]]
+     dataset.columns = ["input", "output", "instruction"]
+     dataset[["context", "target"]] = dataset.apply(format_example, axis=1, result_type="expand")
+
+     # print example
+     print(f"\n\nPrompt example:\n{dataset['context'][0]}\n\n")
+
+     context = dataset['context'].tolist()
+     total_steps = (dataset.shape[0] + batch_size - 1) // batch_size  # ceil division avoids an empty final batch
+     print(f"Total len: {len(context)}. Batchsize: {batch_size}. Total steps: {total_steps}")
+
+     out_text_list = []
+
+     for i in tqdm(range(total_steps)):
+         tmp_context = context[i * batch_size:(i + 1) * batch_size]
+         tokens = tokenizer(tmp_context, return_tensors='pt', padding=True, max_length=512, return_token_type_ids=False)
+         # tokens.pop('token_type_ids')
+         for k in tokens.keys():
+             tokens[k] = tokens[k].cuda()
+
+         res = model.generate(**tokens, max_length=512, eos_token_id=tokenizer.eos_token_id)
+         res_sentences = [tokenizer.decode(i, skip_special_tokens=True) for i in res]
+         tqdm.write(f'{i}: {res_sentences[0]}')
+         out_text = [o.split("Answer: ")[1] for o in res_sentences]
+         out_text_list += out_text
+         torch.cuda.empty_cache()
+
+     dataset["out_text"] = out_text_list
+     dataset["new_target"] = dataset["target"].apply(change_target)
+     dataset["new_out"] = dataset["out_text"].apply(change_target)
+
+     acc = accuracy_score(dataset["new_target"], dataset["new_out"])
+     f1_macro = f1_score(dataset["new_target"], dataset["new_out"], average="macro")
+     f1_micro = f1_score(dataset["new_target"], dataset["new_out"], average="micro")
+     f1_weighted = f1_score(dataset["new_target"], dataset["new_out"], average="weighted")
+
+     print(f"Acc: {acc}. F1 macro: {f1_macro}. F1 micro: {f1_micro}. F1 weighted (BloombergGPT): {f1_weighted}.")
+
+     return dataset
+
+
+ def test_fiqa_mlt(args, model, tokenizer):
+     batch_size = args.batch_size
+     # dataset = load_dataset('pauri32/fiqa-2018')
+     dataset = load_from_disk(Path(__file__).parent.parent / 'data/fiqa-2018/')
+     dataset = datasets.concatenate_datasets([dataset["train"], dataset["validation"], dataset["test"]])
+     dataset = dataset.train_test_split(0.226, seed=42)['test']
+     dataset = dataset.to_pandas()
+     dataset["output"] = dataset.sentiment_score.apply(make_label)
+     dataset["text_type"] = dataset.apply(lambda x: 'tweet' if x.format == "post" else 'news', axis=1)
+     dataset = dataset[['sentence', 'output', "text_type"]]
+     dataset.columns = ["input", "output", "text_type"]
+
+     dataset["output"] = dataset["output"].apply(change_target)
+     dataset = dataset[dataset["output"] != 'neutral']
+
+     out_texts_list = [[] for _ in range(len(templates))]
+
+     def collate_fn(batch):
+         inputs = tokenizer(
+             [f["context"] for f in batch], return_tensors='pt',
+             padding=True, max_length=args.max_length,
+             return_token_type_ids=False
+         )
+         return inputs
+
+     for i, template in enumerate(templates):
+         dataset = dataset[['input', 'output', "text_type"]]
+         dataset["instruction"] = dataset['text_type'].apply(lambda x: template.format(type=x) + "\nOptions: positive, negative")
+         # dataset["instruction"] = dataset['text_type'].apply(lambda x: template.format(type=x) + "\nOptions: negative, positive")
+         dataset[["context", "target"]] = dataset.apply(format_example, axis=1, result_type="expand")
+
+         dataloader = DataLoader(Dataset.from_pandas(dataset), batch_size=args.batch_size, collate_fn=collate_fn, shuffle=False)
+
+         log_interval = max(len(dataloader) // 5, 1)
+
+         for idx, inputs in enumerate(tqdm(dataloader)):
+             inputs = {key: value.to(model.device) for key, value in inputs.items()}
+             res = model.generate(**inputs, do_sample=False, max_length=args.max_length, eos_token_id=tokenizer.eos_token_id)  # , max_new_tokens=10
+             res_sentences = [tokenizer.decode(i, skip_special_tokens=True) for i in res]
+             tqdm.write(f'{idx}: {res_sentences[0]}')
+             # if (idx + 1) % log_interval == 0:
+             #     tqdm.write(f'{idx}: {res_sentences[0]}')
+             out_text = [o.split("Answer: ")[1] for o in res_sentences]
+             out_texts_list[i] += out_text
+             torch.cuda.empty_cache()
+
+     for i in range(len(templates)):
+         dataset[f"out_text_{i}"] = out_texts_list[i]
+         dataset[f"out_text_{i}"] = dataset[f"out_text_{i}"].apply(change_target)
+
+     dataset["new_out"] = dataset.apply(vote_output, axis=1, result_type="expand")
+
+     dataset.to_csv('tmp.csv')
+
+     for k in [f"out_text_{i}" for i in range(len(templates))] + ["new_out"]:
+         acc = accuracy_score(dataset["target"], dataset[k])
+         f1_macro = f1_score(dataset["target"], dataset[k], average="macro")
+         f1_micro = f1_score(dataset["target"], dataset[k], average="micro")
+         f1_weighted = f1_score(dataset["target"], dataset[k], average="weighted")
+
+         print(f"Acc: {acc}. F1 macro: {f1_macro}. F1 micro: {f1_micro}. F1 weighted (BloombergGPT): {f1_weighted}.")
+
+     return dataset
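
A small self-contained illustration of the labeling and voting conventions above, using three assumed per-template generations for a single example (toy stand-ins, not the module's functions):

```python
def make_label(score):
    # Same thresholds as above: < -0.1 negative, >= 0.1 positive, else neutral.
    return "negative" if score < -0.1 else ("positive" if score >= 0.1 else "neutral")

def to_target(text):
    t = text.lower()
    return 'positive' if 'positive' in t else ('negative' if 'negative' in t else 'neutral')

outs = ["Positive.", "positive", "negative"]   # assumed outputs from three prompt templates
votes = [to_target(o) for o in outs]
print(make_label(0.42))                        # positive
print(max(set(votes), key=votes.count))        # positive (simple majority vote)
```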
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/fpb.py ADDED
@@ -0,0 +1,168 @@
+ import warnings
+ warnings.filterwarnings("ignore")
+
+ from sklearn.metrics import accuracy_score, f1_score
+ from datasets import load_dataset, load_from_disk, Dataset
+ from tqdm import tqdm
+ import datasets
+ import torch
+
+ from torch.utils.data import DataLoader
+ from functools import partial
+ from pathlib import Path
+
+ dic = {
+     0: "negative",
+     1: 'neutral',
+     2: 'positive',
+ }
+
+ with open(Path(__file__).parent / 'sentiment_templates.txt') as f:
+     templates = [l.strip() for l in f.readlines()]
+
+
+ def format_example(example: dict) -> dict:
+     context = f"Instruction: {example['instruction']}\n"
+     if example.get("input"):
+         context += f"Input: {example['input']}\n"
+     context += "Answer: "
+     target = example["output"]
+     return {"context": context, "target": target}
+
+ def change_target(x):
+     if 'positive' in x or 'Positive' in x:
+         return 'positive'
+     elif 'negative' in x or 'Negative' in x:
+         return 'negative'
+     else:
+         return 'neutral'
+
+
+ def vote_output(x):
+     # Majority vote over the per-template outputs; ties fall back to neutral.
+     output_dict = {'positive': 0, 'negative': 0, 'neutral': 0}
+     for i in range(len(templates)):
+         pred = change_target(x[f'out_text_{i}'].lower())
+         output_dict[pred] += 1
+     if output_dict['positive'] > output_dict['negative']:
+         return 'positive'
+     elif output_dict['negative'] > output_dict['positive']:
+         return 'negative'
+     else:
+         return 'neutral'
+
+ def test_fpb(args, model, tokenizer, prompt_fun=None):
+     batch_size = args.batch_size
+     # instructions = load_dataset("financial_phrasebank", "sentences_50agree")
+     instructions = load_from_disk(Path(__file__).parent.parent / "data/financial_phrasebank-sentences_50agree/")
+     instructions = instructions["train"]
+     instructions = instructions.train_test_split(seed=42)['test']
+     instructions = instructions.to_pandas()
+     instructions.columns = ["input", "output"]
+     instructions["output"] = instructions["output"].apply(lambda x: dic[x])
+
+     if prompt_fun is None:
+         instructions["instruction"] = "What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}."
+     else:
+         instructions["instruction"] = instructions.apply(prompt_fun, axis=1)
+
+     instructions[["context", "target"]] = instructions.apply(format_example, axis=1, result_type="expand")
+
+     # print example
+     print(f"\n\nPrompt example:\n{instructions['context'][0]}\n\n")
+
+     context = instructions['context'].tolist()
+
+     total_steps = (instructions.shape[0] + batch_size - 1) // batch_size  # ceil division avoids an empty final batch
+     print(f"Total len: {len(context)}. Batchsize: {batch_size}. Total steps: {total_steps}")
+
+     out_text_list = []
+     for i in tqdm(range(total_steps)):
+         tmp_context = context[i * batch_size:(i + 1) * batch_size]
+         tokens = tokenizer(tmp_context, return_tensors='pt', padding=True, max_length=512, return_token_type_ids=False)
+         for k in tokens.keys():
+             tokens[k] = tokens[k].cuda()
+         res = model.generate(**tokens, max_length=512, eos_token_id=tokenizer.eos_token_id)
+         res_sentences = [tokenizer.decode(i, skip_special_tokens=True) for i in res]
+         # print(f'{i}: {res_sentences[0]}')
+         out_text = [o.split("Answer: ")[1] for o in res_sentences]
+         out_text_list += out_text
+         torch.cuda.empty_cache()
+
+     instructions["out_text"] = out_text_list
+     instructions["new_target"] = instructions["target"].apply(change_target)
+     instructions["new_out"] = instructions["out_text"].apply(change_target)
+
+     acc = accuracy_score(instructions["new_target"], instructions["new_out"])
+     f1_macro = f1_score(instructions["new_target"], instructions["new_out"], average="macro")
+     f1_micro = f1_score(instructions["new_target"], instructions["new_out"], average="micro")
+     f1_weighted = f1_score(instructions["new_target"], instructions["new_out"], average="weighted")
+
+     print(f"Acc: {acc}. F1 macro: {f1_macro}. F1 micro: {f1_micro}. F1 weighted (BloombergGPT): {f1_weighted}.")
+
+     return instructions
+
+
+ def test_fpb_mlt(args, model, tokenizer):
+     batch_size = args.batch_size
+     # dataset = load_dataset("financial_phrasebank", "sentences_50agree")
+     dataset = load_from_disk(Path(__file__).parent.parent / 'data/financial_phrasebank-sentences_50agree/')
+     dataset = dataset["train"]  # .select(range(300))
+     dataset = dataset.train_test_split(seed=42)['test']
+     dataset = dataset.to_pandas()
+     dataset.columns = ["input", "output"]
+     dataset["output"] = dataset["output"].apply(lambda x: dic[x])
+     dataset["text_type"] = dataset.apply(lambda x: 'news', axis=1)
+
+     dataset["output"] = dataset["output"].apply(change_target)
+     dataset = dataset[dataset["output"] != 'neutral']
+
+     out_texts_list = [[] for _ in range(len(templates))]
+
+     def collate_fn(batch):
+         inputs = tokenizer(
+             [f["context"] for f in batch], return_tensors='pt',
+             padding=True, max_length=args.max_length,
+             return_token_type_ids=False
+         )
+         return inputs
+
+     for i, template in enumerate(templates):
+         dataset = dataset[['input', 'output', "text_type"]]
+         dataset["instruction"] = dataset['text_type'].apply(lambda x: template.format(type=x) + "\nOptions: positive, negative")
+         # dataset["instruction"] = dataset['text_type'].apply(lambda x: template.format(type=x) + "\nOptions: negative, positive")
+         dataset[["context", "target"]] = dataset.apply(format_example, axis=1, result_type="expand")
+
+         dataloader = DataLoader(Dataset.from_pandas(dataset), batch_size=args.batch_size, collate_fn=collate_fn, shuffle=False)
+
+         log_interval = max(len(dataloader) // 5, 1)
+
+         for idx, inputs in enumerate(tqdm(dataloader)):
+             inputs = {key: value.to(model.device) for key, value in inputs.items()}
+             res = model.generate(**inputs, do_sample=False, max_length=args.max_length, eos_token_id=tokenizer.eos_token_id, max_new_tokens=10)
+             res_sentences = [tokenizer.decode(i, skip_special_tokens=True) for i in res]
+             tqdm.write(f'{idx}: {res_sentences[0]}')
+             # if (idx + 1) % log_interval == 0:
+             #     tqdm.write(f'{idx}: {res_sentences[0]}')
148
+ out_text = [o.split("Answer: ")[1] for o in res_sentences]
149
+ out_texts_list[i] += out_text
150
+ torch.cuda.empty_cache()
151
+
152
+ for i in range(len(templates)):
153
+ dataset[f"out_text_{i}"] = out_texts_list[i]
154
+ dataset[f"out_text_{i}"] = dataset[f"out_text_{i}"].apply(change_target)
155
+
156
+ dataset["new_out"] = dataset.apply(vote_output, axis=1, result_type="expand")
157
+ dataset.to_csv('tmp.csv')
158
+
159
+ for k in [f"out_text_{i}" for i in range(len(templates))] + ["new_out"]:
160
+
161
+ acc = accuracy_score(dataset["target"], dataset[k])
162
+ f1_macro = f1_score(dataset["target"], dataset[k], average="macro")
163
+ f1_micro = f1_score(dataset["target"], dataset[k], average="micro")
164
+ f1_weighted = f1_score(dataset["target"], dataset[k], average="weighted")
165
+
166
+ print(f"Acc: {acc}. F1 macro: {f1_macro}. F1 micro: {f1_micro}. F1 weighted (BloombergGPT): {f1_weighted}. ")
167
+
168
+ return dataset
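The multi-template path above stands or falls on two helpers: change_target normalizes a free-form generation to one of the three benchmark labels, and vote_output majority-votes the per-template predictions, with positive/negative ties falling back to neutral. A minimal self-contained sketch of that ensemble step (the toy row dict stands in for the real out_text_{i} DataFrame columns):

    def change_target(text):
        # Normalize a free-form generation to one of the three benchmark labels.
        t = text.lower()
        if 'positive' in t:
            return 'positive'
        if 'negative' in t:
            return 'negative'
        return 'neutral'

    row = {'out_text_0': 'Positive.', 'out_text_1': 'negative', 'out_text_2': 'The sentiment is positive'}
    votes = [change_target(row[f'out_text_{i}']) for i in range(3)]
    if votes.count('positive') > votes.count('negative'):
        result = 'positive'
    elif votes.count('negative') > votes.count('positive'):
        result = 'negative'
    else:
        result = 'neutral'
    print(result)  # -> positive (2 of 3 templates agree)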
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/headline.py ADDED
@@ -0,0 +1,84 @@
+ from sklearn.metrics import accuracy_score, f1_score, classification_report
+ from datasets import load_dataset, load_from_disk
+ from tqdm import tqdm
+ import datasets
+ import torch
+ from torch.utils.data import DataLoader
+ from functools import partial
+ from pathlib import Path
+ from fingpt.FinGPT_Benchmark.utils import *
+ 
+ import sys
+ sys.path.append('../')
+ 
+ 
+ 
+ def binary2multi(dataset):
+     pred, label = [], []
+     tmp_pred, tmp_label = [], []
+     for i, row in dataset.iterrows():
+         tmp_pred.append(row['pred'])
+         tmp_label.append(row['label'])
+         if (i + 1) % 9 == 0:  # the test set asks nine yes/no questions per headline
+             pred.append(tmp_pred)
+             label.append(tmp_label)
+             tmp_pred, tmp_label = [], []
+     return pred, label
+ 
+ 
+ def map_output(feature):
+     pred = 1 if 'yes' in feature['out_text'].lower() else 0
+     label = 1 if 'yes' in feature['output'].lower() else 0
+     return {'label': label, 'pred': pred}
+ 
+ 
+ def test_headline(args, model, tokenizer):
+ 
+     # dataset = load_from_disk('../data/fingpt-headline')['test']
+     dataset = load_from_disk(Path(__file__).parent.parent / 'data/fingpt-headline-instruct')['test']
+     dataset = dataset.map(partial(test_mapping, args), load_from_cache_file=False)
+ 
+     def collate_fn(batch):
+         inputs = tokenizer(
+             [f["prompt"] for f in batch], return_tensors='pt',
+             padding=True, max_length=args.max_length,
+             return_token_type_ids=False
+         )
+         return inputs
+ 
+     dataloader = DataLoader(dataset, batch_size=args.batch_size, collate_fn=collate_fn, shuffle=False)
+ 
+     out_text_list = []
+     log_interval = len(dataloader) // 5
+ 
+     for idx, inputs in enumerate(tqdm(dataloader)):
+         inputs = {key: value.to(model.device) for key, value in inputs.items()}
+         res = model.generate(**inputs, max_length=args.max_length, eos_token_id=tokenizer.eos_token_id)
+         res_sentences = [tokenizer.decode(i, skip_special_tokens=True) for i in res]
+         # a sample output is logged every log_interval batches below
+         if log_interval and (idx + 1) % log_interval == 0:  # guard avoids modulo-by-zero on tiny datasets
+             tqdm.write(f'{idx}: {res_sentences[0]}')
+         out_text = [o.split("Answer: ")[1] for o in res_sentences]
+         out_text_list += out_text
+         torch.cuda.empty_cache()
+ 
+     dataset = dataset.add_column("out_text", out_text_list)
+     dataset = dataset.map(map_output, load_from_cache_file=False)
+     dataset = dataset.to_pandas()
+ 
+     print(dataset)
+     dataset.to_csv('tmp.csv')
+ 
+     # binary
+     acc = accuracy_score(dataset["label"], dataset["pred"])
+     f1 = f1_score(dataset["label"], dataset["pred"], average="binary")
+ 
+     # multi-class
+     pred, label = binary2multi(dataset)
+ 
+     print(f"\n|| Acc: {acc} || F1 binary: {f1} ||\n")
+     print(classification_report(label, pred, digits=4, target_names=['price or not', 'price up', 'price stable',
+                                                                      'price down', 'price past', 'price future',
+                                                                      'event past', 'event future', 'asset comp']))
+ 
+     return dataset
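binary2multi above assumes the headline test set is serialized as nine consecutive yes/no questions per headline, so the flat per-question answers can be regrouped into one nine-way multilabel row each. A toy sketch of the regrouping (dummy 0/1 values):

    import numpy as np

    flat = [1, 0, 0, 1, 0, 0, 0, 0, 0,   # headline 1: answers to the nine questions
            0, 1, 0, 0, 0, 0, 0, 0, 1]   # headline 2
    grouped = np.array(flat).reshape(-1, 9)  # one multilabel row per headline
    print(grouped.shape)  # (2, 9), the shape sklearn's multilabel classification_report expects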
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/ner.py ADDED
@@ -0,0 +1,94 @@
+ from seqeval.metrics import classification_report
+ from datasets import load_dataset, load_from_disk
+ from tqdm import tqdm
+ import datasets
+ import torch
+ from torch.utils.data import DataLoader
+ from functools import partial
+ import re
+ import sys
+ import numpy as np
+ from fingpt.FinGPT_Benchmark.utils import *
+ from pathlib import Path
+ sys.path.append('../')
+ 
+ ent_dict = {
+     'PER': 'person',
+     'ORG': 'organization',
+     'LOC': 'location',
+ }
+ ent_dict_rev = {v: k for k, v in ent_dict.items()}
+ 
+ 
+ def cvt_text_to_pred(tokens, text):
+     # Convert "X is a person, Y is an organization" style answers back into BIO tags.
+     preds = ['O' for _ in range(len(tokens))]
+     for pred_txt in text.lower().strip('.').split(','):
+ 
+         pred_match = re.match(r'^(.*) is an? (.*)$', pred_txt)
+         if pred_match is not None:
+             entity, entity_type = pred_match.group(1).strip(), pred_match.group(2).strip()
+             entity_pred = ent_dict_rev.get(entity_type, 'O')
+             entity_tokens = entity.split()
+ 
+             n = len(entity_tokens)
+             for i in range(len(tokens) - n + 1):
+                 if tokens[i:i+n] == entity_tokens and preds[i:i+n] == ['O'] * n:
+                     preds[i:i+n] = ['B-' + entity_pred] + ['I-' + entity_pred] * (n-1)
+                     break
+         else:
+             print(pred_txt)
+ 
+     return preds
+ 
+ 
+ def map_output(feature):
+ 
+     tokens = feature['input'].lower().split()
+     label = cvt_text_to_pred(tokens, feature['output'])
+     pred = cvt_text_to_pred(tokens, feature['out_text'])
+ 
+     return {'label': label, 'pred': pred}
+ 
+ 
+ def test_ner(args, model, tokenizer):
+ 
+     dataset = load_from_disk(Path(__file__).parent.parent / 'data/fingpt-ner')['test']
+     dataset = dataset.map(partial(test_mapping, args), load_from_cache_file=False)
+ 
+     def collate_fn(batch):
+         inputs = tokenizer(
+             [f["prompt"] for f in batch], return_tensors='pt',
+             padding=True, max_length=args.max_length,
+             return_token_type_ids=False
+         )
+         return inputs
+ 
+     dataloader = DataLoader(dataset, batch_size=args.batch_size, collate_fn=collate_fn, shuffle=False)
+ 
+     out_text_list = []
+     log_interval = len(dataloader) // 5
+ 
+     for idx, inputs in enumerate(tqdm(dataloader)):
+         inputs = {key: value.to(model.device) for key, value in inputs.items()}
+         res = model.generate(**inputs, max_length=args.max_length, eos_token_id=tokenizer.eos_token_id)
+         res_sentences = [tokenizer.decode(i, skip_special_tokens=True) for i in res]
+         if log_interval and (idx + 1) % log_interval == 0:  # guard avoids modulo-by-zero on tiny datasets
+             tqdm.write(f'{idx}: {res_sentences[0]}')
+         out_text = [o.split("Answer: ")[1] for o in res_sentences]
+         out_text_list += out_text
+         torch.cuda.empty_cache()
+ 
+     dataset = dataset.add_column("out_text", out_text_list)
+     dataset = dataset.map(map_output, load_from_cache_file=False)
+     dataset = dataset.to_pandas()
+ 
+     print(dataset)
+     dataset.to_csv('tmp.csv')
+ 
+     label = [d.tolist() for d in dataset['label']]
+     pred = [d.tolist() for d in dataset['pred']]
+ 
+     print(classification_report(label, pred, digits=4))
+ 
+     return dataset
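cvt_text_to_pred is the pivot of this benchmark: it parses the model's natural-language answer back into token-level BIO tags so seqeval can score spans. A worked example of the round trip, using a trimmed re-implementation of the function above:

    import re

    ent_dict_rev = {'person': 'PER', 'organization': 'ORG', 'location': 'LOC'}

    def cvt_text_to_pred(tokens, text):
        # Trimmed mirror of the function above, for illustration only.
        preds = ['O'] * len(tokens)
        for pred_txt in text.lower().strip('.').split(','):
            m = re.match(r'^(.*) is an? (.*)$', pred_txt)
            if m is None:
                continue
            entity_tokens = m.group(1).strip().split()
            tag = ent_dict_rev.get(m.group(2).strip(), 'O')
            n = len(entity_tokens)
            for i in range(len(tokens) - n + 1):
                # Tag the first still-untagged occurrence of the entity span.
                if tokens[i:i+n] == entity_tokens and preds[i:i+n] == ['O'] * n:
                    preds[i:i+n] = ['B-' + tag] + ['I-' + tag] * (n - 1)
                    break
        return preds

    tokens = "bank agrees to lend to borrower".split()
    print(cvt_text_to_pred(tokens, "Bank is an organization, Borrower is a person."))
    # -> ['B-ORG', 'O', 'O', 'O', 'O', 'B-PER']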
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/nwgi.py ADDED
@@ -0,0 +1,86 @@
+ import warnings
+ warnings.filterwarnings("ignore")
+ 
+ from sklearn.metrics import accuracy_score, f1_score
+ from datasets import load_dataset, load_from_disk
+ from tqdm import tqdm
+ import datasets
+ import torch
+ from pathlib import Path
+ 
+ dic = {
+     'strong negative': "negative",
+     'moderately negative': "negative",
+     'mildly negative': "neutral",
+     'strong positive': "positive",
+     'moderately positive': "positive",
+     'mildly positive': 'neutral',
+     'neutral': 'neutral',
+ }
+ 
+ def format_example(example: dict) -> dict:
+     context = f"Instruction: {example['instruction']}\n"
+     if example.get("input"):
+         context += f"Input: {example['input']}\n"
+     context += "Answer: "
+     target = example["output"]
+     return {"context": context, "target": target}
+ 
+ def change_target(x):
+     if 'positive' in x or 'Positive' in x:
+         return 'positive'
+     elif 'negative' in x or 'Negative' in x:
+         return 'negative'
+     else:
+         return 'neutral'
+ 
+ def test_nwgi(args, model, tokenizer, prompt_fun=None):
+     batch_size = args.batch_size
+     # dataset = load_dataset('oliverwang15/news_with_gpt_instructions')
+     dataset = load_from_disk(Path(__file__).parent.parent / 'data/news_with_gpt_instructions/')
+     dataset = dataset['test'].to_pandas()  # NOTE: split selection added; load_from_disk returns a DatasetDict with no 'label' column to index. Assumes a 'test' split.
+     dataset['output'] = dataset['label'].apply(lambda x: dic[x])
+ 
+     if prompt_fun is None:
+         dataset["instruction"] = "What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}."
+         # dataset["instruction"] = "What is the sentiment of this news? Please choose an answer from {strong negative/moderately negative/mildly negative/neutral/mildly positive/moderately positive/strong positive}."
+     else:
+         dataset["instruction"] = dataset.apply(prompt_fun, axis=1)
+     dataset["input"] = dataset["news"]
+ 
+     dataset = dataset[['input', 'output', 'instruction']]
+     dataset[["context", "target"]] = dataset.apply(format_example, axis=1, result_type="expand")
+ 
+     # print example
+     print(f"\n\nPrompt example:\n{dataset['context'][0]}\n\n")
+ 
+     context = dataset['context'].tolist()
+ 
+     total_steps = (dataset.shape[0] + batch_size - 1) // batch_size  # ceiling division avoids an empty final batch
+     print(f"Total len: {len(context)}. Batchsize: {batch_size}. Total steps: {total_steps}")
+ 
+     out_text_list = []
+     for i in tqdm(range(total_steps)):
+         tmp_context = context[i * batch_size:(i + 1) * batch_size]
+         tokens = tokenizer(tmp_context, return_tensors='pt', padding=True, max_length=512, return_token_type_ids=False)
+         # tokens.pop('token_type_ids')
+         for k in tokens.keys():
+             tokens[k] = tokens[k].cuda()
+         res = model.generate(**tokens, max_length=512, eos_token_id=tokenizer.eos_token_id)
+         res_sentences = [tokenizer.decode(i, skip_special_tokens=True) for i in res]
+         out_text = [o.split("Answer: ")[1] for o in res_sentences]
+         out_text_list += out_text
+         torch.cuda.empty_cache()
+ 
+     dataset["out_text"] = out_text_list
+     dataset["new_target"] = dataset["target"].apply(change_target)
+     dataset["new_out"] = dataset["out_text"].apply(change_target)
+ 
+     acc = accuracy_score(dataset["new_target"], dataset["new_out"])
+     f1_macro = f1_score(dataset["new_target"], dataset["new_out"], average="macro")
+     f1_micro = f1_score(dataset["new_target"], dataset["new_out"], average="micro")
+     f1_weighted = f1_score(dataset["new_target"], dataset["new_out"], average="weighted")
+ 
+     print(f"Acc: {acc}. F1 macro: {f1_macro}. F1 micro: {f1_micro}. F1 weighted (BloombergGPT): {f1_weighted}. ")
+ 
+     return dataset
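The dic mapping at the top of nwgi.py deliberately coarsens the dataset's seven GPT-assigned intensity labels into the three classes shared by the other sentiment benchmarks, folding the 'mildly' grades into neutral so weak signals do not count as calls. A quick check of that collapse:

    seven_to_three = {
        'strong negative': 'negative', 'moderately negative': 'negative',
        'mildly negative': 'neutral', 'neutral': 'neutral', 'mildly positive': 'neutral',
        'moderately positive': 'positive', 'strong positive': 'positive',
    }
    print(seven_to_three['mildly negative'])  # -> neutral
    print(seven_to_three['strong positive'])  # -> positive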
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/sentiment_templates.txt ADDED
@@ -0,0 +1,5 @@
+ What is the sentiment of the input {type} from financial perspective?
+ Assign a sentiment category to this {type} related to finance.
+ Categorize the input {type}'s emotional tone into one of three groups.
+ Determine the sentiment expressed in the {type} from financial perspective.
+ Characterize the {type}'s sentiment using the following options.
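Each line above is one instruction paraphrase with a {type} slot; test_fpb_mlt fills the slot (always 'news' for FPB) and appends the answer options, producing five variants of the same question per example. For instance:

    template = "Assign a sentiment category to this {type} related to finance."
    instruction = template.format(type='news') + "\nOptions: positive, negative"
    print(instruction)
    # Assign a sentiment category to this news related to finance.
    # Options: positive, negative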
external/FinGPT/fingpt/FinGPT_Benchmark/benchmarks/tfns.py ADDED
@@ -0,0 +1,82 @@
+ import warnings
+ warnings.filterwarnings("ignore")
+ 
+ from sklearn.metrics import accuracy_score, f1_score
+ from datasets import load_dataset, load_from_disk
+ from tqdm import tqdm
+ import datasets
+ import torch
+ from pathlib import Path
+ 
+ dic = {
+     0: "negative",
+     1: 'positive',
+     2: 'neutral',
+ }
+ 
+ def format_example(example: dict) -> dict:
+     context = f"Instruction: {example['instruction']}\n"
+     if example.get("input"):
+         context += f"Input: {example['input']}\n"
+     context += "Answer: "
+     target = example["output"]
+     return {"context": context, "target": target}
+ 
+ def change_target(x):
+     if 'positive' in x or 'Positive' in x:
+         return 'positive'
+     elif 'negative' in x or 'Negative' in x:
+         return 'negative'
+     else:
+         return 'neutral'
+ 
+ def test_tfns(args, model, tokenizer, prompt_fun=None):
+     batch_size = args.batch_size
+     # dataset = load_dataset('zeroshot/twitter-financial-news-sentiment')
+     dataset = load_from_disk(Path(__file__).parent.parent / 'data/twitter-financial-news-sentiment')
+     dataset = dataset['validation']
+     dataset = dataset.to_pandas()
+     dataset['label'] = dataset['label'].apply(lambda x: dic[x])
+ 
+     if prompt_fun is None:
+         dataset["instruction"] = 'What is the sentiment of this tweet? Please choose an answer from {negative/neutral/positive}.'
+     else:
+         dataset["instruction"] = dataset.apply(prompt_fun, axis=1)
+ 
+     dataset.columns = ['input', 'output', 'instruction']
+     dataset[["context", "target"]] = dataset.apply(format_example, axis=1, result_type="expand")
+ 
+     # print example
+     print(f"\n\nPrompt example:\n{dataset['context'][0]}\n\n")
+ 
+     context = dataset['context'].tolist()
+ 
+     total_steps = (dataset.shape[0] + batch_size - 1) // batch_size  # ceiling division avoids an empty final batch
+     print(f"Total len: {len(context)}. Batchsize: {batch_size}. Total steps: {total_steps}")
+ 
+ 
+     out_text_list = []
+     for i in tqdm(range(total_steps)):
+         tmp_context = context[i * batch_size:(i + 1) * batch_size]
+         tokens = tokenizer(tmp_context, return_tensors='pt', padding=True, max_length=512, return_token_type_ids=False)
+         # tokens.pop('token_type_ids')
+         for k in tokens.keys():
+             tokens[k] = tokens[k].cuda()
+         res = model.generate(**tokens, max_length=512, eos_token_id=tokenizer.eos_token_id)
+         res_sentences = [tokenizer.decode(i, skip_special_tokens=True) for i in res]
+         out_text = [o.split("Answer: ")[1] for o in res_sentences]
+         out_text_list += out_text
+         torch.cuda.empty_cache()
+ 
+     dataset["out_text"] = out_text_list
+     dataset["new_target"] = dataset["target"].apply(change_target)
+     dataset["new_out"] = dataset["out_text"].apply(change_target)
+ 
+     acc = accuracy_score(dataset["new_target"], dataset["new_out"])
+     f1_macro = f1_score(dataset["new_target"], dataset["new_out"], average="macro")
+     f1_micro = f1_score(dataset["new_target"], dataset["new_out"], average="micro")
+     f1_weighted = f1_score(dataset["new_target"], dataset["new_out"], average="weighted")
+ 
+     print(f"Acc: {acc}. F1 macro: {f1_macro}. F1 micro: {f1_micro}. F1 weighted (BloombergGPT): {f1_weighted}. ")
+ 
+     return dataset
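The tokenize-generate-split loop above is repeated almost verbatim in fpb.py and nwgi.py; only the dataset wiring differs. A hedged refactor sketch that factors the shared loop into one helper (batch_generate is a name introduced here, not part of the repository):

    import torch
    from tqdm import tqdm

    def batch_generate(model, tokenizer, contexts, batch_size=8, max_length=512):
        """Generate answers for a list of prompts; return the text after 'Answer: '."""
        out_texts = []
        # Ceiling division so the final partial batch is included and never empty.
        total_steps = (len(contexts) + batch_size - 1) // batch_size
        for i in tqdm(range(total_steps)):
            batch = contexts[i * batch_size:(i + 1) * batch_size]
            tokens = tokenizer(batch, return_tensors='pt', padding=True,
                               return_token_type_ids=False).to(model.device)
            res = model.generate(**tokens, max_length=max_length,
                                 eos_token_id=tokenizer.eos_token_id)
            decoded = [tokenizer.decode(r, skip_special_tokens=True) for r in res]
            out_texts += [d.split("Answer: ")[1] for d in decoded]
            torch.cuda.empty_cache()
        return out_texts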
external/FinGPT/fingpt/FinGPT_Benchmark/config.json ADDED
@@ -0,0 +1,33 @@
+ {
+     "train_micro_batch_size_per_gpu": "auto",
+     "train_batch_size": "auto",
+     "gradient_accumulation_steps": "auto",
+     "optimizer": {
+         "type": "ZeroOneAdam",
+         "params": {
+             "lr": "auto",
+             "weight_decay": "auto",
+             "bias_correction": false,
+             "var_freeze_step": 1000,
+             "var_update_scaler": 16,
+             "local_step_scaler": 1000,
+             "local_step_clipper": 16,
+             "cuda_aware": true,
+             "comm_backend_name": "nccl"
+         }
+     },
+     "scheduler": {
+         "type": "WarmupLR",
+         "params": {
+             "warmup_min_lr": 0,
+             "warmup_max_lr": "auto",
+             "warmup_num_steps": "auto"
+         }
+     },
+     "fp16": {
+         "enabled": true
+     },
+     "zero_optimization": {
+         "stage": 0
+     }
+ }
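The "auto" placeholders in this config (and in config_hf.json and config_new.json below) are resolved by the Hugging Face Trainer's DeepSpeed integration from the matching TrainingArguments at launch time. A minimal sketch of that wiring, assuming the config file sits next to the training script (model and data setup omitted):

    from transformers import TrainingArguments

    # "auto" entries in the DeepSpeed config are filled from these values.
    args = TrainingArguments(
        output_dir="out",
        per_device_train_batch_size=4,   # -> train_micro_batch_size_per_gpu
        gradient_accumulation_steps=8,   # -> gradient_accumulation_steps
        learning_rate=1e-4,              # -> optimizer lr / warmup_max_lr
        warmup_steps=100,                # -> warmup_num_steps
        fp16=True,                       # must agree with the config's fp16 block
        deepspeed="config.json",         # path to the file above
    )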
external/FinGPT/fingpt/FinGPT_Benchmark/config_hf.json ADDED
@@ -0,0 +1,11 @@
+ {
+     "train_micro_batch_size_per_gpu": "auto",
+     "train_batch_size": "auto",
+     "gradient_accumulation_steps": "auto",
+     "fp16": {
+         "enabled": true
+     },
+     "zero_optimization": {
+         "stage": 0
+     }
+ }
external/FinGPT/fingpt/FinGPT_Benchmark/config_new.json ADDED
@@ -0,0 +1,35 @@
+ {
+     "train_micro_batch_size_per_gpu": "auto",
+     "train_batch_size": "auto",
+     "gradient_accumulation_steps": "auto",
+     "optimizer": {
+         "type": "AdamW",
+         "params": {
+             "lr": "auto",
+             "weight_decay": "auto",
+             "betas": "auto",
+             "eps": "auto"
+         }
+     },
+     "scheduler": {
+         "type": "WarmupDecayLR",
+         "params": {
+             "last_batch_iteration": -1,
+             "total_num_steps": "auto",
+             "warmup_min_lr": "auto",
+             "warmup_max_lr": "auto",
+             "warmup_num_steps": "auto"
+         }
+     },
+     "fp16": {
+         "enabled": true,
+         "loss_scale": 0,
+         "loss_scale_window": 1000,
+         "initial_scale_power": 16,
+         "hysteresis": 2,
+         "min_loss_scale": 1
+     },
+     "zero_optimization": {
+         "stage": 0
+     }
+ }
external/FinGPT/fingpt/FinGPT_Benchmark/data/__init__.py ADDED
File without changes
external/FinGPT/fingpt/FinGPT_Benchmark/data/download.py ADDED
@@ -0,0 +1,41 @@
+ import datasets
+ from pathlib import Path
+ import argparse
+ 
+ DATASETS = [
+     # source, destination
+     (('pauri32/fiqa-2018', None), 'fiqa-2018'),
+     (('FinGPT/fingpt-finred', None), 'fingpt-finred'),
+     (('zeroshot/twitter-financial-news-sentiment', None), 'twitter-financial-news-sentiment'),
+     (('oliverwang15/news_with_gpt_instructions', None), 'news_with_gpt_instructions'),
+     (('financial_phrasebank', 'sentences_50agree'), 'financial_phrasebank-sentences_50agree'),
+     (('FinGPT/fingpt-fiqa_qa', None), 'fingpt-fiqa_qa'),
+     (('FinGPT/fingpt-headline-cls', None), 'fingpt-headline-cls'),
+     (('FinGPT/fingpt-finred', None), 'fingpt-finred'),  # duplicate of the entry above; harmless, the cache check skips it
+     (('FinGPT/fingpt-convfinqa', None), 'fingpt-convfinqa'),
+     (('FinGPT/fingpt-finred-cls', None), 'fingpt-finred-cls'),
+     (('FinGPT/fingpt-ner', None), 'fingpt-ner'),
+     (('FinGPT/fingpt-headline', None), 'fingpt-headline-instruct'),
+     (('FinGPT/fingpt-finred-re', None), 'fingpt-finred-re'),
+     (('FinGPT/fingpt-ner-cls', None), 'fingpt-ner-cls'),
+     (('FinGPT/fingpt-fineval', None), 'fingpt-fineval'),
+     (('FinGPT/fingpt-sentiment-cls', None), 'fingpt-sentiment-cls'),
+ ]
+ 
+ def download(no_cache: bool = False):
+     """Downloads all datasets to where the FinGPT library is located."""
+     data_dir = Path(__file__).parent
+ 
+     for src, dest in DATASETS:
+         if Path(data_dir / dest).is_dir() and not no_cache:
+             print(f"Dataset found at {data_dir / dest}, skipping")
+             continue
+         dataset = datasets.load_dataset(*src)
+         dataset.save_to_disk(data_dir / dest)
+ 
+ if __name__ == "__main__":
+     parser = argparse.ArgumentParser()
+     parser.add_argument("--no_cache", action="store_true", help="Redownload all datasets even if a cached copy exists")  # was type=str, which made any value (even "False") truthy
+ 
+     args = parser.parse_args()
+     download(no_cache=args.no_cache)
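With the store_true fix above the flag behaves as expected; a usage sketch, assuming FinGPT_Benchmark is the working directory:

    # Download any missing benchmark datasets:
    #   python data/download.py
    # Force a fresh download of everything:
    #   python data/download.py --no_cache
    from data.download import download  # data/__init__.py makes this importable
    download(no_cache=False)            # skips datasets already saved to disk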
external/FinGPT/fingpt/FinGPT_Benchmark/data/prepare_data.ipynb ADDED
The diff for this file is too large to render. See raw diff
 
external/FinGPT/fingpt/FinGPT_Benchmark/demo.ipynb ADDED
@@ -0,0 +1,715 @@
1
+ {
2
+ "cells": [
3
+ {
4
+ "cell_type": "markdown",
5
+ "metadata": {},
6
+ "source": [
7
+ "# Read before you start:\n",
8
+ "\n",
9
+ "This notebook gives a test demo for all the LLMs we trained during phase2: Multi-Task Instruction Tuning.\n",
10
+ "\n",
11
+ "- LLMs: Llama2, Falcon, BLOOM, ChatGLM2, Qwen, MPT\n",
12
+ "- Tasks: Sentiment Analysis, Headline Classification, Named Entity Extraction, Relation Extraction\n",
13
+ "\n",
14
+ "All the models & instruction data samples used are also available in our huggingface organization. [https://huggingface.co/FinGPT]\n",
15
+ "\n",
16
+ "Models trained in phase1&3 are not provided, as MT-models cover most of their capacity. Feel free to train your own models based on the tasks you want.\n",
17
+ "\n",
18
+ "Due to the limited diversity of the financial tasks and datasets we used, models might not response correctly to out-of-scope instructions. We'll delve into the generalization ability more in our future works."
19
+ ]
20
+ },
21
+ {
22
+ "cell_type": "code",
23
+ "execution_count": 1,
24
+ "metadata": {},
25
+ "outputs": [],
26
+ "source": [
27
+ "# First choose to load data/model from huggingface or local space\n",
28
+ "\n",
29
+ "FROM_REMOTE = False"
30
+ ]
31
+ },
32
+ {
33
+ "cell_type": "code",
34
+ "execution_count": 2,
35
+ "metadata": {},
36
+ "outputs": [
37
+ {
38
+ "name": "stdout",
39
+ "output_type": "stream",
40
+ "text": [
41
+ "[2023-10-15 20:44:54,084] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect)\n"
42
+ ]
43
+ }
44
+ ],
45
+ "source": [
46
+ "from transformers import AutoTokenizer, AutoModelForCausalLM\n",
47
+ "from peft import PeftModel\n",
48
+ "from utils import *"
49
+ ]
50
+ },
51
+ {
52
+ "cell_type": "code",
53
+ "execution_count": 3,
54
+ "metadata": {},
55
+ "outputs": [],
56
+ "source": [
57
+ "import logging\n",
58
+ "# Suppress Warnings during inference\n",
59
+ "logging.getLogger(\"transformers\").setLevel(logging.ERROR)"
60
+ ]
61
+ },
62
+ {
63
+ "cell_type": "code",
64
+ "execution_count": 4,
65
+ "metadata": {},
66
+ "outputs": [],
67
+ "source": [
68
+ "demo_tasks = [\n",
69
+ " 'Financial Sentiment Analysis',\n",
70
+ " 'Financial Relation Extraction',\n",
71
+ " 'Financial Headline Classification',\n",
72
+ " 'Financial Named Entity Recognition',\n",
73
+ "]\n",
74
+ "demo_inputs = [\n",
75
+ " \"Glaxo's ViiV Healthcare Signs China Manufacturing Deal With Desano\",\n",
76
+ " \"Wednesday, July 8, 2015 10:30AM IST (5:00AM GMT) Rimini Street Comment on Oracle Litigation Las Vegas, United States Rimini Street, Inc., the leading independent provider of enterprise software support for SAP AG’s (NYSE:SAP) Business Suite and BusinessObjects software and Oracle Corporation’s (NYSE:ORCL) Siebel , PeopleSoft , JD Edwards , E-Business Suite , Oracle Database , Hyperion and Oracle Retail software, today issued a statement on the Oracle litigation.\",\n",
77
+ " 'april gold down 20 cents to settle at $1,116.10/oz',\n",
78
+ " 'Subject to the terms and conditions of this Agreement , Bank agrees to lend to Borrower , from time to time prior to the Commitment Termination Date , equipment advances ( each an \" Equipment Advance \" and collectively the \" Equipment Advances \").',\n",
79
+ "]\n",
80
+ "demo_instructions = [\n",
81
+ " 'What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}.',\n",
82
+ " 'Given phrases that describe the relationship between two words/phrases as options, extract the word/phrase pair and the corresponding lexical relationship between them from the input text. The output format should be \"relation1: word1, word2; relation2: word3, word4\". Options: product/material produced, manufacturer, distributed by, industry, position held, original broadcaster, owned by, founded by, distribution format, headquarters location, stock exchange, currency, parent organization, chief executive officer, director/manager, owner of, operator, member of, employer, chairperson, platform, subsidiary, legal form, publisher, developer, brand, business division, location of formation, creator.',\n",
83
+ " 'Does the news headline talk about price in the past? Please choose an answer from {Yes/No}.',\n",
84
+ " 'Please extract entities and their types from the input sentence, entity types should be chosen from {person/organization/location}.',\n",
85
+ "]"
86
+ ]
87
+ },
88
+ {
89
+ "cell_type": "code",
90
+ "execution_count": 5,
91
+ "metadata": {},
92
+ "outputs": [],
93
+ "source": [
94
+ "def load_model(base_model, peft_model, from_remote=False):\n",
95
+ " \n",
96
+ " model_name = parse_model_name(base_model, from_remote)\n",
97
+ "\n",
98
+ " model = AutoModelForCausalLM.from_pretrained(\n",
99
+ " model_name, trust_remote_code=True, \n",
100
+ " device_map=\"auto\",\n",
101
+ " )\n",
102
+ " model.model_parallel = True\n",
103
+ "\n",
104
+ " tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)\n",
105
+ " \n",
106
+ " tokenizer.padding_side = \"left\"\n",
107
+ " if base_model == 'qwen':\n",
108
+ " tokenizer.eos_token_id = tokenizer.convert_tokens_to_ids('<|endoftext|>')\n",
109
+ " tokenizer.pad_token_id = tokenizer.convert_tokens_to_ids('<|extra_0|>')\n",
110
+ " if not tokenizer.pad_token or tokenizer.pad_token_id == tokenizer.eos_token_id:\n",
111
+ " tokenizer.add_special_tokens({'pad_token': '[PAD]'})\n",
112
+ " model.resize_token_embeddings(len(tokenizer))\n",
113
+ " \n",
114
+ " model = PeftModel.from_pretrained(model, peft_model)\n",
115
+ " model = model.eval()\n",
116
+ " return model, tokenizer\n",
117
+ "\n",
118
+ "\n",
119
+ "def test_demo(model, tokenizer):\n",
120
+ "\n",
121
+ " for task_name, input, instruction in zip(demo_tasks, demo_inputs, demo_instructions):\n",
122
+ " prompt = 'Instruction: {instruction}\\nInput: {input}\\nAnswer: '.format(\n",
123
+ " input=input, \n",
124
+ " instruction=instruction\n",
125
+ " )\n",
126
+ " inputs = tokenizer(\n",
127
+ " prompt, return_tensors='pt',\n",
128
+ " padding=True, max_length=512,\n",
129
+ " return_token_type_ids=False\n",
130
+ " )\n",
131
+ " inputs = {key: value.to(model.device) for key, value in inputs.items()}\n",
132
+ " res = model.generate(\n",
133
+ " **inputs, max_length=512, do_sample=False,\n",
134
+ " eos_token_id=tokenizer.eos_token_id\n",
135
+ " )\n",
136
+ " output = tokenizer.decode(res[0], skip_special_tokens=True)\n",
137
+ " print(f\"\\n==== {task_name} ====\\n\")\n",
138
+ " print(output)\n",
139
+ " "
140
+ ]
141
+ },
142
+ {
143
+ "cell_type": "markdown",
144
+ "metadata": {},
145
+ "source": [
146
+ "# Llama2-7B"
147
+ ]
148
+ },
149
+ {
150
+ "cell_type": "code",
151
+ "execution_count": 6,
152
+ "metadata": {},
153
+ "outputs": [
154
+ {
155
+ "data": {
156
+ "application/json": {
157
+ "ascii": false,
158
+ "bar_format": null,
159
+ "colour": null,
160
+ "elapsed": 0.006228446960449219,
161
+ "initial": 0,
162
+ "n": 0,
163
+ "ncols": null,
164
+ "nrows": null,
165
+ "postfix": null,
166
+ "prefix": "Loading checkpoint shards",
167
+ "rate": null,
168
+ "total": 2,
169
+ "unit": "it",
170
+ "unit_divisor": 1000,
171
+ "unit_scale": false
172
+ },
173
+ "application/vnd.jupyter.widget-view+json": {
174
+ "model_id": "0d58aff745fb486780792c86384fe702",
175
+ "version_major": 2,
176
+ "version_minor": 0
177
+ },
178
+ "text/plain": [
179
+ "Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s]"
180
+ ]
181
+ },
182
+ "metadata": {},
183
+ "output_type": "display_data"
184
+ },
185
+ {
186
+ "name": "stderr",
187
+ "output_type": "stream",
188
+ "text": [
189
+ "Using pad_token, but it is not set yet.\n",
190
+ "/root/.conda/envs/torch2/lib/python3.9/site-packages/transformers/tokenization_utils_base.py:2436: UserWarning: `max_length` is ignored when `padding`=`True` and there is no truncation strategy. To pad to max length, use `padding='max_length'`.\n",
191
+ " warnings.warn(\n",
192
+ "/root/.conda/envs/torch2/lib/python3.9/site-packages/transformers/generation/configuration_utils.py:362: UserWarning: `do_sample` is set to `False`. However, `temperature` is set to `0.6` -- this flag is only used in sample-based generation modes. You should set `do_sample=True` or unset `temperature`.\n",
193
+ " warnings.warn(\n",
194
+ "/root/.conda/envs/torch2/lib/python3.9/site-packages/transformers/generation/configuration_utils.py:367: UserWarning: `do_sample` is set to `False`. However, `top_p` is set to `0.9` -- this flag is only used in sample-based generation modes. You should set `do_sample=True` or unset `top_p`.\n",
195
+ " warnings.warn(\n"
196
+ ]
197
+ },
198
+ {
199
+ "name": "stdout",
200
+ "output_type": "stream",
201
+ "text": [
202
+ "\n",
203
+ "==== Financial Sentiment Analysis ====\n",
204
+ "\n",
205
+ "Instruction: What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}.\n",
206
+ "Input: Glaxo's ViiV Healthcare Signs China Manufacturing Deal With Desano\n",
207
+ "Answer: positive\n",
208
+ "\n",
209
+ "==== Financial Relation Extraction ====\n",
210
+ "\n",
211
+ "Instruction: Given phrases that describe the relationship between two words/phrases as options, extract the word/phrase pair and the corresponding lexical relationship between them from the input text. The output format should be \"relation1: word1, word2; relation2: word3, word4\". Options: product/material produced, manufacturer, distributed by, industry, position held, original broadcaster, owned by, founded by, distribution format, headquarters location, stock exchange, currency, parent organization, chief executive officer, director/manager, owner of, operator, member of, employer, chairperson, platform, subsidiary, legal form, publisher, developer, brand, business division, location of formation, creator.\n",
212
+ "Input: Wednesday, July 8, 2015 10:30AM IST (5:00AM GMT) Rimini Street Comment on Oracle Litigation Las Vegas, United States Rimini Street, Inc., the leading independent provider of enterprise software support for SAP AG’s (NYSE:SAP) Business Suite and BusinessObjects software and Oracle Corporation’s (NYSE:ORCL) Siebel , PeopleSoft , JD Edwards , E-Business Suite , Oracle Database , Hyperion and Oracle Retail software, today issued a statement on the Oracle litigation.\n",
213
+ "Answer: product_or_material_produced: PeopleSoft, software; parent_organization: Siebel, Oracle Corporation; industry: Oracle Corporation, software; product_or_material_produced: Oracle Corporation, software; product_or_material_produced: Oracle Corporation, software\n",
214
+ "\n",
215
+ "==== Financial Headline Classification ====\n",
216
+ "\n",
217
+ "Instruction: Does the news headline talk about price in the past? Please choose an answer from {Yes/No}.\n",
218
+ "Input: april gold down 20 cents to settle at $1,116.10/oz\n",
219
+ "Answer: Yes\n",
220
+ "\n",
221
+ "==== Financial Named Entity Recognition ====\n",
222
+ "\n",
223
+ "Instruction: Please extract entities and their types from the input sentence, entity types should be chosen from {person/organization/location}.\n",
224
+ "Input: Subject to the terms and conditions of this Agreement , Bank agrees to lend to Borrower , from time to time prior to the Commitment Termination Date , equipment advances ( each an \" Equipment Advance \" and collectively the \" Equipment Advances \").\n",
225
+ "Answer: Bank is an organization, Borrower is a person.\n"
226
+ ]
227
+ }
228
+ ],
229
+ "source": [
230
+ "base_model = 'llama2'\n",
231
+ "peft_model = 'FinGPT/fingpt-mt_llama2-7b_lora' if FROM_REMOTE else 'finetuned_models/MT-llama2-linear_202309241345'\n",
232
+ "\n",
233
+ "model, tokenizer = load_model(base_model, peft_model, FROM_REMOTE)\n",
234
+ "test_demo(model, tokenizer)"
235
+ ]
236
+ },
237
+ {
238
+ "cell_type": "markdown",
239
+ "metadata": {},
240
+ "source": [
241
+ "# Qwen-7B"
242
+ ]
243
+ },
244
+ {
245
+ "cell_type": "code",
246
+ "execution_count": 7,
247
+ "metadata": {},
248
+ "outputs": [
249
+ {
250
+ "name": "stderr",
251
+ "output_type": "stream",
252
+ "text": [
253
+ "The model is automatically converting to bf16 for faster inference. If you want to disable the automatic precision, please manually add bf16/fp16/fp32=True to \"AutoModelForCausalLM.from_pretrained\".\n",
254
+ "Try importing flash-attention for faster inference...\n",
255
+ "Warning: import flash_attn rotary fail, please install FlashAttention rotary to get higher efficiency https://github.com/Dao-AILab/flash-attention/tree/main/csrc/rotary\n",
256
+ "Warning: import flash_attn rms_norm fail, please install FlashAttention layer_norm to get higher efficiency https://github.com/Dao-AILab/flash-attention/tree/main/csrc/layer_norm\n",
257
+ "Warning: import flash_attn fail, please install FlashAttention to get higher efficiency https://github.com/Dao-AILab/flash-attention\n"
258
+ ]
259
+ },
260
+ {
261
+ "data": {
262
+ "application/json": {
263
+ "ascii": false,
264
+ "bar_format": null,
265
+ "colour": null,
266
+ "elapsed": 0.004647493362426758,
267
+ "initial": 0,
268
+ "n": 0,
269
+ "ncols": null,
270
+ "nrows": null,
271
+ "postfix": null,
272
+ "prefix": "Loading checkpoint shards",
273
+ "rate": null,
274
+ "total": 8,
275
+ "unit": "it",
276
+ "unit_divisor": 1000,
277
+ "unit_scale": false
278
+ },
279
+ "application/vnd.jupyter.widget-view+json": {
280
+ "model_id": "e1978e69ea784778acd1813cc0647c3e",
281
+ "version_major": 2,
282
+ "version_minor": 0
283
+ },
284
+ "text/plain": [
285
+ "Loading checkpoint shards: 0%| | 0/8 [00:00<?, ?it/s]"
286
+ ]
287
+ },
288
+ "metadata": {},
289
+ "output_type": "display_data"
290
+ },
291
+ {
292
+ "name": "stderr",
293
+ "output_type": "stream",
294
+ "text": [
295
+ "/root/.conda/envs/torch2/lib/python3.9/site-packages/transformers/generation/configuration_utils.py:367: UserWarning: `do_sample` is set to `False`. However, `top_p` is set to `0.8` -- this flag is only used in sample-based generation modes. You should set `do_sample=True` or unset `top_p`.\n",
296
+ " warnings.warn(\n",
297
+ "/root/.conda/envs/torch2/lib/python3.9/site-packages/transformers/generation/configuration_utils.py:377: UserWarning: `do_sample` is set to `False`. However, `top_k` is set to `0` -- this flag is only used in sample-based generation modes. You should set `do_sample=True` or unset `top_k`.\n",
298
+ " warnings.warn(\n"
299
+ ]
300
+ },
301
+ {
302
+ "name": "stdout",
303
+ "output_type": "stream",
304
+ "text": [
305
+ "\n",
306
+ "==== Financial Sentiment Analysis ====\n",
307
+ "\n",
308
+ "Instruction: What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}.\n",
309
+ "Input: Glaxo's ViiV Healthcare Signs China Manufacturing Deal With Desano\n",
310
+ "Answer: positive\n",
311
+ "\n",
312
+ "==== Financial Relation Extraction ====\n",
313
+ "\n",
314
+ "Instruction: Given phrases that describe the relationship between two words/phrases as options, extract the word/phrase pair and the corresponding lexical relationship between them from the input text. The output format should be \"relation1: word1, word2; relation2: word3, word4\". Options: product/material produced, manufacturer, distributed by, industry, position held, original broadcaster, owned by, founded by, distribution format, headquarters location, stock exchange, currency, parent organization, chief executive officer, director/manager, owner of, operator, member of, employer, chairperson, platform, subsidiary, legal form, publisher, developer, brand, business division, location of formation, creator.\n",
315
+ "Input: Wednesday, July 8, 2015 10:30AM IST (5:00AM GMT) Rimini Street Comment on Oracle Litigation Las Vegas, United States Rimini Street, Inc., the leading independent provider of enterprise software support for SAP AG’s (NYSE:SAP) Business Suite and BusinessObjects software and Oracle Corporation’s (NYSE:ORCL) Siebel , PeopleSoft , JD Edwards , E-Business Suite , Oracle Database , Hyperion and Oracle Retail software, today issued a statement on the Oracle litigation.\n",
316
+ "Answer: subsidiary: PeopleSoft, JD Edwards\n",
317
+ "\n",
318
+ "==== Financial Headline Classification ====\n",
319
+ "\n",
320
+ "Instruction: Does the news headline talk about price in the past? Please choose an answer from {Yes/No}.\n",
321
+ "Input: april gold down 20 cents to settle at $1,116.10/oz\n",
322
+ "Answer: Yes\n",
323
+ "\n",
324
+ "==== Financial Named Entity Recognition ====\n",
325
+ "\n",
326
+ "Instruction: Please extract entities and their types from the input sentence, entity types should be chosen from {person/organization/location}.\n",
327
+ "Input: Subject to the terms and conditions of this Agreement , Bank agrees to lend to Borrower , from time to time prior to the Commitment Termination Date , equipment advances ( each an \" Equipment Advance \" and collectively the \" Equipment Advances \").\n",
328
+ "Answer: Bank is an organization, Borrower is a person.\n"
329
+ ]
330
+ }
331
+ ],
332
+ "source": [
333
+ "base_model = 'qwen'\n",
334
+ "peft_model = 'FinGPT/fingpt-mt_qwen-7b_lora' if FROM_REMOTE else 'finetuned_models/MT-qwen-linear_202309221011'\n",
335
+ "\n",
336
+ "model, tokenizer = load_model(base_model, peft_model, FROM_REMOTE)\n",
337
+ "test_demo(model, tokenizer)"
338
+ ]
339
+ },
340
+ {
341
+ "cell_type": "markdown",
342
+ "metadata": {},
343
+ "source": [
344
+ "# Falcon-7B"
345
+ ]
346
+ },
347
+ {
348
+ "cell_type": "code",
349
+ "execution_count": 8,
350
+ "metadata": {},
351
+ "outputs": [
352
+ {
353
+ "data": {
354
+ "application/json": {
355
+ "ascii": false,
356
+ "bar_format": null,
357
+ "colour": null,
358
+ "elapsed": 0.004422426223754883,
359
+ "initial": 0,
360
+ "n": 0,
361
+ "ncols": null,
362
+ "nrows": null,
363
+ "postfix": null,
364
+ "prefix": "Loading checkpoint shards",
365
+ "rate": null,
366
+ "total": 2,
367
+ "unit": "it",
368
+ "unit_divisor": 1000,
369
+ "unit_scale": false
370
+ },
371
+ "application/vnd.jupyter.widget-view+json": {
372
+ "model_id": "e12fadfbaa6048538bbeef26ed563b28",
373
+ "version_major": 2,
374
+ "version_minor": 0
375
+ },
376
+ "text/plain": [
377
+ "Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s]"
378
+ ]
379
+ },
380
+ "metadata": {},
381
+ "output_type": "display_data"
382
+ },
383
+ {
384
+ "name": "stderr",
385
+ "output_type": "stream",
386
+ "text": [
387
+ "Using pad_token, but it is not set yet.\n",
388
+ "/root/.conda/envs/torch2/lib/python3.9/site-packages/transformers/generation/utils.py:1411: UserWarning: You have modified the pretrained model configuration to control generation. This is a deprecated strategy to control generation and will be removed soon, in a future version. Please use a generation configuration file (see https://huggingface.co/docs/transformers/main_classes/text_generation )\n",
389
+ " warnings.warn(\n"
390
+ ]
391
+ },
392
+ {
393
+ "name": "stdout",
394
+ "output_type": "stream",
395
+ "text": [
396
+ "\n",
397
+ "==== Financial Sentiment Analysis ====\n",
398
+ "\n",
399
+ "Instruction: What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}.\n",
400
+ "Input: Glaxo's ViiV Healthcare Signs China Manufacturing Deal With Desano\n",
401
+ "Answer: positive\n",
402
+ "\n",
403
+ "==== Financial Relation Extraction ====\n",
404
+ "\n",
405
+ "Instruction: Given phrases that describe the relationship between two words/phrases as options, extract the word/phrase pair and the corresponding lexical relationship between them from the input text. The output format should be \"relation1: word1, word2; relation2: word3, word4\". Options: product/material produced, manufacturer, distributed by, industry, position held, original broadcaster, owned by, founded by, distribution format, headquarters location, stock exchange, currency, parent organization, chief executive officer, director/manager, owner of, operator, member of, employer, chairperson, platform, subsidiary, legal form, publisher, developer, brand, business division, location of formation, creator.\n",
406
+ "Input: Wednesday, July 8, 2015 10:30AM IST (5:00AM GMT) Rimini Street Comment on Oracle Litigation Las Vegas, United States Rimini Street, Inc., the leading independent provider of enterprise software support for SAP AG’s (NYSE:SAP) Business Suite and BusinessObjects software and Oracle Corporation’s (NYSE:ORCL) Siebel, PeopleSoft, JD Edwards, E-Business Suite, Oracle Database, Hyperion and Oracle Retail software, today issued a statement on the Oracle litigation.\n",
407
+ "Answer: product_or_material_produced: PeopleSoft, Oracle Database\n",
408
+ "\n",
409
+ "==== Financial Headline Classification ====\n",
410
+ "\n",
411
+ "Instruction: Does the news headline talk about price in the past? Please choose an answer from {Yes/No}.\n",
412
+ "Input: april gold down 20 cents to settle at $1,116.10/oz\n",
413
+ "Answer: Yes\n",
414
+ "\n",
415
+ "==== Financial Named Entity Recognition ====\n",
416
+ "\n",
417
+ "Instruction: Please extract entities and their types from the input sentence, entity types should be chosen from {person/organization/location}.\n",
418
+ "Input: Subject to the terms and conditions of this Agreement, Bank agrees to lend to Borrower, from time to time prior to the Commitment Termination Date, equipment advances ( each an \" Equipment Advance \" and collectively the \" Equipment Advances \").\n",
419
+ "Answer: Bank is an organization, Borrower is a person.\n"
420
+ ]
421
+ }
422
+ ],
423
+ "source": [
424
+ "base_model = 'falcon'\n",
425
+ "peft_model = 'FinGPT/fingpt-mt_falcon-7b_lora' if FROM_REMOTE else 'finetuned_models/MT-falcon-linear_202309210126'\n",
426
+ "\n",
427
+ "model, tokenizer = load_model(base_model, peft_model, FROM_REMOTE)\n",
428
+ "test_demo(model, tokenizer)"
429
+ ]
430
+ },
431
+ {
432
+ "cell_type": "markdown",
433
+ "metadata": {},
434
+ "source": [
435
+ "# ChatGLM2-6B"
436
+ ]
437
+ },
438
+ {
439
+ "cell_type": "code",
440
+ "execution_count": 9,
441
+ "metadata": {},
442
+ "outputs": [
443
+ {
444
+ "data": {
445
+ "application/json": {
446
+ "ascii": false,
447
+ "bar_format": null,
448
+ "colour": null,
449
+ "elapsed": 0.004460573196411133,
450
+ "initial": 0,
451
+ "n": 0,
452
+ "ncols": null,
453
+ "nrows": null,
454
+ "postfix": null,
455
+ "prefix": "Loading checkpoint shards",
456
+ "rate": null,
457
+ "total": 7,
458
+ "unit": "it",
459
+ "unit_divisor": 1000,
460
+ "unit_scale": false
461
+ },
462
+ "application/vnd.jupyter.widget-view+json": {
463
+ "model_id": "8bddd025a6514946b5f07f55e9c38f58",
464
+ "version_major": 2,
465
+ "version_minor": 0
466
+ },
467
+ "text/plain": [
468
+ "Loading checkpoint shards: 0%| | 0/7 [00:00<?, ?it/s]"
469
+ ]
470
+ },
471
+ "metadata": {},
472
+ "output_type": "display_data"
473
+ },
474
+ {
475
+ "name": "stdout",
476
+ "output_type": "stream",
477
+ "text": [
478
+ "\n",
479
+ "==== Financial Sentiment Analysis ====\n",
480
+ "\n",
481
+ "Instruction: What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}.\n",
482
+ "Input: Glaxo's ViiV Healthcare Signs China Manufacturing Deal With Desano\n",
483
+ "Answer: positive\n",
484
+ "\n",
485
+ "==== Financial Relation Extraction ====\n",
486
+ "\n",
487
+ "Instruction: Given phrases that describe the relationship between two words/phrases as options, extract the word/phrase pair and the corresponding lexical relationship between them from the input text. The output format should be \"relation1: word1, word2; relation2: word3, word4\". Options: product/material produced, manufacturer, distributed by, industry, position held, original broadcaster, owned by, founded by, distribution format, headquarters location, stock exchange, currency, parent organization, chief executive officer, director/manager, owner of, operator, member of, employer, chairperson, platform, subsidiary, legal form, publisher, developer, brand, business division, location of formation, creator.\n",
488
+ "Input: Wednesday, July 8, 2015 10:30AM IST (5:00AM GMT) Rimini Street Comment on Oracle Litigation Las Vegas, United States Rimini Street, Inc., the leading independent provider of enterprise software support for SAP AG’s (NYSE:SAP) Business Suite and BusinessObjects software and Oracle Corporation’s (NYSE:ORCL) Siebel , PeopleSoft , JD Edwards , E-Business Suite , Oracle Database , Hyperion and Oracle Retail software, today issued a statement on the Oracle litigation.\n",
489
+ "Answer: product_or_material_produced: Oracle, Oracle Database; developer: Oracle, Oracle; product_or_material_produced: Oracle, Oracle Database\n",
490
+ "\n",
491
+ "==== Financial Headline Classification ====\n",
492
+ "\n",
493
+ "Instruction: Does the news headline talk about price in the past? Please choose an answer from {Yes/No}.\n",
494
+ "Input: april gold down 20 cents to settle at $1,116.10/oz\n",
495
+ "Answer: Yes\n",
496
+ "\n",
497
+ "==== Financial Named Entity Recognition ====\n",
498
+ "\n",
499
+ "Instruction: Please extract entities and their types from the input sentence, entity types should be chosen from {person/organization/location}.\n",
500
+ "Input: Subject to the terms and conditions of this Agreement , Bank agrees to lend to Borrower , from time to time prior to the Commitment Termination Date , equipment advances ( each an \" Equipment Advance \" and collectively the \" Equipment Advances \").\n",
501
+ "Answer: Bank is an organization, Borrower is a person.\n"
502
+ ]
503
+ }
504
+ ],
505
+ "source": [
506
+ "base_model = 'chatglm2'\n",
507
+ "peft_model = 'FinGPT/fingpt-mt_chatglm2-6b_lora' if FROM_REMOTE else 'finetuned_models/MT-chatglm2-linear_202309201120'\n",
508
+ "\n",
509
+ "model, tokenizer = load_model(base_model, peft_model, FROM_REMOTE)\n",
510
+ "test_demo(model, tokenizer)"
511
+ ]
512
+ },
513
+ {
514
+ "cell_type": "markdown",
515
+ "metadata": {},
516
+ "source": [
517
+ "# BLOOM-7B1"
518
+ ]
519
+ },
520
+ {
521
+ "cell_type": "code",
522
+ "execution_count": 10,
523
+ "metadata": {},
524
+ "outputs": [
525
+ {
526
+ "data": {
527
+ "application/json": {
528
+ "ascii": false,
529
+ "bar_format": null,
530
+ "colour": null,
531
+ "elapsed": 0.004486799240112305,
532
+ "initial": 0,
533
+ "n": 0,
534
+ "ncols": null,
535
+ "nrows": null,
536
+ "postfix": null,
537
+ "prefix": "Loading checkpoint shards",
538
+ "rate": null,
539
+ "total": 2,
540
+ "unit": "it",
541
+ "unit_divisor": 1000,
542
+ "unit_scale": false
543
+ },
544
+ "application/vnd.jupyter.widget-view+json": {
545
+ "model_id": "32ee0b5e2df049a0b9e458c779e09a68",
546
+ "version_major": 2,
547
+ "version_minor": 0
548
+ },
549
+ "text/plain": [
550
+ "Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s]"
551
+ ]
552
+ },
553
+ "metadata": {},
554
+ "output_type": "display_data"
555
+ },
556
+ {
557
+ "name": "stdout",
558
+ "output_type": "stream",
559
+ "text": [
560
+ "\n",
561
+ "==== Financial Sentiment Analysis ====\n",
562
+ "\n",
563
+ "Instruction: What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}.\n",
564
+ "Input: Glaxo's ViiV Healthcare Signs China Manufacturing Deal With Desano\n",
565
+ "Answer: positive\n",
566
+ "\n",
567
+ "==== Financial Relation Extraction ====\n",
568
+ "\n",
569
+ "Instruction: Given phrases that describe the relationship between two words/phrases as options, extract the word/phrase pair and the corresponding lexical relationship between them from the input text. The output format should be \"relation1: word1, word2; relation2: word3, word4\". Options: product/material produced, manufacturer, distributed by, industry, position held, original broadcaster, owned by, founded by, distribution format, headquarters location, stock exchange, currency, parent organization, chief executive officer, director/manager, owner of, operator, member of, employer, chairperson, platform, subsidiary, legal form, publisher, developer, brand, business division, location of formation, creator.\n",
570
+ "Input: Wednesday, July 8, 2015 10:30AM IST (5:00AM GMT) Rimini Street Comment on Oracle Litigation Las Vegas, United States Rimini Street, Inc., the leading independent provider of enterprise software support for SAP AG’s (NYSE:SAP) Business Suite and BusinessObjects software and Oracle Corporation’s (NYSE:ORCL) Siebel , PeopleSoft , JD Edwards , E-Business Suite , Oracle Database , Hyperion and Oracle Retail software, today issued a statement on the Oracle litigation.\n",
571
+ "Answer: product_or_material_produced: software provider, Software\n",
572
+ "\n",
573
+ "==== Financial Headline Classification ====\n",
574
+ "\n",
575
+ "Instruction: Does the news headline talk about price in the past? Please choose an answer from {Yes/No}.\n",
576
+ "Input: april gold down 20 cents to settle at $1,116.10/oz\n",
577
+ "Answer: Yes\n",
578
+ "\n",
579
+ "==== Financial Named Entity Recognition ====\n",
580
+ "\n",
581
+ "Instruction: Please extract entities and their types from the input sentence, entity types should be chosen from {person/organization/location}.\n",
582
+ "Input: Subject to the terms and conditions of this Agreement , Bank agrees to lend to Borrower , from time to time prior to the Commitment Termination Date , equipment advances ( each an \" Equipment Advance \" and collectively the \" Equipment Advances \").\n",
583
+ "Answer: Bank is an organization, Borrower is a person.\n"
584
+ ]
585
+ }
586
+ ],
587
+ "source": [
588
+ "base_model = 'bloom'\n",
589
+ "peft_model = 'FinGPT/fingpt-mt_bloom-7b1_lora' if FROM_REMOTE else 'finetuned_models/MT-bloom-linear_202309211510'\n",
590
+ "\n",
591
+ "model, tokenizer = load_model(base_model, peft_model, FROM_REMOTE)\n",
592
+ "test_demo(model, tokenizer)"
593
+ ]
594
+ },
595
+ {
596
+ "cell_type": "markdown",
597
+ "metadata": {},
598
+ "source": [
599
+ "# MPT-7B"
600
+ ]
601
+ },
602
+ {
603
+ "cell_type": "code",
604
+ "execution_count": 11,
605
+ "metadata": {},
606
+ "outputs": [
607
+ {
608
+ "name": "stderr",
609
+ "output_type": "stream",
610
+ "text": [
611
+ "/root/.cache/huggingface/modules/transformers_modules/mpt-7b-peft-compatible/attention.py:148: UserWarning: Using `attn_impl: torch`. If your model does not use `alibi` or `prefix_lm` we recommend using `attn_impl: flash` otherwise we recommend using `attn_impl: triton`.\n",
612
+ " warnings.warn('Using `attn_impl: torch`. If your model does not use `alibi` or ' + '`prefix_lm` we recommend using `attn_impl: flash` otherwise ' + 'we recommend using `attn_impl: triton`.')\n",
613
+ "The model weights are not tied. Please use the `tie_weights` method before using the `infer_auto_device` function.\n"
614
+ ]
615
+ },
616
+ {
617
+ "data": {
618
+ "application/json": {
619
+ "ascii": false,
620
+ "bar_format": null,
621
+ "colour": null,
622
+ "elapsed": 0.004449605941772461,
623
+ "initial": 0,
624
+ "n": 0,
625
+ "ncols": null,
626
+ "nrows": null,
627
+ "postfix": null,
628
+ "prefix": "Loading checkpoint shards",
629
+ "rate": null,
630
+ "total": 2,
631
+ "unit": "it",
632
+ "unit_divisor": 1000,
633
+ "unit_scale": false
634
+ },
635
+ "application/vnd.jupyter.widget-view+json": {
636
+ "model_id": "0440bc96112344c493c8a1f5dd76f319",
637
+ "version_major": 2,
638
+ "version_minor": 0
639
+ },
640
+ "text/plain": [
641
+ "Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s]"
642
+ ]
643
+ },
644
+ "metadata": {},
645
+ "output_type": "display_data"
646
+ },
647
+ {
648
+ "name": "stderr",
649
+ "output_type": "stream",
650
+ "text": [
651
+ "Using pad_token, but it is not set yet.\n"
652
+ ]
653
+ },
654
+ {
655
+ "name": "stdout",
656
+ "output_type": "stream",
657
+ "text": [
658
+ "\n",
659
+ "==== Financial Sentiment Analysis ====\n",
660
+ "\n",
661
+ "Instruction: What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}.\n",
662
+ "Input: Glaxo's ViiV Healthcare Signs China Manufacturing Deal With Desano\n",
663
+ "Answer: positive\n",
664
+ "\n",
665
+ "==== Financial Relation Extraction ====\n",
666
+ "\n",
667
+ "Instruction: Given phrases that describe the relationship between two words/phrases as options, extract the word/phrase pair and the corresponding lexical relationship between them from the input text. The output format should be \"relation1: word1, word2; relation2: word3, word4\". Options: product/material produced, manufacturer, distributed by, industry, position held, original broadcaster, owned by, founded by, distribution format, headquarters location, stock exchange, currency, parent organization, chief executive officer, director/manager, owner of, operator, member of, employer, chairperson, platform, subsidiary, legal form, publisher, developer, brand, business division, location of formation, creator.\n",
668
+ "Input: Wednesday, July 8, 2015 10:30AM IST (5:00AM GMT) Rimini Street Comment on Oracle Litigation Las Vegas, United States Rimini Street, Inc., the leading independent provider of enterprise software support for SAP AG’s (NYSE:SAP) Business Suite and BusinessObjects software and Oracle Corporation’s (NYSE:ORCL) Siebel, PeopleSoft, JD Edwards, E-Business Suite, Oracle Database, Hyperion and Oracle Retail software, today issued a statement on the Oracle litigation.\n",
669
+ "Answer: product_or_material_produced: Hyperion, software\n",
670
+ "\n",
671
+ "==== Financial Headline Classification ====\n",
672
+ "\n",
673
+ "Instruction: Does the news headline talk about price in the past? Please choose an answer from {Yes/No}.\n",
674
+ "Input: april gold down 20 cents to settle at $1,116.10/oz\n",
675
+ "Answer: Yes\n",
676
+ "\n",
677
+ "==== Financial Named Entity Recognition ====\n",
678
+ "\n",
679
+ "Instruction: Please extract entities and their types from the input sentence, entity types should be chosen from {person/organization/location}.\n",
680
+ "Input: Subject to the terms and conditions of this Agreement, Bank agrees to lend to Borrower, from time to time prior to the Commitment Termination Date, equipment advances ( each an \" Equipment Advance \" and collectively the \" Equipment Advances \").\n",
681
+ "Answer: Bank is an organization, Borrower is a person.\n"
682
+ ]
683
+ }
684
+ ],
685
+ "source": [
686
+ "base_model = 'mpt'\n",
687
+ "peft_model = 'FinGPT/fingpt-mt_mpt-7b_lora' if FROM_REMOTE else 'finetuned_models/MT-mpt-linear_202309230221'\n",
688
+ "\n",
689
+ "model, tokenizer = load_model(base_model, peft_model, FROM_REMOTE)\n",
690
+ "test_demo(model, tokenizer)"
691
+ ]
692
+ }
693
+ ],
694
+ "metadata": {
695
+ "kernelspec": {
696
+ "display_name": "torch2",
697
+ "language": "python",
698
+ "name": "torch2"
699
+ },
700
+ "language_info": {
701
+ "codemirror_mode": {
702
+ "name": "ipython",
703
+ "version": 3
704
+ },
705
+ "file_extension": ".py",
706
+ "mimetype": "text/x-python",
707
+ "name": "python",
708
+ "nbconvert_exporter": "python",
709
+ "pygments_lexer": "ipython3",
710
+ "version": "3.9.12"
711
+ }
712
+ },
713
+ "nbformat": 4,
714
+ "nbformat_minor": 4
715
+ }
external/FinGPT/fingpt/FinGPT_Benchmark/readme.md ADDED
@@ -0,0 +1,169 @@
+ # FinGPT's Benchmark
+
+ [FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets
+ ](https://arxiv.org/abs/2310.04793)
+
+
+ The datasets we used and the multi-task financial LLMs are available at <https://huggingface.co/FinGPT>
+
+ ---
+
+ Before you start, make sure you have the correct versions of the key packages installed.
+ ```
+ transformers==4.32.0
+ peft==0.5.0
+ ```
+
+ [Weights & Biases](https://wandb.ai/site) is a good tool for tracking model training and inference. You need to register, get a free API key, and create a new project.
+
+ wandb produces some nice charts like the following:
+
+ <img width="440" alt="image" src="https://github.com/AI4Finance-Foundation/FinGPT/assets/31713746/04a08b3d-58e3-47aa-8b07-3ec6ff9dfea4">
+ <img width="440" alt="image" src="https://github.com/AI4Finance-Foundation/FinGPT/assets/31713746/f207a64b-622d-4a41-8e0f-1959a2d25450">
+ <img width="440" alt="image" src="https://github.com/AI4Finance-Foundation/FinGPT/assets/31713746/e7699c64-7c3c-4130-94b3-59688631120a">
+ <img width="440" alt="image" src="https://github.com/AI4Finance-Foundation/FinGPT/assets/31713746/65ca7853-3d33-4856-80e5-f03476efcc78">
+
+
+ ## Ready-to-use Demo
+
+ For users who want ready-to-use financial multi-task language models, please refer to `demo.ipynb`.
+ Following this notebook, you're able to test Llama2-7B, ChatGLM2-6B, MPT-7B, BLOOM-7B, Falcon-7B, or Qwen-7B on any of the following tasks:
+ - Financial Sentiment Analysis
+ - Headline Classification
+ - Named Entity Recognition
+ - Financial Relation Extraction
+
+ We suggest users follow the instruction template and task prompts that we used in our training process; a minimal sketch is shown below. Demos are shown in `demo.ipynb`. Due to the limited diversity of the financial tasks and datasets we used, models might not respond correctly to out-of-scope instructions. We'll delve deeper into generalization ability in future work.
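+
+ As a minimal sketch (assuming the `default` template defined in `utils.py`), a training-style prompt is assembled like this:
+ ```python
+ # The instruction template used during training (from utils.py).
+ template = 'Instruction: {instruction}\nInput: {input}\nAnswer: '
+
+ prompt = template.format(
+     instruction='What is the sentiment of this news? Please choose an answer from {negative/neutral/positive}.',
+     input="Glaxo's ViiV Healthcare Signs China Manufacturing Deal With Desano",
+ )
+ # The model is expected to complete the text after "Answer: ".
+ ```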
+
+ ## Prepare Data & Base Models
+
+ For the base models we used, we recommend pre-downloading them and saving them to `base_models/`.
+
+ Refer to the `parse_model_name()` function in `utils.py` for the Hugging Face model we used for each LLM; a usage sketch follows. (We use base models rather than any instruction-tuned or chat version, except for ChatGLM2.)
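+
+ For example, a quick sanity check (paths taken from the `model_paths` table in `utils.py`):
+ ```python
+ from utils import parse_model_name
+
+ parse_model_name('llama2', from_remote=True)   # -> 'meta-llama/Llama-2-7b-hf'
+ parse_model_name('llama2', from_remote=False)  # -> 'base_models/Llama-2-7b-hf'
+ ```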
+
+ ---
+
+ For the datasets we used, download our processed instruction-tuning data from Hugging Face. Take the FinRED dataset as an example:
+ ```
+ import datasets
+
+ dataset = datasets.load_dataset('FinGPT/fingpt-finred')
+ # save to local disk space (recommended)
+ dataset.save_to_disk('data/fingpt-finred')
+ ```
+ Then `finred` becomes an available task option for training. Dataset names may also carry a replication factor, as shown in the sketch below.
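+
+ A minimal sketch of that parsing (based on `load_dataset()` in `utils.py`): a `*n` suffix, as in the `finred*3,ner*15` commands further down, simply loads the dataset n times to oversample it during multi-task tuning.
+ ```python
+ from utils import load_dataset
+
+ # 'sentiment-train' once plus 'finred' replicated three times -> four entries.
+ dataset_list = load_dataset('sentiment-train,finred*3', from_remote=False)
+ assert len(dataset_list) == 4
+ ```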
+
+ We use different datasets at different phases of our instruction tuning paradigm.
+ - Task-specific Instruction Tuning: `sentiment-train / finred-re / ner / headline`
+ - Multi-task Instruction Tuning: `sentiment-train & finred & ner & headline`
+ - Zero-shot Aimed Instruction Tuning: `finred-cls & ner-cls & headline-cls -> sentiment-cls (test)`
+
+ You may download the datasets according to your needs. We also provide processed datasets for ConvFinQA and FinEval, but they are not used in our final work.
+
+ ### Prepare Data from Scratch
+ To prepare training data from raw data, follow `data/prepare_data.ipynb`.
+
+ We don't include any source data from other open-source financial datasets in our repository, so if you want to do it from scratch, you need to find the corresponding source data and put it in `data/` before you start.
+
+ ---
+
+ ## Instruction Tuning
+
+ `train.sh` contains examples of instruction tuning with this repo.
+ If you don't have the training data & base models on your local disk, additionally pass `--from_remote true`.
+
+ ### Task-specific Instruction Tuning
+ ```
+ #chatglm2
+ deepspeed train_lora.py \
+ --run_name headline-chatglm2-linear \
+ --base_model chatglm2 \
+ --dataset headline \
+ --max_length 512 \
+ --batch_size 4 \
+ --learning_rate 1e-4 \
+ --num_epochs 8
+ ```
+
+ Please be aware that "localhost:2" refers to a particular GPU device.
+
+ ```
+ #llama2-13b
+ deepspeed -i "localhost:2" train_lora.py \
+ --run_name sentiment-llama2-13b-8epoch-16batch \
+ --base_model llama2-13b-nr \
+ --dataset sentiment-train \
+ --max_length 512 \
+ --batch_size 16 \
+ --learning_rate 1e-5 \
+ --num_epochs 8 \
+ --from_remote True \
+ >train.log 2>&1 &
+ ```
+
+ Use
+ ```
+ tail -f train.log
+ ```
+ to follow the training log.
+
+ ### Multi-task Instruction Tuning
+ ```
+ deepspeed train_lora.py \
+ --run_name MT-falcon-linear \
+ --base_model falcon \
+ --dataset sentiment-train,headline,finred*3,ner*15 \
+ --max_length 512 \
+ --batch_size 4 \
+ --learning_rate 1e-4 \
+ --num_epochs 4
+ ```
+ ### Zero-shot Aimed Instruction Tuning
+ ```
+ deepspeed train_lora.py \
+ --run_name GRCLS-sentiment-falcon-linear-small \
+ --base_model falcon \
+ --test_dataset sentiment-cls-instruct \
+ --dataset headline-cls-instruct,finred-cls-instruct*2,ner-cls-instruct*7 \
+ --max_length 512 \
+ --batch_size 4 \
+ --learning_rate 1e-4 \
+ --num_epochs 1 \
+ --log_interval 10 \
+ --warmup_ratio 0 \
+ --scheduler linear \
+ --evaluation_strategy steps \
+ --eval_steps 100 \
+ --ds_config config_hf.json
+ ```
+
+ ---
+
+ ## Evaluation for Financial Tasks
+
+ Refer to `benchmarks/evaluate.sh` for the evaluation script covering all financial tasks.
+ You can evaluate your trained model on multiple tasks together. For example:
+ ```
+ python benchmarks.py \
+ --dataset fpb,fiqa,tfns,nwgi,headline,ner,re \
+ --base_model llama2 \
+ --peft_model ../finetuned_models/MT-llama2-linear_202309241345 \
+ --batch_size 8 \
+ --max_length 512
+ ```
+
+ ```
+ #llama2-13b sentiment analysis
+ CUDA_VISIBLE_DEVICES=1 python benchmarks.py \
+ --dataset fpb,fiqa,tfns,nwgi \
+ --base_model llama2-13b-nr \
+ --peft_model ../finetuned_models/sentiment-llama2-13b-8epoch-16batch_202310271908 \
+ --batch_size 8 \
+ --max_length 512 \
+ --from_remote True
+ ```
+
+ For zero-shot evaluation on sentiment analysis, we use multiple prompts and evaluate each of them.
+ The task indicators are `fiqa_mlt` and `fpb_mlt`.
+
+
external/FinGPT/fingpt/FinGPT_Benchmark/train.sh ADDED
@@ -0,0 +1,547 @@
+ export CUDA_VISIBLE_DEVICES=0,1,2,3
+ export NCCL_IGNORE_DISABLED_P2P=1
+ export TRANSFORMERS_NO_ADVISORY_WARNINGS=1
+ export TOKENIZERS_PARALLELISM=0
+
+
+
+ #---- Generalization ----
+
+ # deepspeed train_lora.py \
+ # --run_name GRCLS-sentiment-chatglm2-linear-1e-4lr \
+ # --base_model chatglm2 \
+ # --dataset headline-cls-instruct,finred-cls-instruct*2,ner-cls-instruct*7 \
+ # --test_dataset sentiment-cls-instruct \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 1 \
+ # --log_interval 10 \
+ # --warmup_ratio 0.03 \
+ # --scheduler linear \
+ # --evaluation_strategy steps \
+ # --ds_config config_hf.json
+
+ # deepspeed train_lora.py \
+ # --run_name GRCLS-sentiment-llama2-linear-small \
+ # --base_model llama2 \
+ # --test_dataset sentiment-cls-instruct \
+ # --dataset headline-cls-instruct,finred-cls-instruct*2,ner-cls-instruct*7 \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 1 \
+ # --log_interval 10 \
+ # --warmup_ratio 0 \
+ # --scheduler linear \
+ # --evaluation_strategy steps \
+ # --eval_steps 100 \
+ # --ds_config config_hf.json
+
+ # deepspeed train_lora.py \
+ # --run_name GRCLS-sentiment-falcon-linear-small \
+ # --base_model falcon \
+ # --test_dataset sentiment-cls-instruct \
+ # --dataset headline-cls-instruct,finred-cls-instruct*2,ner-cls-instruct*7 \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 1 \
+ # --log_interval 10 \
+ # --warmup_ratio 0 \
+ # --scheduler linear \
+ # --evaluation_strategy steps \
+ # --eval_steps 100 \
+ # --ds_config config_hf.json
+
+ # deepspeed train_lora.py \
+ # --run_name GRCLS-sentiment-qwen-linear-small \
+ # --base_model qwen \
+ # --test_dataset sentiment-cls-instruct \
+ # --dataset headline-cls-instruct,finred-cls-instruct*2,ner-cls-instruct*7 \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 1 \
+ # --log_interval 10 \
+ # --warmup_ratio 0 \
+ # --scheduler linear \
+ # --evaluation_strategy steps \
+ # --eval_steps 100 \
+ # --ds_config config_hf.json
+
+ # deepspeed train_lora.py \
+ # --run_name GRCLS-sentiment-bloom-linear-small \
+ # --base_model bloom \
+ # --test_dataset sentiment-cls-instruct \
+ # --dataset headline-cls-instruct,finred-cls-instruct*2,ner-cls-instruct*7 \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 1 \
+ # --log_interval 10 \
+ # --warmup_ratio 0 \
+ # --scheduler linear \
+ # --evaluation_strategy steps \
+ # --eval_steps 100 \
+ # --ds_config config_hf.json
+
+ # deepspeed train_lora.py \
+ # --run_name GRCLS-sentiment-mpt-linear-small \
+ # --base_model mpt \
+ # --dataset headline-cls-instruct,finred-cls-instruct*2,ner-cls-instruct*7 \
+ # --test_dataset sentiment-cls-instruct \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 1 \
+ # --log_interval 10 \
+ # --warmup_ratio 0.03 \
+ # --scheduler linear \
+ # --evaluation_strategy steps \
+ # --eval_steps 100 \
+ # --ds_config config_hf.json
+
+
+ #---- Multi-Task ----
+
+ # deepspeed train_lora.py \
+ # --run_name MT-chatglm2-linear \
+ # --base_model chatglm2 \
+ # --dataset sentiment-train,headline,finred*3,ner*15 \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 4
+
+ # deepspeed train_lora.py \
+ # --run_name MT-falcon-linear \
+ # --base_model falcon \
+ # --dataset sentiment-train,headline,finred*3,ner*15 \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 4
+
+ # deepspeed train_lora.py \
+ # --run_name MT-qwen-linear \
+ # --base_model qwen \
+ # --dataset sentiment-train,headline,finred*3,ner*15 \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 4
+
+ # deepspeed train_lora.py \
+ # --run_name MT-mpt-linear \
+ # --base_model mpt \
+ # --dataset sentiment-train,headline,finred*3,ner*15 \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 4
+
+ # deepspeed train_lora.py \
+ # --run_name MT-bloom-linear \
+ # --base_model bloom \
+ # --dataset sentiment-train,headline,finred*3,ner*15 \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 4
+
+ # deepspeed train_lora.py \
+ # --run_name MT-llama2-linear \
+ # --base_model llama2 \
+ # --dataset sentiment-train,headline,finred*3,ner*15 \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 4 \
+ # --log_interval 10
+
+
+ #---- FinEval ----
+
+ # deepspeed train_lora.py \
+ # --run_name fineval-internlm-linear \
+ # --base_model internlm \
+ # --dataset data/fingpt-fineval \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 50 \
+ # --log_interval 10
+
+ # deepspeed train_lora.py \
+ # --run_name fineval-llama2-linear \
+ # --base_model llama2 \
+ # --dataset data/fingpt-fineval \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 50 \
+ # --log_interval 10
+
+ # deepspeed train_lora.py \
+ # --run_name fineval-chatglm2-linear \
+ # --base_model chatglm2 \
+ # --dataset data/fingpt-fineval \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 50 \
+ # --log_interval 10
+
+ # deepspeed train_lora.py \
+ # --run_name fineval-falcon-linear \
+ # --base_model falcon \
+ # --dataset data/fingpt-fineval \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 50 \
+ # --log_interval 10
+
+ # deepspeed train_lora.py \
+ # --run_name fineval-qwen-linear \
+ # --base_model qwen \
+ # --dataset data/fingpt-fineval \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 50 \
+ # --log_interval 10
+
+ # deepspeed train_lora.py \
+ # --run_name fineval-mpt-linear \
+ # --base_model mpt \
+ # --dataset data/fingpt-fineval \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 50 \
+ # --log_interval 10
+
+ # deepspeed train_lora.py \
+ # --run_name fineval-bloom-linear \
+ # --base_model bloom \
+ # --dataset data/fingpt-fineval \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 50 \
+ # --log_interval 10
+
+
+ #---- ConvFinQA ----
+
+ # deepspeed train_lora.py \
+ # --run_name convfinqa-llama2-linear \
+ # --base_model llama2 \
+ # --ds_config config_hf.json \
+ # --dataset data/fingpt-convfinqa \
+ # --max_length 2048 \
+ # --batch_size 1 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 4
+
+ # deepspeed train_lora.py \
+ # --run_name convfinqa-chatglm2-linear \
+ # --base_model chatglm2 \
+ # --dataset data/fingpt-convfinqa \
+ # --max_length 2048 \
+ # --batch_size 1 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 4
+
+ # deepspeed train_lora.py \
+ # --run_name convfinqa-falcon-linear \
+ # --base_model falcon \
+ # --dataset data/fingpt-convfinqa \
+ # --max_length 2048 \
+ # --batch_size 1 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 4
+
+ # deepspeed train_lora.py \
+ # --run_name convfinqa-qwen-linear \
+ # --base_model qwen \
+ # --dataset data/fingpt-convfinqa \
+ # --max_length 2048 \
+ # --batch_size 1 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 4
+
+ # deepspeed train_lora.py \
+ # --run_name convfinqa-mpt-linear \
+ # --base_model mpt \
+ # --dataset data/fingpt-convfinqa \
+ # --max_length 2048 \
+ # --batch_size 1 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 4
+
+ # deepspeed train_lora.py \
+ # --run_name convfinqa-bloom-linear \
+ # --base_model bloom \
+ # --dataset data/fingpt-convfinqa \
+ # --max_length 2048 \
+ # --batch_size 1 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 4
+
+
+ #---- NER ----
+
+ # deepspeed train_lora.py \
+ # --run_name ner-llama2-linear \
+ # --base_model llama2 \
+ # --dataset data/fingpt-ner \
+ # --ds_config config_hf.json \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 100 \
+ # --log_interval 10
+
+ # deepspeed train_lora.py \
+ # --run_name ner-chatglm2-linear \
+ # --base_model chatglm2 \
+ # --dataset data/fingpt-ner \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 100 \
+ # --log_interval 10
+
+ # deepspeed train_lora.py \
+ # --run_name ner-falcon-linear \
+ # --base_model falcon \
+ # --dataset data/fingpt-ner \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 100 \
+ # --log_interval 10
+
+ # deepspeed train_lora.py \
+ # --run_name ner-qwen-linear \
+ # --base_model qwen \
+ # --dataset data/fingpt-ner \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 100 \
+ # --log_interval 10
+
+ # deepspeed train_lora.py \
+ # --run_name ner-mpt-linear \
+ # --base_model mpt \
+ # --dataset data/fingpt-ner \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 100 \
+ # --log_interval 10
+
+ # deepspeed train_lora.py \
+ # --run_name ner-bloom-linear \
+ # --base_model bloom \
+ # --dataset data/fingpt-ner \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 100 \
+ # --log_interval 10
+
+
+ #---- Headline (IE) ----
+
+ # deepspeed train_lora.py \
+ # --run_name headline-internlm-linear \
+ # --base_model internlm \
+ # --dataset data/fingpt-headline \
+ # --ds_config config_hf.json \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name headline-llama2-linear \
+ # --base_model llama2 \
+ # --dataset data/fingpt-headline \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name headline-chatglm2-linear \
+ # --base_model chatglm2 \
+ # --dataset data/fingpt-headline \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name headline-falcon-linear \
+ # --base_model falcon \
+ # --dataset data/fingpt-headline \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name headline-qwen-linear \
+ # --base_model qwen \
+ # --dataset data/fingpt-headline \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name headline-mpt-linear \
+ # --base_model mpt \
+ # --dataset data/fingpt-headline \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name headline-bloom-linear \
+ # --base_model bloom \
+ # --dataset data/fingpt-headline \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ #---- Sentiment Analysis ----
+
+ # deepspeed train_lora.py \
+ # --run_name sentiment-internlm-linear \
+ # --base_model internlm \
+ # --dataset data/fingpt-sentiment-train \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name sentiment-llama2-linear \
+ # --base_model llama2 \
+ # --dataset data/fingpt-sentiment-train \
+ # --ds_config config_hf.json \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name sentiment-chatglm2-linear \
+ # --base_model chatglm2 \
+ # --dataset data/fingpt-sentiment-train \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name sentiment-falcon-linear \
+ # --base_model falcon \
+ # --dataset data/fingpt-sentiment-train \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name sentiment-qwen-linear \
+ # --base_model qwen \
+ # --dataset data/fingpt-sentiment-train \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name sentiment-mpt-linear \
+ # --base_model mpt \
+ # --dataset data/fingpt-sentiment-train \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name sentiment-bloom-linear \
+ # --base_model bloom \
+ # --dataset data/fingpt-sentiment-train \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+
+ #---- Relation Extraction ----
+
+ # deepspeed train_lora.py \
+ # --run_name finred-llama2-linear \
+ # --base_model llama2 \
+ # --dataset data/fingpt-finred-re \
+ # --ds_config config_hf.json \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name finred-chatglm2-linear \
+ # --base_model chatglm2 \
+ # --dataset data/fingpt-finred-re \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name finred-falcon-linear \
+ # --base_model falcon \
+ # --dataset data/fingpt-finred-re \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name finred-qwen-linear \
+ # --base_model qwen \
+ # --dataset data/fingpt-finred-re \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name finred-mpt-linear \
+ # --base_model mpt \
+ # --dataset data/fingpt-finred-re \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
+
+ # deepspeed train_lora.py \
+ # --run_name finred-bloom-linear \
+ # --base_model bloom \
+ # --dataset data/fingpt-finred-re \
+ # --max_length 512 \
+ # --batch_size 4 \
+ # --learning_rate 1e-4 \
+ # --num_epochs 8
external/FinGPT/fingpt/FinGPT_Benchmark/train_lora.py ADDED
@@ -0,0 +1,198 @@
+ import os
+ import sys
+ import argparse
+ from datetime import datetime
+ from functools import partial
+ import datasets
+ import torch
+ from torch.utils.tensorboard import SummaryWriter
+ import wandb
+ from transformers import (
+     AutoTokenizer,
+     AutoModel,
+     AutoModelForCausalLM,
+     TrainingArguments,
+     Trainer,
+     DataCollatorForSeq2Seq
+ )
+ from transformers.trainer import TRAINING_ARGS_NAME
+ from transformers.integrations import TensorBoardCallback
+ # Importing LoRA specific modules
+ from peft import (
+     TaskType,
+     LoraConfig,
+     get_peft_model,
+     get_peft_model_state_dict,
+     prepare_model_for_int8_training,
+     set_peft_model_state_dict
+ )
+ from utils import *
+
+
+ # Replace with your own api_key and project name (never commit a real key)
+ os.environ['WANDB_API_KEY'] = 'your_wandb_api_key'
+ os.environ['WANDB_PROJECT'] = 'fingpt-benchmark'
+
+
+ def main(args):
+     """
+     Main function to execute the training script.
+
+     :param args: Command line arguments
+     """
+
+     # Parse the model name and determine if it should be fetched from a remote source
+     model_name = parse_model_name(args.base_model, args.from_remote)
+
+     # Load the pre-trained causal language model
+     model = AutoModelForCausalLM.from_pretrained(
+         model_name,
+         # load_in_8bit=True,
+         # device_map="auto",
+         trust_remote_code=True
+     )
+     # Print model architecture for the first process in distributed training
+     if args.local_rank == 0:
+         print(model)
+
+     # Load tokenizer associated with the pre-trained model
+     tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
+
+     # Apply model specific tokenization settings
+     if args.base_model != 'mpt':
+         tokenizer.padding_side = "left"
+     if args.base_model == 'qwen':
+         tokenizer.eos_token_id = tokenizer.convert_tokens_to_ids('<|endoftext|>')
+         tokenizer.pad_token_id = tokenizer.convert_tokens_to_ids('<|extra_0|>')
+     # Ensure padding token is set correctly; adding '[PAD]' grows the vocabulary,
+     # so the embedding matrix must be resized to match.
+     if not tokenizer.pad_token or tokenizer.pad_token_id == tokenizer.eos_token_id:
+         tokenizer.add_special_tokens({'pad_token': '[PAD]'})
+         model.resize_token_embeddings(len(tokenizer))
+
+     # Load training and testing datasets
+     dataset_list = load_dataset(args.dataset, args.from_remote)
+     dataset_train = datasets.concatenate_datasets([d['train'] for d in dataset_list]).shuffle(seed=42)
+
+     # If no dedicated test dataset is given, fall back to the test splits of the
+     # training datasets (load_dataset in utils.py guarantees a 'test' split).
+     if args.test_dataset:
+         test_list = load_dataset(args.test_dataset, args.from_remote)
+     else:
+         test_list = dataset_list
+     dataset_test = datasets.concatenate_datasets([d['test'] for d in test_list])
+
+     dataset = datasets.DatasetDict({'train': dataset_train, 'test': dataset_test})
+     # Display first sample from the training dataset
+     print(dataset['train'][0])
+     # Filter out samples that exceed the maximum token length and remove unused columns
+     dataset = dataset.map(partial(tokenize, args, tokenizer))
+     print('original dataset length: ', len(dataset['train']))
+     dataset = dataset.filter(lambda x: not x['exceed_max_length'])
+     print('filtered dataset length: ', len(dataset['train']))
+     dataset = dataset.remove_columns(['instruction', 'input', 'output', 'exceed_max_length'])
+
+     print(dataset['train'][0])
+
+     # Create a timestamp for model saving
+     current_time = datetime.now()
+     formatted_time = current_time.strftime('%Y%m%d%H%M')
+
+     # Set up training arguments
+     training_args = TrainingArguments(
+         output_dir=f'finetuned_models/{args.run_name}_{formatted_time}',  # where checkpoints are saved
+         logging_steps=args.log_interval,
+         num_train_epochs=args.num_epochs,
+         per_device_train_batch_size=args.batch_size,
+         per_device_eval_batch_size=args.batch_size,
+         gradient_accumulation_steps=args.gradient_steps,
+         dataloader_num_workers=args.num_workers,
+         learning_rate=args.learning_rate,
+         warmup_ratio=args.warmup_ratio,
+         lr_scheduler_type=args.scheduler,
+         save_steps=args.eval_steps,
+         eval_steps=args.eval_steps,
+         fp16=True,
+         # fp16_full_eval=True,
+         deepspeed=args.ds_config,
+         evaluation_strategy=args.evaluation_strategy,
+         load_best_model_at_end=args.load_best_model,
+         remove_unused_columns=False,
+         report_to='wandb',
+         run_name=args.run_name
+     )
+     if args.base_model != 'mpt':
+         model.gradient_checkpointing_enable()
+         model.enable_input_require_grads()
+     model.is_parallelizable = True
+     model.model_parallel = True
+     # Caching is incompatible with gradient checkpointing
+     model.config.use_cache = False
+     # model = prepare_model_for_int8_training(model)
+
+     # setup peft for lora
+     peft_config = LoraConfig(
+         task_type=TaskType.CAUSAL_LM,
+         inference_mode=False,
+         r=8,
+         lora_alpha=32,
+         lora_dropout=0.1,
+         target_modules=lora_module_dict[args.base_model],
+         bias='none',
+     )
+     model = get_peft_model(model, peft_config)
+
+     # Initialize TensorBoard for logging
+     writer = SummaryWriter()
+
+     # Initialize the trainer
+     trainer = Trainer(
+         model=model,
+         args=training_args,
+         train_dataset=dataset["train"],
+         eval_dataset=dataset["test"],
+         data_collator=DataCollatorForSeq2Seq(
+             tokenizer, padding=True,
+             return_tensors="pt"
+         ),
+         callbacks=[TensorBoardCallback(writer)],
+     )
+
+     # if torch.__version__ >= "2" and sys.platform != "win32":
+     #     model = torch.compile(model)
+
+     # Clear CUDA cache and start training
+     torch.cuda.empty_cache()
+     trainer.train()
+     writer.close()
+
+     # Save the fine-tuned model
+     model.save_pretrained(training_args.output_dir)
+
+
+ def str2bool(v):
+     # argparse's type=bool treats any non-empty string (including 'false') as True,
+     # so boolean flags need an explicit string-to-bool conversion.
+     return str(v).lower() in ('true', '1', 'yes')
+
+
+ if __name__ == "__main__":
+     # Argument parser for command line arguments
+     parser = argparse.ArgumentParser()
+     parser.add_argument("--local_rank", default=0, type=int)
+     parser.add_argument("--run_name", default='local-test', type=str)
+     parser.add_argument("--dataset", required=True, type=str)
+     parser.add_argument("--test_dataset", type=str)
+     parser.add_argument("--base_model", required=True, type=str, choices=['chatglm2', 'llama2', 'llama2-13b', 'llama2-13b-nr', 'baichuan', 'falcon', 'internlm', 'qwen', 'mpt', 'bloom'])
+     parser.add_argument("--max_length", default=512, type=int)
+     parser.add_argument("--batch_size", default=4, type=int, help="The train batch size per device")
+     parser.add_argument("--learning_rate", default=1e-4, type=float, help="The learning rate")
+     parser.add_argument("--num_epochs", default=8, type=float, help="The training epochs")
+     parser.add_argument("--gradient_steps", default=8, type=int, help="The gradient accumulation steps")
+     parser.add_argument("--num_workers", default=8, type=int, help="dataloader workers")
+     parser.add_argument("--log_interval", default=20, type=int)
+     parser.add_argument("--warmup_ratio", default=0.05, type=float)
+     parser.add_argument("--ds_config", default='./config_new.json', type=str)
+     parser.add_argument("--scheduler", default='linear', type=str)
+     parser.add_argument("--instruct_template", default='default')
+     parser.add_argument("--evaluation_strategy", default='steps', type=str)
+     parser.add_argument("--load_best_model", default=False, type=str2bool)
+     parser.add_argument("--eval_steps", default=0.1, type=float)
+     parser.add_argument("--from_remote", default=False, type=str2bool)
+     args = parser.parse_args()
+
+     # Login to Weights and Biases
+     wandb.login()
+
+     # Run the main function
+     main(args)
external/FinGPT/fingpt/FinGPT_Benchmark/utils.py ADDED
@@ -0,0 +1,216 @@
+ import os
+ import datasets
+
+ # A dictionary to store various prompt templates.
+ template_dict = {
+     'default': 'Instruction: {instruction}\nInput: {input}\nAnswer: '
+ }
+
+ # A dictionary to store the LoRA module mapping for different models.
+ lora_module_dict = {
+     'chatglm2': ['query_key_value'],
+     'falcon': ['query_key_value'],
+     'bloom': ['query_key_value'],
+     'internlm': ['q_proj', 'k_proj', 'v_proj'],
+     'llama2': ['q_proj', 'k_proj', 'v_proj'],
+     'llama2-13b': ['q_proj', 'k_proj', 'v_proj'],
+     'llama2-13b-nr': ['q_proj', 'k_proj', 'v_proj'],
+     'qwen': ["c_attn"],
+     'mpt': ['Wqkv'],
+     'baichuan': ['q_proj', 'k_proj', 'v_proj'],
+ }
+
+
+ def get_prompt(template, instruction, input_text):
+     """
+     Generates a prompt based on a predefined template, instruction, and input.
+
+     Args:
+         template (str): The key to select the prompt template from the predefined dictionary.
+         instruction (str): The instruction text to be included in the prompt.
+         input_text (str): The input text to be included in the prompt.
+
+     Returns:
+         str: The generated prompt.
+
+     Raises:
+         KeyError: If the provided template key is not found in the template dictionary.
+     """
+     if not instruction:
+         return input_text
+
+     if template not in template_dict:
+         raise KeyError(f"Template '{template}' not found. Available templates: {', '.join(template_dict.keys())}")
+
+     return template_dict[template].format(instruction=instruction, input=input_text)
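+
+ # Example:
+ #   get_prompt('default', 'What is the sentiment?', 'Stocks rallied.')
+ #   -> 'Instruction: What is the sentiment?\nInput: Stocks rallied.\nAnswer: '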
+
+
+ def test_mapping(args, feature):
+     """
+     Generate a mapping for testing purposes by constructing a prompt based on given instructions and input.
+
+     Args:
+         args (Namespace): A namespace object that holds various configurations, including the instruction template.
+         feature (dict): A dictionary containing 'instruction' and 'input' fields used to construct the prompt.
+
+     Returns:
+         dict: A dictionary containing the generated prompt.
+
+     Raises:
+         ValueError: If 'instruction' or 'input' are not provided in the feature dictionary.
+     """
+     # Ensure 'instruction' and 'input' are present in the feature dictionary.
+     if 'instruction' not in feature or 'input' not in feature:
+         raise ValueError("Both 'instruction' and 'input' need to be provided in the feature dictionary.")
+
+     # Construct the prompt using the provided instruction and input.
+     prompt = get_prompt(
+         args.instruct_template,
+         feature['instruction'],
+         feature['input']
+     )
+
+     return {
+         "prompt": prompt,
+     }
+
+ def tokenize(args, tokenizer, feature):
+     """
+     Tokenizes the input prompt and target/output for model training or evaluation.
+
+     Args:
+         args (Namespace): A namespace object containing various settings and configurations.
+         tokenizer (Tokenizer): A tokenizer object used to convert text into tokens.
+         feature (dict): A dictionary containing 'input', 'instruction', and 'output' fields.
+
+     Returns:
+         dict: A dictionary containing tokenized 'input_ids', 'labels', and a flag 'exceed_max_length'.
+     """
+     # Generate the prompt.
+     prompt = get_prompt(
+         args.instruct_template,
+         feature['instruction'],
+         feature['input']
+     )
+     # Tokenize the prompt.
+     prompt_ids = tokenizer(
+         prompt,
+         padding=False,
+         max_length=args.max_length,
+         truncation=True
+     )['input_ids']
+
+     # Tokenize the target/output.
+     target_ids = tokenizer(
+         feature['output'].strip(),
+         padding=False,
+         max_length=args.max_length,
+         truncation=True,
+         add_special_tokens=False
+     )['input_ids']
+
+     # Combine tokenized prompt and target output.
+     input_ids = prompt_ids + target_ids
+
+     # Check if the combined length exceeds the maximum allowed length.
+     exceed_max_length = len(input_ids) >= args.max_length
+
+     # Add an end-of-sequence (EOS) token if it's not already present
+     # and if the sequence length is within the limit.
+     if input_ids[-1] != tokenizer.eos_token_id and not exceed_max_length:
+         input_ids.append(tokenizer.eos_token_id)
+
+     # Create label IDs for training.
+     # The labels should start from where the prompt ends, and be padded for the prompt portion.
+     label_ids = [tokenizer.pad_token_id] * len(prompt_ids) + input_ids[len(prompt_ids):]
+
+     return {
+         "input_ids": input_ids,
+         "labels": label_ids,
+         "exceed_max_length": exceed_max_length
+     }
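+
+ # Illustration with hypothetical token ids: for prompt_ids = [p1, p2, p3] and
+ # target_ids = [t1, t2], tokenize() returns input_ids = [p1, p2, p3, t1, t2, eos]
+ # and labels = [pad, pad, pad, t1, t2, eos], so the loss is computed only on the
+ # answer tokens, never on the prompt.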
+
+
+ def parse_model_name(name, from_remote=False):
+     """
+     Parse the model name and return the appropriate path based on whether
+     the model is to be fetched from a remote source or from a local source.
+
+     Args:
+     - name (str): Name of the model.
+     - from_remote (bool): If True, return the remote path, else return the local path.
+
+     Returns:
+     - str: The appropriate path for the given model name.
+     """
+     model_paths = {
+         'chatglm2': ('THUDM/chatglm2-6b', 'base_models/chatglm2-6b'),
+         'llama2': ('meta-llama/Llama-2-7b-hf', 'base_models/Llama-2-7b-hf'),
+         'llama2-13b': ('meta-llama/Llama-2-13b-hf', 'base_models/Llama-2-13b-hf'),
+         'llama2-13b-nr': ('NousResearch/Llama-2-13b-hf', 'base_models/Llama-2-13b-hf'),
+         'falcon': ('tiiuae/falcon-7b', 'base_models/falcon-7b'),
+         'internlm': ('internlm/internlm-7b', 'base_models/internlm-7b'),
+         'qwen': ('Qwen/Qwen-7B', 'base_models/Qwen-7B'),
+         'baichuan': ('baichuan-inc/Baichuan2-7B-Base', 'base_models/Baichuan2-7B-Base'),
+         'mpt': ('cekal/mpt-7b-peft-compatible', 'base_models/mpt-7b-peft-compatible'),
+         'bloom': ('bigscience/bloom-7b1', 'base_models/bloom-7b1')
+     }
+
+     if name in model_paths:
+         return model_paths[name][0] if from_remote else model_paths[name][1]
+     else:
+         valid_model_names = ', '.join(model_paths.keys())
+         raise ValueError(f"Undefined base model '{name}'. Valid model names are: {valid_model_names}")
+
+
+ def load_dataset(names, from_remote=False):
+     """
+     Load one or multiple datasets based on the provided names and source location.
+
+     Args:
+         names (str): A comma-separated list of dataset names. Each name can be followed by '*n' to indicate replication.
+         from_remote (bool): If True, load the dataset from Hugging Face's model hub. Otherwise, load it from a local disk.
+
+     Returns:
+         List[Dataset]: A list of loaded datasets. Each dataset is possibly replicated based on the input names.
+     """
+     # Split the dataset names by commas for handling multiple datasets
+     dataset_names = names.split(',')
+     dataset_list = []
+
+     for name in dataset_names:
+         # Initialize replication factor to 1
+         replication_factor = 1
+         dataset_name = name
+
+         # Check if the dataset name includes a replication factor
+         if '*' in name:
+             dataset_name, replication_factor = name.split('*')
+             replication_factor = int(replication_factor)
+             if replication_factor < 1:
+                 raise ValueError("Replication factor must be a positive integer.")
+
+         # Construct the correct dataset path or name based on the source location
+         dataset_path_or_name = ('FinGPT/fingpt-' if from_remote else 'data/fingpt-') + dataset_name
+         if not os.path.exists(dataset_path_or_name) and not from_remote:
+             raise FileNotFoundError(f"The dataset path {dataset_path_or_name} does not exist.")
+
+         # Load the dataset
+         try:
+             tmp_dataset = datasets.load_dataset(dataset_path_or_name) if from_remote else datasets.load_from_disk(
+                 dataset_path_or_name)
+         except Exception as e:
+             raise RuntimeError(f"Failed to load the dataset: {str(e)}")
+
+         # Check for 'test' split and create it from 'train' if necessary
+         if 'test' not in tmp_dataset:
+             if 'train' in tmp_dataset:
+                 tmp_dataset = tmp_dataset['train']
+                 tmp_dataset = tmp_dataset.train_test_split(test_size=0.2, shuffle=True, seed=42)
+             else:
+                 raise ValueError("The dataset must contain a 'train' or 'test' split.")
+
+         # Append the possibly replicated dataset to the list
+         dataset_list.extend([tmp_dataset] * replication_factor)
+
+     return dataset_list