Thank you!
Built a golden hour tracker in under 15 minutes with Lovable: it uses your phone's Geolocation API and the SunCalc library, and runs fully client-side with no servers. https://goldenhour.404missing.link
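SunCalc does the sun math in the app; the core idea is just to flag times when the sun's elevation sits in a low band around the horizon. A rough Python sketch of that idea (the declination formula and the -4° to 6° band are common approximations I've chosen here, not the app's actual code):

```python
import math

def solar_elevation(lat_deg: float, day_of_year: int, solar_hour: float) -> float:
    """Approximate sun elevation in degrees at a latitude and local solar time."""
    # Common back-of-the-envelope solar declination approximation.
    decl = -23.44 * math.cos(math.radians(360.0 / 365.0 * (day_of_year + 10)))
    hour_angle = 15.0 * (solar_hour - 12.0)  # 15 degrees per hour from solar noon
    lat, d, h = (math.radians(x) for x in (lat_deg, decl, hour_angle))
    return math.degrees(
        math.asin(math.sin(lat) * math.sin(d) + math.cos(lat) * math.cos(d) * math.cos(h))
    )

def is_golden_hour(lat_deg: float, day_of_year: int, solar_hour: float) -> bool:
    """Golden hour, loosely: sun elevation between -4 and 6 degrees."""
    return -4.0 <= solar_elevation(lat_deg, day_of_year, solar_hour) <= 6.0
```

In the browser, the latitude comes from the Geolocation API and SunCalc handles the elevation math properly (refraction, equation of time, and so on).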
What actually drives the popularity of these papers? Why do some papers get zero upvotes while others get thousands?
The answer is: absolutely nothing. Nothing about the paper itself drives popularity; it is driven by external factors like its authors, external marketing, and so on.
So next time you see a research paper with a lot of upvotes, just remember it's not down to the efforts of the authors alone. Remain objective.
1️⃣ Faster ONNX and OpenVINO backends for SparseEncoder models
Usage is as simple as passing backend="onnx" or backend="openvino" when initializing a SparseEncoder. I also included utility functions for optimization, dynamic quantization, and static quantization, plus benchmarks.

2️⃣ New n-tuple-scores output format from mine_hard_negatives
This new output format is immediately compatible with MarginMSELoss and SparseMarginMSELoss for training SentenceTransformer, CrossEncoder, and SparseEncoder models.
3️⃣ Gathering across devices
When doing multi-GPU training with a loss that uses in-batch negatives (e.g. MultipleNegativesRankingLoss), you can now set gather_across_devices=True to use in-batch negatives from the other devices too! Essentially a free lunch, with big potential impact in my evals.

4️⃣ Trackio support
If you also upgrade transformers and install trackio with pip install trackio, your experiments will automatically be tracked locally with Trackio. Just open up localhost and have a look at your losses/evals: no logins, no metric uploading.
5️⃣ MTEB Documentation
We've added some documentation on evaluating SentenceTransformer models properly with MTEB. It's deliberately brief, since the documentation on the MTEB side is already great, but it should get you started.
Plus many more smaller features & fixes (crash fixes, compatibility with datasets v4, FIPS compatibility, etc.).
See the full release notes here: https://github.com/UKPLab/sentence-transformers/releases/tag/v5.1.0
Big thanks to all of the contributors for helping with the release, many of the features from this release were proposed by others. I have a big list of future potential features that I'd love to add, but I'm
openfree/OpenAI-gpt-oss
VIDraft/gpt-oss-RAG
Two Models, One Space!
GPT-OSS hit #1 on HF just 2 hours after release!
Now you can use both models conveniently in a single space.
Model Selection Made Easy!
Just pick from the dropdown:
├── GPT-OSS-120B (Complex tasks)
└── GPT-OSS-20B (Quick chats)
How to Use (Takes 30 seconds!)
1. Sign in → with your HF account
2. Select model → choose what you need
3. Apply → click!
4. Start chatting → that's it!
Perfect For:
120B → Deep analysis, professional work
20B → Fast responses, casual conversations
No installation needed: just use it in your browser!
Special Features
- Beautiful gradient UI
- Dark mode support
- Real-time model switching
- Completely free!
Try it now! It's really that simple!
#GPT-OSS #HuggingFace #FreeAI #EasyToUse
Made with late interaction. I'd love to recreate the dataset to see a proper Apache 2.0 version!
I'm using https://artificialanalysis.ai/ just because it puts everything in one place! It's not the best resource but these days I'm all about saving time.
@ThomasTheMaker if you make an issue on the repo, I'll look into it!
@ThomasTheMaker It's just the raw attention and transformer architecture in Go, designed for serverless, so performance will definitely be lower than ggml and llama.cpp since it isn't GPU-accelerated. But if you're into CPU-only edge AI, this is the first, only, and best way to compute attention.
Quantization can definitely be supported as it's just a math model!
We built this library at takara.ai to bring attention mechanisms and transformer layers to Go, in a form that's lightweight, clean, and dependency-free.
We're proud to say that every part of this project reflects what we set out to do.
- Pure Go: no external dependencies, built entirely on the Go standard library
- Core support for DotProductAttention and MultiHeadAttention
- Full transformer layers with LayerNorm, feed-forward networks, and residual connections
- Designed for edge, embedded, and real-time environments where simplicity and performance matter
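The mechanism itself is language-agnostic; as a quick illustration of what DotProductAttention computes (plain Python here, not the library's Go API, and the function names are mine):

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot_product_attention(query, keys, values):
    """Scaled dot-product attention for one query over a set of key/value rows."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    out = [0.0] * len(values[0])
    for w, row in zip(weights, values):
        for i, v in enumerate(row):
            out[i] += w * v
    return out
```

With identical keys the weights are uniform and the output is the mean of the values; a key that strongly matches the query dominates the mix.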
Thank you to everyone who has supported this so far: the stars, forks, and feedback mean a lot.
No abstracts, just bullet points.
Start your day here: https://tldr.takara.ai
This is a pretty big update for sure. The models have improved significantly, which is great for everyone involved, especially the end user. Those datasets look very promising as well!
Sounds interesting, I'll check it out!
This is a really interesting post. I've been looking at the DeepSeek models for sure. This shows a pretty nice improvement; I'd love to see some example changes!