👀 Running F5-TTS on a Refurbished Mac Studio M2 Ultra: A Setup Guide

Date Created: 2025-05-08
By: 16BitMiker

Setting up a high-performance text-to-speech system like F5-TTS on Apple Silicon might seem daunting, but with the right steps, it’s surprisingly smooth. I recently configured F5-TTS on a newly refurbished Mac Studio with the M2 Ultra chip, and I’m here to walk you through the entire installation process: what worked, what didn’t, and how to get everything running beautifully on macOS.

Let’s dive in. 🧠🔊

📋 What Is F5-TTS?

F5-TTS is a state-of-the-art, non-autoregressive text-to-speech system developed by SWivid. It uses flow matching and Diffusion Transformers (DiT) to synthesize high-quality speech quickly and efficiently. It supports real-time inference, multilingual output, and even voice cloning, all in an open-source package.
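To make "flow matching" a little less abstract, here is a toy sketch of the sampling side of the idea: start from noise and follow a velocity field from t=0 to t=1 with plain Euler steps. Everything in it is illustrative. The function names are made up for this post, and the hand-written velocity field stands in for what would be a trained DiT network in the real model, so treat it as a picture of the technique rather than F5-TTS code.

```python
def euler_sample(velocity, x0, steps=8):
    """Integrate dx/dt = velocity(x, t) from t=0 to t=1 with Euler steps."""
    x = list(x0)
    dt = 1.0 / steps
    for k in range(steps):
        t = k * dt
        v = velocity(x, t)
        # Every position is updated in the same step: nothing is generated
        # left-to-right, which is the "non-autoregressive" part.
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x


def make_toy_velocity(target):
    """Hypothetical stand-in for a trained network: a field that moves any
    point toward `target` along a straight line, arriving at t=1."""
    def velocity(x, t):
        return [(ti - xi) / (1.0 - t) for ti, xi in zip(target, x)]
    return velocity


target = [0.5, -1.0, 2.0]   # pretend "speech features"
noise = [0.1, 0.9, -0.3]    # pretend starting noise
sample = euler_sample(make_toy_velocity(target), noise, steps=8)
print(sample)
```

The whole vector moves together over a handful of steps, which is a big part of why this family of models can be fast at inference time.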

πŸ”οΈ System Prep: Starting Fresh on macOS M2 Ultra

Here’s the software and hardware context:

Let’s break it down step by step.

πŸ“¦ Step-by-Step Installation

βœ… Step 1: Install Homebrew and Core Dependencies

βœ… Step 2: Clone the Repository

βœ… Step 3: Set Up a Virtual Environment

βœ… Step 4: Install the Package

F5-TTS uses a pyproject.toml, so install it in editable mode:

✅ Step 5: Install PyTorch with MPS (Metal Performance Shaders) Support

Then verify GPU support:
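A minimal sketch of such a check, assuming PyTorch is installed in the active environment. The helper name `detect_device` is my own; the two calls it relies on, `torch.backends.mps.is_built()` and `torch.backends.mps.is_available()`, are part of PyTorch's MPS backend.

```python
def detect_device():
    """Return 'mps', 'cpu', or 'torch-missing' for the current environment."""
    try:
        import torch
    except ImportError:
        # PyTorch is not installed in this interpreter's environment.
        return "torch-missing"

    mps = getattr(torch.backends, "mps", None)
    # is_built(): this PyTorch build includes MPS support.
    # is_available(): the running macOS and hardware can actually use it.
    if mps is not None and mps.is_built() and mps.is_available():
        return "mps"
    return "cpu"


print(detect_device())
```

The script prints `mps` only when both checks pass. If it prints `cpu` on an Apple Silicon machine, the usual suspects are an x86 Python running under Rosetta or an outdated PyTorch build.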

✅ Output:

🔽 Step 6: Download Pretrained Model Checkpoints

📝 Note: This is a ~1.3 GB file and may take anywhere from a few seconds to several minutes depending on your internet speed.
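For a rough sense of the wait, the arithmetic is just file size divided by link speed. The sketch below assumes an ideal connection with no protocol overhead, and the link speeds are example values rather than measurements.

```python
def estimate_download_seconds(size_gb, link_mbps):
    """Ideal transfer time: size in gigabytes, link speed in megabits/s."""
    size_bits = size_gb * 1e9 * 8          # GB -> bits
    return size_bits / (link_mbps * 1e6)   # bits / (bits per second)


for mbps in (50, 100, 1000):  # example link speeds, not measurements
    seconds = estimate_download_seconds(1.3, mbps)
    print(f"{mbps} Mbit/s: about {seconds:.0f} s")
```

At 100 Mbit/s that works out to roughly 104 seconds, so "a few seconds" really only applies to gigabit-class connections.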

▶️ Step 7: Run the Inference CLI

First, grab a sample reference voice:

Then run the inference:

✅ Result:

🌐 Bonus: Run the Gradio Web UI

Want a user-friendly way to test your voices?

Then open your browser and visit:

👉 http://localhost:7860
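If the page doesn't load, a quick way to tell "the server isn't running" apart from "the browser is confused" is to test the port directly. This is a small sketch using only the standard library. It assumes the UI is on Gradio's default port 7860, and `is_port_open` is a helper name I made up.

```python
import socket


def is_port_open(host="127.0.0.1", port=7860, timeout=1.0):
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False


print("Gradio reachable:", is_port_open())
```

A `False` here usually means the Gradio process exited or bound to a different port, so the terminal where you launched it is the next place to look.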

You’ll get a full-featured web interface with:

🔧 Troubleshooting and Notes

Python Compatibility

F5-TTS officially recommends Python 3.10 due to some library constraints. I used Python 3.13 without issues, but if you run into errors (especially with librosa or numpy), consider downgrading.
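When chasing one of these errors, it helps to know exactly which interpreter and library versions you're running. The sketch below reports both using only the standard library. The helper names are my own, and the package list is an assumption based on the libraries mentioned above.

```python
import sys
from importlib import metadata

RECOMMENDED = (3, 10)  # the version recommended above


def version_note(version=None):
    """Compare a (major, minor) tuple, or the running Python, to 3.10."""
    if version is None:
        version = sys.version_info
    major, minor = version[:2]
    if (major, minor) == RECOMMENDED:
        return f"Python {major}.{minor}: matches the recommended version"
    if (major, minor) > RECOMMENDED:
        return f"Python {major}.{minor}: newer than recommended, watch for library errors"
    return f"Python {major}.{minor}: older than recommended"


def installed_versions(packages=("numpy", "librosa", "torch")):
    """Map each package name to its installed version, or None if absent."""
    found = {}
    for name in packages:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


print(version_note())
print(installed_versions())
```

Run it inside the activated virtual environment, otherwise it reports on the system Python instead of the one F5-TTS is using.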

Missing Models

If inference fails with a checkpoint error, double-check your directory:
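One way to do that from Python is to walk the folder and list anything that looks like a checkpoint along with its size. This is a sketch under assumptions: the `ckpts` folder name and the file suffixes are placeholders, so point it at wherever you actually saved the model.

```python
from pathlib import Path


def find_checkpoints(root, suffixes=(".safetensors", ".pt")):
    """Return (path, size_in_GB) pairs for checkpoint-like files under root."""
    root = Path(root).expanduser()
    if not root.is_dir():
        return []
    hits = []
    for path in sorted(root.rglob("*")):
        if path.is_file() and path.suffix in suffixes:
            hits.append((path, path.stat().st_size / 1e9))
    return hits


# "ckpts" is a placeholder; use your own download location.
for path, size_gb in find_checkpoints("ckpts"):
    print(f"{path}  ({size_gb:.2f} GB)")
```

An empty listing means the path is wrong or the download never landed there. A file far smaller than the expected ~1.3 GB points to an interrupted download.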

Missing CLI Commands

If f5-tts_infer-cli or f5-tts_infer-gradio doesn't work, try running the Python modules directly:
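Here is a small sketch of that fallback logic: use the console script if it's on your PATH, otherwise build a `python -m` command for the matching module. The module paths `f5_tts.infer.infer_cli` and `f5_tts.infer.infer_gradio` are my assumption about the package layout, so check them against your checkout. `launch_command` is a hypothetical helper name.

```python
import shutil
import sys

# Console script -> module to run with `python -m` (module paths assumed).
ENTRY_POINTS = {
    "f5-tts_infer-cli": "f5_tts.infer.infer_cli",
    "f5-tts_infer-gradio": "f5_tts.infer.infer_gradio",
}


def launch_command(script, entry_points=ENTRY_POINTS):
    """Prefer the installed console script; else fall back to python -m."""
    found = shutil.which(script)
    if found:
        return [found]
    return [sys.executable, "-m", entry_points[script]]


# Console scripts live in the virtual environment's bin folder, so an
# inactive venv is a common reason they go "missing".
print("venv active:", sys.prefix != sys.base_prefix)
for script in ENTRY_POINTS:
    print(script, "->", " ".join(launch_command(script)))
```

If it reports that the venv is not active, reactivating it is usually all it takes to bring the commands back.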

🔁 Relaunching F5-TTS After a System Restart

If you’ve rebooted your Mac or closed your terminal and want to relaunch F5-TTS, just follow these quick steps:

  1. ✅ Open Terminal and navigate to your project:

  2. ✅ Reactivate your virtual environment:

  3. ✅ (Optional) Verify access to CLI tools:

  4. ✅ Run inference again:

  5. ✅ Or launch the Gradio interface:

🧠 Pro Tip: If you frequently reboot, consider adding an alias to your .zshrc:

Then you can just type:

And you're ready to roll. 🚀

👥 What Worked Well

✅ PyTorch MPS support was plug-and-play
✅ CLI tools launched without issue
✅ Model and reference audio integration was seamless
✅ The Gradio UI worked out-of-the-box
✅ Voice cloning was fast and expressive

🚀 Final Thoughts

Setting up F5-TTS on an Apple Silicon Mac Studio was not only possible, it was enjoyable. Performance was smooth, and the installation process was relatively painless thanks to modern Python packaging and MPS support.

If you’re looking to explore real-time voice synthesis, prototype audiobooks, or build a personalized voice assistant, F5-TTS is a solid open-source starting point.
