Excel is an incredibly powerful tool for data management, but exporting that data into a universally compatible format like CSV (Comma Separated Values) often leads to frustration. This is particularly true for users dealing with languages that utilize special characters, such as Vietnamese, Japanese, or French. A standard conversion often results in garbled text or question marks replacing your essential data.
This guide provides a comprehensive walkthrough on how to correctly export Excel files to CSV while preserving Unicode characters. whether you are using Excel 2010, 2016, or the latest Office 365 version, these methods will ensure your data integrity remains intact for import into databases, Outlook, or legacy systems.
The Standard Conversion Method (And Its Limitations)
For simple English-only data, the native Excel save feature works perfectly. However, it is important to understand the process before diving into the advanced UTF-8 methods.
To convert a standard file:
- Open the Excel workbook you wish to convert.
- Navigate to the File tab and select Save As (or press F12 for a shortcut).
Excel file menu showing Save As option selected
- In the Save as type dropdown menu, select *CSV (Comma delimited) (.csv)**.
Excel offers several CSV variations:
- CSV (Comma delimited): The standard format for Windows applications.
- CSV (Macintosh): Optimized for macOS legacy systems.
- CSV (MS-DOS): A legacy command-line format.
Dropdown menu in Excel showing CSV Comma delimited option
- Select your destination folder and click Save.
When you do this, Excel will likely throw two warning dialog boxes. The first warns that only the active sheet will be saved. The second warns that some features (like formatting, formulas, and bold text) will be lost in the CSV format. Click Yes to proceed.
The Problem: This standard method typically uses ANSI encoding. If your file contains Vietnamese names (e.g., “Nguyễn”) or other non-ASCII characters, they may turn into “Nguy?n” or other corrupted strings once opened in another application. To fix this, we must enforce UTF-8 encoding.
The Ultimate Solution: Using Notepad to Force UTF-8
The most reliable way to preserve special characters when exporting from Excel is to use an intermediate step involving Windows Notepad. This method works by saving the raw data as Unicode text first, which supports all character sets, and then converting that text structure into a CSV format.
Step 1: Save as Unicode Text
Instead of saving directly as CSV, we will first save the file as a text file that supports foreign characters.
- In Excel, go to File > Save As.
- In the “Save as type” dropdown, select *Unicode Text (.txt)**.
- Save the file to your computer.
Excel Save As dialog box selecting Unicode Text format
Step 2: Replace Tabs with Commas
The Unicode Text format separates data columns with “Tabs” instead of commas. To make it a true CSV, we need to swap these.
- Close Excel.
- Open the newly created
.txtfile using Notepad (Right-click > Open with > Notepad). - Highlight a blank space that represents a “Tab” character (the space between two columns of data) and copy it (Ctrl+C).
- Press Ctrl+H to open the Find and Replace dialog.
- In the Find what field, paste the Tab character you just copied (or press Ctrl+V).
- In the Replace with field, type a single comma (
,). - Click Replace All.
Notepad Find and Replace dialog replacing tabs with commas
Step 3: Save with UTF-8 Encoding
This is the most critical step. You must strictly define the encoding when saving the final file.
- In Notepad, go to File > Save As.
- Look for the Encoding dropdown menu at the bottom of the dialog box. It usually defaults to “ANSI”. Change this to UTF-8.
- In the File name field, change the extension from
.txtto.csv(e.g.,data.csv). - Click Save.
Save As dialog in Notepad with UTF-8 encoding selected
You now have a fully compatible CSV file that retains all Vietnamese accents and special symbols.
Alternative Method: Using Google Sheets
If the Notepad method feels too manual, Google Sheets offers an excellent cloud-based alternative. Google’s native environment handles UTF-8 encoding automatically, making it a seamless converter.
Steps to Convert:
- Open Google Sheets in your browser and create a new blank spreadsheet.
- Go to File > Import and upload your original Excel file.
- Once the data is loaded, verify that all special characters are displaying correctly.
- Go to File > Download > Comma Separated Values (.csv).
Google Sheets File menu downloading as Comma Separated Values
The file will download immediately with correct UTF-8 encoding, ready for use in any other application. This method is particularly useful if you are working on a Mac where Windows Notepad is not available, or if you need to share the file quickly with a team.
Verification: Checking Your Data
After converting your file using either method, it is best practice to verify the integrity of the data before uploading it to a database or CRM.
Do not just double-click the CSV file to open it in Excel, as Excel might attempt to re-interpret the encoding and display it incorrectly again. Instead:
- Right-click the new CSV file.
- Select Open with > Notepad (or any plain text editor like Sublime Text or Notepad++).
- Visually inspect the data to ensure characters like
ư,ô,ê, or Kanji characters are legible and not replaced by weird symbols.
CSV file opened in Notepad showing correct character display
Conclusion
Converting Excel to CSV seems like a basic task, but for users in Vietnam or those handling international data, the default settings in Excel often fall short. By utilizing the “Unicode Text” workaround with Notepad or leveraging the cloud capabilities of Google Sheets, you can ensure 100% data accuracy.
Choosing the right method depends on your workflow. The Notepad method is excellent for offline privacy and strictly controlled environments, while Google Sheets offers speed and convenience. Mastering these simple techniques will save you hours of data cleaning and formatting errors in the future.
References
- Microsoft Support: Save a workbook to text format (.txt or .csv).
- Google Editors Help: Import and export file sets.
- RFC 4180: Common Format and MIME Type for Comma-Separated Values (CSV) Files.










Discussion about this post