15,321 questions
-4
votes
1
answer
71
views
Is the source code representation in the Go Programming Language the same as in Git?
git(1) uses one representation when storing source code (not binary) by default which IIRC is utf-8 (no working-tree-encoding). Is this the same default as in Go?
I tried comparing to Git but have ...
3
votes
1
answer
178
views
Why do some web browsers incorrectly display currencies produced by NumberFormatter
Why do some older web browsers (not all) incorrectly display currency symbols when the symbols are produced by PHP's NumberFormatter() but display the correct symbol when it is given as an HTML entity? Since correct symbol ...
1
vote
1
answer
69
views
Spring Boot web Polish characters encoding problem
In a Spring Boot 2.6.0 / Java 8 application, the configuration class has the following content:
@Configuration
public class Internationalization implements WebMvcConfigurer {
// localeResolver, localeChangeInterceptor,...
Advice
0
votes
2
replies
104
views
Force all character columns in a list of data frames to UTF-8 before uploading to a UTF-8-only database
I have a list of two tables in R.
Each data frame contains several character and numeric columns. One of the columns is a company name column (for example, Company_Name).
The target database only ...
0
votes
0
answers
39
views
GitLab CI Allure Report Shows Garbled Characters in Console Output
I'm experiencing intermittent issues with Allure reports in GitLab CI where the console log occasionally displays garbled/corrupted characters instead of readable output.
This doesn't happen ...
20
votes
4
answers
2k
views
Why doesn't printing a Unicode character with wprintf work?
I made a small C program that should print an emoji:
#include <stdio.h>
#include <windows.h>
int main(void) {
SetConsoleOutputCP(CP_UTF8);
printf("\U0001F625\n"); // 😥
...
0
votes
2
answers
126
views
Parsing UTF-8 XML using DefaultHandler: when / how does it become UTF-16 in Java?
I have a Java program that was working perfectly in Corretto 17, but is now having character set encoding issues in Corretto 25.
I am reading a UTF-8 encoded XML from an external API. The code is ...
4
votes
1
answer
113
views
Are spelling variations of encoding identifiers for "setlocale" standardized or documented?
This question has to do with syntactic conventions for string encoding identifiers in locale names passed to setlocale in C, focusing on the particular example of UTF-8. My preliminary observation is ...
0
votes
0
answers
88
views
Avoid encoding emojis when using yq --prettyPrint
When I convert my JSON file to YAML, I want any Unicode characters to be part of the content and not escaped with backslashes and quotes. Example: This is my file.json
What can I do in order to make YQ not ...
0
votes
0
answers
47
views
How to show GBK encoded header in Mac version Unity editor properly?
I work on my project on both Mac and Windows. Some editor headers were written in Chinese in Visual Studio on Windows and were encoded in GBK (the default, I guess). However, they don't show properly on Mac ...
1
vote
0
answers
42
views
How can I know what encoding to use when reading & writing CFDataRef entries to macOS Keychain via the C-based SecItem API?
I'm writing FFI bindings to macOS' C-based Keychain Services API for use in a plugin.
The Keychain Services API takes CFDataRef values, which allows storing arbitrary bytes to the Keychain, but the ...
-1
votes
1
answer
51
views
How to pass special chars to nodemailer?
I want to pass a string to nodemailer so that it results in
=C2=A0
i tried to pass in both 'Â ' (the literal chars), "\xC2\xA0", etc.
But they always result in =C3=A9
A longer example in ...
3
votes
1
answer
122
views
How to efficiently split a text file with an arbitrary Charset without damaging code points?
Given a valid text file file and its java.nio.charset.Charset how can I efficiently (preferably using RandomAccessFile.seek() or InputStream.skip(), without reading the whole file) split it into two ...
-5
votes
1
answer
122
views
Swift: text/string vs. raw bytes? [closed]
I gave the following Swift code to ChatGPT:
let data = text.data(using: .utf8)
It answered me:
"This line takes your string and turns it into raw bytes (data) that can be stored, sent over the ...
0
votes
1
answer
144
views
Why does Wikipedia claim UTF-16 is obsolete when Javascript uses it?
The Wikipedia page for UTF-16 claims that it is obsolete, saying:
UTF-16 is the only encoding (still) allowed on the web that is incompatible with 8-bit ASCII. However it has never gained popularity ...
1
vote
0
answers
32
views
Clipper (CP437) Character Display Issue on AlmaLinux 9.6 Minimal Server Terminal [duplicate]
I'm working with an AlmaLinux 9.6 Minimal server that hosts a Clipper-programmed system. This Clipper system uses the CP437 character set.
I've successfully configured client computers to display the ...
0
votes
0
answers
83
views
How to print Greek letters in C++ [duplicate]
Recently, I have been working on a small program that requires me to print messages in Greek. Nothing fancy, just printing Greek sentences in the terminal. For example:
#include <iostream>
int ...
0
votes
0
answers
46
views
Accents replaced with a strange character in Symfony 6.4 [duplicate]
For the past 3 days, characters with French accents have no longer appeared and are replaced by a �.
Everything was fine before.
In phpMyAdmin the display is correct.
It's only on the remote server; locally all ...
3
votes
1
answer
150
views
Python subprocess: Trouble receiving non-ASCII characters from output of yt-dlp
(Windows, Python 3.9.6, yt-dlp 2025.06.09)
I have a YouTube playlist whose titles contain both Korean and Latin characters. I can print the titles of the videos in the playlist using the below command:...
0
votes
0
answers
76
views
How to tell if these two identical-looking but differently encoded strings are the same? [duplicate]
I'm trying to import some data downloaded from Google Sheets.
The tab, when editing on the website, is called "Kodály". In Ruby, if I look at the individual characters, I see this:
>> ...
0
votes
1
answer
52
views
glib iconv() - force conversion to single bytes
I have my own personal movie database system, within which context I NEVER want to see "extended" characters (with accents, umlauts, etc.) in any text fields.
MS Copilot tells me that I ...
1
vote
0
answers
180
views
How to set meta charset to utf-16 instead of utf-8 in Next.js?
I'm working on a Next.js application (currently using version 15.3.2) and I'm looking to ensure the charset meta tag is correctly set in the HTML head. Typically, in a standard HTML file, I would just ...
0
votes
0
answers
34
views
Problem with character encoding on apache server while receiving a Stripe webhook
I'm currently trying to implement a payment system using Stripe. After a payment, Stripe sends a webhook containing information about the purchase. However, before even a single line of PHP code is ...
1
vote
0
answers
290
views
Function can't handle different string encodings. If you are using non-ASCII characters, save the source file in the local encoding
I have a CAPL script written on Windows with a shebang at the beginning as follows:
/*@!Encoding:65001*/
and this warning message pops up with green highlighting on a TriggerPDU2() function.
Function ...
0
votes
0
answers
49
views
Emoji characters display as � in Gmail sent via Google Apps Script – how to fix encoding? [duplicate]
I'm using a Google Apps Script to send weekly parent updates from a Google Sheet. The email content is plain text, and I want to include emojis (🎯, 🧩, 📌) as section headers. However, when the ...
1
vote
1
answer
107
views
How to display non-English text that the user inputs in the console correctly?
The console doesn't correctly display non-English text when it is later output, even though the text displays correctly while the user is typing it.
Some of the non-English ...
0
votes
2
answers
89
views
Wrong encoding with Python decode in Windows command prompt
When I run decode on a byte string encoded as UTF-8 I get ANSI encoding in a Windows command prompt.
>python --version
Python 3.13.0
>python -c "print(b'\xc3\x96'.decode('utf-8'))" >...
2
votes
0
answers
73
views
CharsetDecoder with DirectByteBuffer and HeapByteBuffer performance difference
My use case is the following: reading DirectByteBuffers from the network and decoding them into UTF-8 strings. I’ve observed that using a UTF_8 CharsetDecoder with a DirectByteBuffer is 3–4 times ...
-2
votes
1
answer
73
views
Docker and MySQL 8.0 [closed]
I run a Docker container with a MySQL 8.0 database, connect to it, and check that the encoding is suitable for Cyrillic:
mysql > SHOW VARIABLES LIKE 'collation_database';
+--------------------+----------...
0
votes
1
answer
37
views
Google Search Console export not giving the same URL
I'm going through a list of URLs that Google has tried to index and that return 503. We have fixed a number of these, and I wanted to go through the list to test which are still left.
However, I ...
0
votes
2
answers
66
views
SSRS CSV export via PHP cURL returns incorrect encoding (e.g. "–" instead of "–"), but works when downloaded via browser
I'm using PHP with cURL to programmatically download SSRS (SQL Server Reporting Services) reports in CSV format. The download works, but the character encoding is off. For example:
Text that should be ...
3
votes
0
answers
149
views
Text encoding on Windows in C++
I updated GCC to the recently released version 15.1 (via Msys2, on Windows 10).
On cppreference, I noticed the page regarding the <text_encoding> header, which should be supported by GCC version ...
0
votes
0
answers
156
views
MLflow does not upload images stored using mlflow.log_image()
I am working with MLflow==2.19.0 on Red Hat Enterprise Linux Server release 7.9 (Maipo). Everything works fine except the log_image method, which for some reason is converting parts of the string ...
1
vote
0
answers
96
views
UnicodeDecodeError using PostgreSQL DB for DRF project on Windows
I'm working on a team DRF project that uses a PostgreSQL database; my local PostgreSQL version is 17. The problem is that all of my colleagues have Ubuntu OS on their devices (and the project ...
1
vote
1
answer
122
views
Why does Encoding.UTF8.GetMaxByteCount(1) return 6?
The TLDR here is simple: What's a sequence of chars that would make either UTF8's Encoding or Encoder return 6 (or even 5) bytes for a single char, as GetMaxByteCount implies it might?
The non-TLDR:
...
0
votes
1
answer
74
views
Copy Excel file xlsx encoding problems
I am trying to copy an xlsx file from a remote system within a Jenkins Groovy script. This xlsx file is encoded with windows-1252, so I give this Charset to the FileReader and FileWriter:
...
1
vote
1
answer
471
views
Handling UTF-8 Special Characters in GitLab CI/CD Pipeline on Windows
I'm creating a merge request in GitLab with a GitLab pipeline. I set special characters in .gitlab-ci.yml and pass them to a Python script, which creates the merge request.
But in the merge ...
1
vote
1
answer
49
views
How to recover original characters from data with broken characters?
We have a bunch of translations loaded in our database which are showing broken characters where there should be accented Latin characters like 'É'. I'm not sure where the encoding went wrong but we ...
0
votes
0
answers
40
views
PHP Dealing with Latin to UTF8 [duplicate]
It's a very old PHP problem, but I'm working with an old SQL database for a private project. The SQL database is Latin-encoded and has, for example, "Andr\xc3\xa9" representing André.
The ...
3
votes
0
answers
125
views
Why does setlocale cause fputs to produce garbled output in the Windows console?
My Windows system ANSI code page is GBK, and I am working on a Windows console program.
// SetConsoleOutputCP(65001); // Neither code page 936 nor 65001 affects the result.
char utf8[] = { 0xE4, 0xB8, ...
1
vote
1
answer
84
views
Why does fn:encode-for-uri('§') result in %C2%A7 rather than just %A7?
In Oxygen XML Editor 27.0, using the "XPath/XQuery Builder" (which, as far as I know, makes use of Saxon as XPath/XQuery processor), when I execute the XPath 2.0 query encode-for-uri('§'), I ...
0
votes
1
answer
66
views
Potential Encoding error in printed output while using R from SAS IML?
I am using R from SAS using proc iml. I want to import a dataset from SAS (in the example, sashelp.class) and print the first 5 rows.
options set=R_HOME="C:/PROGRA~1/R/R-44~1.2"; * Position ...
0
votes
1
answer
65
views
Converting string to data - What happens when the wrong encoding is used?
Let's say I've got a string with characters that don't exist in ASCII.
When I use the correct encoding everything works fine.
let example = "Testing, ÜÄÖ ?ß 123 ..."
let data = example....
-2
votes
1
answer
86
views
How do I encode a JavaScript file included in HTML? - I get `…` rather than `…`
Attempt:
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta http-equiv="Content-Type" content="text/html;charset=...
0
votes
1
answer
106
views
Parsing of CSV-File leads to "NBSP" and "SPA" Characters in Windows
I am parsing a CSV document to write a specification in SysML v2, which is essentially just a simple text file.
Parsing it on Linux I get the result I desire. I use a SysOn-Docker to display my SysML ...
0
votes
1
answer
92
views
load data stopped prematurely
I uploaded a file with 4 million rows to a table.
The table is a database of vehicle license plates along with all the characteristics of that vehicle (color, tire size, engine capacity, and more).
...
0
votes
1
answer
84
views
Turkish character problem in Node.js fs.readFile
I have an XML file that was created on Windows.
It has some Turkish characters in it.
<?xml version="1.0" encoding="iso-8859-9"?><?xml-stylesheet type="text/xsl" ?...
0
votes
2
answers
81
views
JSON with °(degree) sign / special character is not sent to API as expected
I am using the above code in Oracle to send a payload to an API. It works for every payload except when the comment field contains a degree sign (°).
/////////////////////////////////////////////////...
1
vote
1
answer
74
views
Postgres filenames with german umlaut
I would like to read some rtf files I received from an outside source into a table with Postgres.
I am using the pg_stat_file() function to check whether or not the file exists and aim to read its ...
0
votes
0
answers
103
views
Oracle SQL*Loader inserts ? instead of correct Unicode characters (JDBC works fine)
I am working with an Oracle database that uses the US7ASCII character set. When inserting Unicode characters using a Java JDBC program, the values are stored correctly, but when inserting the same ...