Задать вопрос

WarGot

7

вклад
11

вопросов
25

ответов
16%

решений

Ответы пользователя по тегу Кодировка символов

Как определить кодировку строки на PHP?

WarGot @WarGot

Как я понял Вы про curl и страницу которую парсим. Попробуй вот это, тут перекодировка в utf8, нужные куски выдрать тебе думаю будет не проблема

<?php

/**
 * @author 
 * @copyright 2014
 */

function curl_exec_utf8($ch) {
    $data = curl_exec($ch);
    if (!is_string($data)) return $data;

    unset($charset);
    $content_type = curl_getinfo($ch, CURLINFO_CONTENT_TYPE);

    /* 1: HTTP Content-Type: header */
    preg_match( '@([\w/+]+)(;\s*charset=(\S+))?@i', $content_type, $matches );
    if ( isset( $matches[3] ) )
        $charset = $matches[3];

    /* 2: <meta> element in the page */
    if (!isset($charset)) {
        preg_match( '@<meta\s+http-equiv="Content-Type"\s+content="([\w/]+)(;\s*charset=([^\s"]+))?@i', $data, $matches );
        if ( isset( $matches[3] ) )
            $charset = $matches[3];
    }

    /* 3: <xml> element in the page */
    if (!isset($charset)) {
        preg_match( '@<\?xml.+encoding="([^\s"]+)@si', $data, $matches );
        if ( isset( $matches[1] ) )
            $charset = $matches[1];
    }

    /* 4: PHP's heuristic detection */
    if (!isset($charset)) {
        $encoding = mb_detect_encoding($data);
        if ($encoding)
            $charset = $encoding;
    }

    /* 5: Default for HTML */
    if (!isset($charset)) {
        if (strstr($content_type, "text/html") === 0)
            $charset = "ISO 8859-1";
    }

    /* Convert it if it is anything but UTF-8 */
    /* You can change "UTF-8"  to "UTF-8//IGNORE" to 
       ignore conversion errors and still output something reasonable */
    if (isset($charset) && strtoupper($charset) != "UTF-8")
        $data = iconv($charset, 'UTF-8', $data);

    return $data;
}

?>

Ответ написан более трёх лет назад

2 комментария

Самые активные сегодня

Daemon23RUS
- 3 ответа
- 0 вопросов
dim5x
- 2 ответа
- 0 вопросов
Талян
- 1 ответ
- 0 вопросов
traffic-plumber
- 1 ответ
- 0 вопросов
Сергей delphinpro
- 1 ответ
- 0 вопросов
Ivan_shev
- 1 ответ
- 0 вопросов

Как определить кодировку строки на PHP?

Войдите на сайт