User's answers tagged Algorithms
  • How do you detect invalid clicks, click fraud, budget-draining clicks, and inflated impressions?

    @boss_lexa Question author
    I found a study.

    As I understand it, this is a classification task: fraud or not.

    Fraud signals

    Short sessions
    Low variance in observed behavior, e.g. time between sessions
    High click-through rate
    Very high session count but almost no click-throughs (in a short period of time)
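The signals above can be sketched as simple rule checks before any model is trained. This is only an illustration: the `UserStats` fields and every threshold below are assumptions made up for the example, not values from the study, and the "short period of time" window is assumed to be applied when the stats are collected.

```python
from dataclasses import dataclass
from statistics import pstdev


@dataclass
class UserStats:
    # Hypothetical per-user aggregates, collected over one short time window.
    session_count: int
    avg_session_time: float       # seconds
    click_thru_count: int
    gaps_between_sessions: list   # seconds between consecutive sessions


# Illustrative thresholds only (assumptions, not taken from the study).
SHORT_SESSION_S = 2.0
LOW_GAP_STDEV_S = 1.0
HIGH_CTR = 0.5
MANY_SESSIONS = 100
LOW_CTR = 0.01


def fraud_signals(u):
    """Return the names of the listed fraud signals that fire for one user."""
    signals = []
    ctr = u.click_thru_count / u.session_count if u.session_count else 0.0
    if u.session_count and u.avg_session_time < SHORT_SESSION_S:
        signals.append("short_sessions")
    # Near-constant gaps between sessions suggest scripted behavior.
    if len(u.gaps_between_sessions) >= 2 and pstdev(u.gaps_between_sessions) < LOW_GAP_STDEV_S:
        signals.append("low_variance")
    if ctr > HIGH_CTR:
        signals.append("high_ctr")
    if u.session_count > MANY_SESSIONS and ctr < LOW_CTR:
        signals.append("many_sessions_few_clicks")
    return signals
```

Rules like these are probably more useful as input features or sanity checks than as a final verdict, since the study treats the problem as classification.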


    Machine learning algorithms with the best accuracy and speed
    NaiveBayes, RandomForest and RIDOR
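To give a feel for the simplest of these, here is a minimal Gaussian Naive Bayes sketch in plain Python. It is a simplified stand-in, not the Weka `NaiveBayes` implementation the names above suggest, and the function names `fit` and `predict` are chosen for this example only.

```python
import math
from collections import defaultdict


def fit(rows, labels):
    """Estimate a prior and per-feature (mean, variance) for each class."""
    by_class = defaultdict(list)
    for row, label in zip(rows, labels):
        by_class[label].append(row)
    model = {}
    for label, members in by_class.items():
        stats = []
        for column in zip(*members):
            mean = sum(column) / len(column)
            var = sum((x - mean) ** 2 for x in column) / len(column)
            stats.append((mean, var + 1e-9))  # small constant avoids division by zero
        model[label] = (len(members) / len(rows), stats)
    return model


def predict(model, row):
    """Return the class with the highest log-posterior for one feature row."""
    best_label, best_score = None, -math.inf
    for label, (prior, stats) in model.items():
        score = math.log(prior)
        for x, (mean, var) in zip(row, stats):
            # Log of the Gaussian density, assuming independent features.
            score += -0.5 * math.log(2 * math.pi * var) - (x - mean) ** 2 / (2 * var)
        if score > best_score:
            best_label, best_score = label, score
    return best_label
```

Each row would hold numeric per-user features such as `avg_session_time`, and each label would be `legitimate` or `fraudulent`. For real work a tested library implementation is likely the better choice.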


    Data analyzed
    Total number of sessions
    Total number of click-throughs
    Distribution of time between sessions
    Number of sessions marked as spurious
    User IP address(es)
    Distribution of session, engagement and visibility times
    Distribution of time to first click-through and mouse-over
    Number of invalid sessions
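A rough sketch of how part of this data could be aggregated per user from raw session records. The input record fields (`start`, `duration`, `clicked`, `invalid`) are hypothetical, and reading the `_dev` attributes as a population standard deviation is an assumption; only the output names follow the attribute list in this answer.

```python
from statistics import mean, pstdev


def aggregate(sessions):
    """Aggregate one user's sessions into per-user features.

    sessions: list of dicts with hypothetical keys 'start' (seconds),
    'duration' (seconds), 'clicked' (bool) and 'invalid' (bool).
    """
    ordered = sorted(sessions, key=lambda s: s["start"])
    durations = [s["duration"] for s in ordered]
    # Time between consecutive session starts.
    gaps = [b["start"] - a["start"] for a, b in zip(ordered, ordered[1:])]
    return {
        "session_count": len(ordered),
        "tot_session_time": sum(durations),
        "avg_session_time": mean(durations) if durations else 0.0,
        "session_dev": pstdev(durations) if durations else 0.0,
        "click_thru_count": sum(1 for s in ordered if s["clicked"]),
        "invalid_count": sum(1 for s in ordered if s["invalid"]),
        "avg_time_between_sessions": mean(gaps) if gaps else 0.0,
        "time_between_sessions_dev": pstdev(gaps) if gaps else 0.0,
    }
```

The visibility, engagement, and mouse-over features would follow the same pattern once those timings are logged per session.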
    
    @RELATION user
    @ATTRIBUTE session_count NUMERIC
    @ATTRIBUTE tot_session_time NUMERIC
    @ATTRIBUTE avg_session_time NUMERIC
    @ATTRIBUTE session_dev NUMERIC
    @ATTRIBUTE tot_visibility_time NUMERIC
    @ATTRIBUTE avg_visibility_time NUMERIC
    @ATTRIBUTE visibility_dev NUMERIC
    @ATTRIBUTE avg_time_to_first_mouseover NUMERIC
    @ATTRIBUTE first_mouseover_dev NUMERIC
    @ATTRIBUTE tot_engagement_time NUMERIC
    @ATTRIBUTE avg_engagement_time NUMERIC
    @ATTRIBUTE engagement_dev NUMERIC
    @ATTRIBUTE spurious_count NUMERIC
    @ATTRIBUTE click_thru_count NUMERIC
    @ATTRIBUTE invalid_count NUMERIC
    @ATTRIBUTE avg_time_between_sessions NUMERIC
    @ATTRIBUTE time_between_sessions_dev NUMERIC
    @ATTRIBUTE avg_time_to_first_click_thru NUMERIC
    @ATTRIBUTE time_to_first_click_thru_dev NUMERIC
    @ATTRIBUTE click_count NUMERIC
    @ATTRIBUTE class {legitimate,fraudulent}
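The header above only declares attributes; an ARFF file also needs an `@DATA` section with one comma-separated row per user. A minimal sketch of a writer, assuming values contain no commas, spaces, or quotes (the helper name `to_arff` is made up for this example):

```python
def to_arff(relation, attributes, rows):
    """Serialize rows to ARFF text.

    attributes: list of (name, type) pairs, where type is the string
    'NUMERIC' or a list of nominal values such as ['legitimate', 'fraudulent'].
    """
    lines = [f"@RELATION {relation}", ""]
    for name, kind in attributes:
        spec = "{" + ",".join(kind) + "}" if isinstance(kind, list) else kind
        lines.append(f"@ATTRIBUTE {name} {spec}")
    lines += ["", "@DATA"]
    for row in rows:
        if len(row) != len(attributes):
            raise ValueError("row length does not match attribute count")
        # No quoting or escaping is done here (see the assumption above).
        lines.append(",".join(str(v) for v in row))
    return "\n".join(lines) + "\n"
```

The values in each row must appear in the same order as the `@ATTRIBUTE` lines, with the `class` label last.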
    
    user_id STRING,
    session_count INT,
    total_session_time INT,
    average_session_time INT,
    session_time_deviation INT,
    total_visibility_time INT,
    average_visibility_time INT,
    average_time_to_first_mouseover INT,
    total_engagement_time INT,
    average_engagement_time INT,
    spurious_sessions INT,
    click_throughs INT,
    invalid_sessions INT,
    average_time_between_sessions INT,
    average_time_to_first_clickthrough INT
