데이터 과학 multiclass-classification

6

3 클래스로 멀티 클래스 분류 설정을 시도하고 있습니다. 클래스 분포는 대부분의 데이터가 3 개 클래스 중 1 개에 해당하므로 왜곡됩니다. (클래스 레이블은 1,2,3이며, 데이터의 67.28 %가 클래스 레이블 1에, 클래스 2에 11.99 %, 클래스 3에 남아 있음) 이 데이터 세트에서 다중 클래스 분류기를 훈련하고 있으며 다음과 같은 성능을 얻습니다. Precision …

102 multiclass-classification evaluation

4

XGBoost를 통한 불평형 멀티 클래스 데이터

이 배포판에는 3 개의 수업이 있습니다. Class 0: 0.1169 Class 1: 0.7668 Class 2: 0.1163 그리고 xgboost분류에 사용 하고 있습니다. 라는 매개 변수가 있음을 알고 scale_pos_weight있습니다. 그러나 '멀티 클래스'의 경우 어떻게 처리되며 어떻게 올바르게 설정할 수 있습니까?

20 classification xgboost multiclass-classification unbalanced-classes

1

몇 개의 LSTM 셀을 사용해야합니까?

사용해야하는 최소, 최대 및 "합리적인"양의 LSTM 셀과 관련된 경험 법칙 (또는 실제 규칙)이 있습니까? 특히 TensorFlow 및 속성의 BasicLSTMCell 과 관련이 num_units있습니다. 분류 문제가 다음과 같이 정의되었다고 가정하십시오. t - number of time steps n - length of input vector in each time step m - length of output vector …

12 rnn machine-learning r predictive-modeling random-forest python language-model sentiment-analysis encoding machine-learning deep-learning neural-network dataset caffe classification xgboost multiclass-classification unbalanced-classes time-series descriptive-statistics python r clustering machine-learning python deep-learning tensorflow machine-learning python predictive-modeling probability scikit-learn svm machine-learning python classification gradient-descent regression research python neural-network deep-learning convnet keras python tensorflow machine-learning deep-learning tensorflow python r bigdata visualization rstudio pandas pyspark dataset time-series multilabel-classification machine-learning neural-network ensemble-modeling kaggle machine-learning linear-regression cnn convnet machine-learning tensorflow association-rules machine-learning predictive-modeling training model-selection neural-network keras deep-learning deep-learning convnet image-classification predictive-modeling prediction machine-learning python classification predictive-modeling scikit-learn machine-learning python random-forest sampling training recommender-system books python neural-network nlp deep-learning tensorflow python matlab information-retrieval search search-engine deep-learning convnet keras machine-learning python cross-validation sampling machine-learning

1

텍스트 분류 문제 : Word2Vec / NN이 최선의 방법입니까?

텍스트 단락을 제공하여 시스템을 분류하고 컨텍스트를 식별 할 수있는 시스템을 설계하려고합니다. 사용자 생성 텍스트 단락 (댓글 / 질문 / 응답 등)으로 훈련 트레이닝 세트의 각 항목은로 태그됩니다. 예를 들어 ( "카테고리 1", "텍스트 단락") 수백 가지 범주가 있습니다 그러한 시스템을 구축하는 가장 좋은 방법은 무엇입니까? 몇 가지 다른 옵션을 살펴본 …

10 neural-network text-mining multiclass-classification word-embeddings

2

마지막 레이어에서 Keras Multiple“Softmax”가 가능합니까?

Keras의 마지막 레이어에서 다중 소프트 맥스를 구현할 수 있습니까? 따라서 노드 1-4의 합은 1입니다. 5-8 = 1; 기타 다른 네트워크 디자인을해야합니까?

10 machine-learning keras multiclass-classification

4

SGDClassifier : 이전에 알려지지 않은 레이블이있는 온라인 학습 / partial_fit

내 훈련 세트에는 초기 학습에 사용되는 약 50 만 개의 항목이 포함되어 있습니다. 매주 ~ 5k 개의 항목이 추가됩니다. 그러나 동일한 양이 "사라집니다"(시간이 지나면 삭제해야하는 사용자 데이터이므로). 따라서 나중에 전체 데이터 세트에 액세스 할 수 없으므로 온라인 학습을 사용합니다. 현재 SGDClassifier작동 하는 것을 사용하고 있지만 큰 문제는 새로운 범주가 나타나고 …

9 multiclass-classification online-learning

1

멀티 클래스 데이터 세트에서 분류가 잘못되는 불균형 데이터

39 개의 카테고리 / 클래스와 850 만 개의 레코드가있는 텍스트 분류 작업을하고 있습니다. (향후 데이터 및 카테고리가 증가 할 것입니다). 내 데이터의 구조 또는 형식은 다음과 같습니다. ---------------------------------------------------------------------------------------- | product_title | Key_value_pairs | taxonomy_id | ---------------------------------------------------------------------------------------- Samsung S7 Edge | Color:black,Display Size:5.5 inch,Internal | 211 Storage:128 GB, RAM:4 GB,Primary Camera:12 …

9 machine-learning python classification scikit-learn multiclass-classification

«multiclass-classification» 태그된 질문